Pricing
$8.99 / 1,000 results
Stanford University Scraper
Scrapes Stanford Profiles directory listings with pagination to gather profile URLs, then extracts detailed data; name, email, departments, affiliations, education, honors, bio, and more. Ideal for academic research, lead generation, or structured datasets.
Pricing
$8.99 / 1,000 results
Rating
0.0
(0)
Developer
Actor stats
2
Bookmarked
16
Total users
0
Monthly active users
18 days ago
Last modified
Categories
Share
Stanford Profiles Full Scraper
Combine directory crawling and profile-detail extraction in one goβno more juggling two actors.
π What it does β’ List phase: starts at your School of H&S directory page, follows real βnextβ links (or ?p=), and collects only valid profile URLs. β’ Detail phase: fetches each profile and pulls out name, email, departments, affiliations, education histories, honors, bio, and more. β’ Smart stops: if a page yields no new links, the crawl haltsβno phantom pages, no wasted cycles.
π οΈ Key features β’ All-in-one workflow: one actor, one dataset, zero hassle. β’ Pagination guardrails: leverages and βno-new-linksβ breaks to avoid endless loops. β’ Configurable concurrency: scrape dozens of profiles in parallel without overwhelming the server. β’ Flexible inputs: start from a URL, inject extra URLs by textarea or dataset ID, and cap list pages if you want.
π₯ Inputs
Field Type Default Notes startUrl String https://profiles.stanford.edu/browse/school-of-humanities-and-sciences?affiliations=capFaculty Directory page to kick off the crawl maxPages Integer 0 Max list pages (0 = unlimited) urlsText String "" Extra profile URLs (newline/comma separated) sourceDatasetId String "" Dataset ID whose items contain a url field maxConcurrency Integer 5 Worker threads for detail scraping
π€ Output
A single dataset of JSON records, e.g.:
{ "url": "https://profiles.stanford.edu/jane-doe", "name": "Jane Doe", "email": "jane.doe@stanford.edu", "departments": ["History"], "faculty": ["School of Humanities and Sciences"], "personalWebsite": "https://janedoe.com", "bio": "Professor of Historyβ¦", "academicAppointments": [ "Associate Professor (2020β)" ], "professionalEducation": [ "PhD, Stanford University (2015)" ], "honorsAwards": [ "Guggenheim Fellowship (2023)" ] }
π‘ Use cases β’ Academic research: map faculty expertise, collaborations, or department distributions. β’ Lead generation: gather contact info for outreach to professors or postdocs. β’ Data integration: power your internal CRM, directory, or AI pipeline with fresh Stanford data.
βΈ»
Got feedback or hitting a wall? Drop a note on the actorβs issue trackerβand happy scraping!
