VOOZH about

URL: https://apify.com/parseforge/y-combinator-scraper

โ‡ฑ Y Combinator Scraper - Startup Directory Data ยท Apify


Pricing

Pay per event

Go to Apify Store

Y Combinator Companies Scraper

Extract company profiles, founders, and open job listings from the Y Combinator directory. Filter by batch, industry, subindustry, region, and hiring status. Covers 5,700+ funded startups from W05 to the latest YC cohort. Includes growth stage, equity ranges, salary data, and contact emails.

Pricing

Pay per event

Rating

0.0

(0)

Developer

๐Ÿ‘ ParseForge

ParseForge

Maintained by Community

Actor stats

1

Bookmarked

28

Total users

12

Monthly active users

25 days ago

Last modified

Share

๐Ÿ‘ ParseForge Banner

๐Ÿš€ Y Combinator Companies Scraper

๐Ÿš€ Pull 5,700+ Y Combinator-funded startups in minutes. Companies, founders, batches, industries, open jobs. No API key, no manual CSV wrangling.

๐Ÿ•’ Last updated: 2026-05-08 ยท ๐Ÿ“Š 30+ fields per company ยท ๐ŸŽ“ W05 to current batch ยท ๐Ÿ’ผ Open jobs included ยท ๐Ÿšซ No auth required

Pull live company data from the Y Combinator directory, the canonical record of every YC-funded startup since the very first batch. The actor walks the YC catalog with your filter combination, paginates through results, fetches each company detail page, and returns one structured record per company ready for investor research, sales prospecting, lead-gen, or talent sourcing.

Every run fetches data live so you get the current state of the YC directory, not a stale dump. Records include logo URL, batch, batchName, growth stage, year founded, team size, location, founder names with bios, social handles, current job listings with salary and equity ranges, and a back-reference URL to the canonical YC profile.

๐Ÿ‘ฅ Built for๐ŸŽฏ Primary use cases
Venture capital and angelsTrack new YC batches as they launch
Sales and BD teamsBuild prospect lists of YC-funded startups
RecruitersSource candidates from YC company hiring pages
Founders and operatorsMap competitor landscape and funding signals
Researchers and journalistsStudy startup ecosystem trends across batches
BizDev and partnershipsIdentify integration partners by industry

๐Ÿ“‹ What the Y Combinator Companies Scraper does

  • ๐ŸŽ“ Filter by batch. Pass batch codes like W25, S25, X25, F25 or full names.
  • ๐Ÿญ Industry filters. B2B, Consumer, Healthcare, Fintech, Industrials, Real Estate, Education, Government.
  • ๐Ÿ”ฌ Sub-industries. Drill down into Payments, Drug Discovery, Engineering, Product and Design, etc.
  • ๐ŸŒ Region filters. USA, Europe, Latin America, South Asia, Southeast Asia, Africa, India, UK.
  • ๐Ÿ“Š Status and stage. Active, Public, Acquired, Inactive plus the YC growth stage.
  • ๐Ÿ’ผ Hiring filter. Return only companies with open job listings.
  • โญ Top Companies. Filter to YC's curated Top Companies list.

The scraper accepts any combination of these filters, builds the matching YC search URL, and walks the result pages. For each company it fetches the detail page to extract founders, social handles, job listings (with salary and equity ranges), and the full company description.

๐Ÿ’ก Why it matters: the YC directory is the canonical record of YC-funded startups but its UI is paginated, JS-rendered, and lacks bulk export. A live, structured pull beats manual sourcing for VC research, BD outreach, and recruiting at scale.


๐ŸŽฌ Full Demo

๐Ÿšง Coming soon: a 3-minute walkthrough showing setup, a live run, and how to pipe results into Salesforce or Airtable via Apify integrations.


โš™๏ธ Input

FieldTypeNameDescription
startUrlsarrayCompany URLsSpecific YC company URLs (e.g. https://www.ycombinator.com/companies/airbnb). When provided, all other filters are ignored.
maxItemsintegerMax CompaniesFree users: limited to 10 items (preview). Paid users: optional, max 1,000,000.
querystringSearch QueryFull-text search across name, description, keywords.
batchesarrayBatchesBatch codes (W25, S25, X25, F25) or full names.
industriesarrayIndustriesTop-level industry tags.
subindustriesarraySubindustriesDrill-down sub-industry tags.
regionsarrayRegionsGeographic region filters.
companyStatusenumCompany StatusActive, Public, Acquired, Inactive.
isHiringbooleanHiring OnlyOnly companies with open job listings.
nonprofitbooleanNonprofits OnlyOnly nonprofit YC companies.
topCompaniesOnlybooleanTop Companies OnlyOnly YC's curated Top Companies list.

Example 1. Hiring fintech startups from W25, USA only.

{
"batches":["W25"],
"industries":["Fintech"],
"regions":["United States of America"],
"isHiring":true,
"maxItems":50
}

Example 2. Direct lookup of two specific YC companies.

{
"startUrls":[
"https://www.ycombinator.com/companies/airbnb",
"https://www.ycombinator.com/companies/stripe"
],
"maxItems":2
}

โš ๏ธ Good to Know: when startUrls is set, every other filter is ignored. Use it for ad-hoc enrichment of known YC companies.


๐Ÿ“Š Output

The dataset returns one structured record per YC company. Each record carries identifiers, batch metadata, growth stage, location, team size, founders, social handles, open job listings, and a back-reference URL. Consume the dataset as JSON, CSV, Excel, XML, or RSS via the Apify console or API.

๐Ÿงพ Schema

FieldTypeExample
๐Ÿ–ผ๏ธ logoUrlstring (url)https://bookface-images.s3.amazonaws.com/logos/abc.png
๐Ÿ†” idstring5234
๐Ÿข namestringAirbnb
๐Ÿท๏ธ slugstringairbnb
๐Ÿ”— urlstring (url)https://www.ycombinator.com/companies/airbnb
๐ŸŒ websitestring (url)https://airbnb.com
๐Ÿ“ oneLinerstringBook accommodations around the world
๐ŸŽ“ batchstringW09
๐Ÿท๏ธ batchNamestringWinter 2009
๐Ÿ“Š statusstringPublic
๐Ÿ“ˆ stagestringPublic
๐Ÿ—“๏ธ yearFoundednumber2008
๐Ÿ‘ฅ teamSizenumber6132
๐Ÿ“ locationstringSan Francisco, CA, USA
๐Ÿท๏ธ industriesarray["Travel"]
๐ŸŒ regionsarray["United States of America"]
๐Ÿ‘ฅ foundersarray[{"name":"Brian Chesky","title":"CEO","linkedin":"..."}]
๐Ÿ’ผ jobsarray[{"title":"Senior Engineer","equity":"0.01-0.05%","salary":"$200K-$300K"}]
๐Ÿฆ twitterstringhttps://twitter.com/airbnb
๐Ÿ’ผ linkedinstringhttps://linkedin.com/company/airbnb
๐Ÿ“ž contactEmailstringpress@airbnb.com
โญ isTopCompanybooleantrue
๐Ÿค isNonprofitbooleanfalse
๐Ÿ“ descriptionstringAirbnb is an online marketplace for...
๐Ÿ“… scrapedAtISO datetime2026-05-08T12:00:00.000Z

๐Ÿ“ฆ Sample records

1. Public top company (Airbnb)

{
"logoUrl":"https://bookface-images.s3.amazonaws.com/logos/airbnb.png",
"id":"5234",
"name":"Airbnb",
"slug":"airbnb",
"url":"https://www.ycombinator.com/companies/airbnb",
"website":"https://airbnb.com",
"oneLiner":"Book accommodations around the world",
"batch":"W09",
"batchName":"Winter 2009",
"status":"Public",
"stage":"Public",
"yearFounded":2008,
"teamSize":6132,
"location":"San Francisco, CA, USA",
"industries":["Consumer","Travel"],
"regions":["United States of America"],
"founders":[
{"name":"Brian Chesky","title":"CEO","linkedin":"https://linkedin.com/in/brianchesky"},
{"name":"Joe Gebbia","title":"Co-founder"},
{"name":"Nathan Blecharczyk","title":"Co-founder"}
],
"twitter":"https://twitter.com/airbnb",
"linkedin":"https://linkedin.com/company/airbnb",
"isTopCompany":true,
"isNonprofit":false,
"scrapedAt":"2026-05-08T12:00:00.000Z"
}

2. Hiring early-stage company (W25 batch)

{
"logoUrl":"https://bookface-images.s3.amazonaws.com/logos/acme.png",
"id":"32145",
"name":"Acme AI",
"slug":"acme-ai",
"website":"https://acme-ai.com",
"oneLiner":"AI agents for B2B back-office workflows",
"batch":"W25",
"batchName":"Winter 2025",
"status":"Active",
"stage":"Seed",
"yearFounded":2024,
"teamSize":5,
"location":"San Francisco, CA, USA",
"industries":["B2B"],
"regions":["United States of America"],
"founders":[
{"name":"Jane Smith","title":"CEO"},
{"name":"John Doe","title":"CTO"}
],
"jobs":[
{"title":"Founding Engineer","equity":"0.5-2.0%","salary":"$150K-$200K","location":"SF (in-person)"},
{"title":"Founding Designer","equity":"0.3-1.0%","salary":"$130K-$180K","location":"SF (in-person)"}
],
"scrapedAt":"2026-05-08T12:00:00.000Z"
}

3. Acquired company (sparse fields)

{
"id":"1234",
"name":"Old Startup",
"slug":"old-startup",
"batch":"S15",
"batchName":"Summer 2015",
"status":"Acquired",
"stage":"Acquired",
"yearFounded":2014,
"isTopCompany":false,
"scrapedAt":"2026-05-08T12:00:00.000Z"
}

โœจ Why choose this Actor

Capability
๐ŸŽฏBuilt for the job. Scoped specifically to the Y Combinator directory so you skip the parser engineering entirely.
๐Ÿ”–Structured output. Clean, typed fields ready for analysis, dashboards, or downstream pipelines.
โšกFast. Optimized request patterns return results in seconds, not minutes.
๐Ÿ”Always fresh. Every run pulls live data, so the dataset reflects YC as of run time.
๐ŸŒNo infra to manage. Apify handles proxies, retries, scaling, scheduling, and storage.
๐Ÿ›ก๏ธReliable. Battle-tested across many runs and edge cases, with graceful error handling.
๐ŸšซNo code required. Configure in the UI, run from CLI, schedule via cron, or call from any language with the Apify SDK.

๐Ÿ“Š Production-grade structured startup data without the engineering overhead of building and maintaining your own scraper.


๐Ÿ“ˆ How it compares to alternatives

ApproachCostCoverageRefreshFiltersSetup
โญ Y Combinator Companies Scraper (this Actor)$5 free credit, then pay-per-useFull YC directory (5,700+)Live per runBatch, industry, region, stage, hiringโšก 2 min
Build your own scraperEngineering hoursFull once builtWhenever you maintain itCustom code๐Ÿข Days to weeks
Paid VC databases$$$ monthly per seatVendor-definedPeriodicVendor-definedโณ Hours
Manual sourcingHours per companyLimitedStaleManual filter clicking๐Ÿ•’ Variable

Pick this Actor when you want broad coverage, source-native filtering, and no pipeline maintenance.


๐Ÿš€ How to use

  1. ๐Ÿ“ Sign up. Create a free account with $5 credit (takes 2 minutes).
  2. ๐ŸŒ Open the Actor. Go to the Y Combinator Companies Scraper page on the Apify Store.
  3. ๐ŸŽฏ Set filters. Pick batch, industry, region, and other filters, then set maxItems.
  4. ๐Ÿš€ Run it. Click Start and let the Actor collect your data.
  5. ๐Ÿ“ฅ Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.

โฑ๏ธ Total time from signup to downloaded dataset: 3-5 minutes. No coding required.


๐Ÿ’ผ Business use cases

๐Ÿ“Š VC and investor research

  • Track new YC batches as they launch
  • Build watchlists by industry and stage
  • Map founder profiles across YC cohorts
  • Surface hiring signals as growth indicators

๐Ÿข Sales and BD

  • Build outbound prospect lists of YC startups
  • Filter by hiring status to find well-funded buyers
  • Source product partner candidates by industry
  • Power CRM enrichment with batch and stage data

๐ŸŽฏ Recruiting

  • Source candidates from YC company hiring pages
  • Build talent pipelines by industry vertical
  • Map technical leadership across YC alumni
  • Track which YC companies are hiring in your stack

๐Ÿ› ๏ธ Engineering and product

  • Prototype startup-data products without owning a crawler
  • Replace fragile in-house YC scrapers
  • Wire datasets into your apps via the Apify API or webhooks
  • Skip the proxy, retry, and parsing maintenance entirely

๐ŸŒŸ Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

๐ŸŽ“ Research and academia

  • Empirical datasets for papers, thesis work, and coursework
  • Longitudinal studies tracking changes across snapshots
  • Reproducible research with cited, versioned data pulls
  • Classroom exercises on data analysis and ethical scraping

๐ŸŽจ Personal and creative

  • Side projects, portfolio demos, and indie app launches
  • Data visualizations, dashboards, and infographics
  • Content research for bloggers, YouTubers, and podcasters
  • Hobbyist collections and personal trackers

๐Ÿค Non-profit and civic

  • Transparency reporting and accountability projects
  • Advocacy campaigns backed by public-interest data
  • Community-run databases for local issues
  • Investigative journalism on public records

๐Ÿงช Experimentation

  • Prototype AI and machine-learning pipelines with real data
  • Validate product-market hypotheses before engineering spend
  • Train small domain-specific models on niche corpora
  • Test dashboard concepts with live input

๐Ÿ”Œ Automating Y Combinator Companies Scraper

This Actor exposes a REST endpoint, so you can drive it from any language or workflow tool.

Schedules. Use Apify Scheduler to run hourly, daily, or weekly snapshots. Combine with the Apify dataset diff tools to track new YC companies between runs.


โ“ Frequently Asked Questions

๐Ÿ”Œ Integrate with any app

Y Combinator Companies Scraper connects to any cloud service via Apify integrations:

  • Make - Automate multi-step workflows
  • Zapier - Connect with 5,000+ apps
  • Slack - Get run notifications in your channels
  • Airbyte - Pipe results into your warehouse
  • GitHub - Trigger runs from commits and releases
  • Google Drive - Export datasets straight to Sheets

You can also use webhooks to trigger downstream actions when a run finishes. Push fresh data into your product backend or alert your team in Slack.


๐Ÿ”— Recommended Actors

๐Ÿ’ก Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.


๐Ÿ†˜ Need Help? Open our contact form to request a new scraper, propose a custom project, or report an issue.


โš ๏ธ Disclaimer. This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by Y Combinator or any of its subsidiaries. All trademarks mentioned are the property of their respective owners. The scraper accesses only publicly available pages and is intended for legitimate research, analytics, and lead-generation use. Users are responsible for compliance with the source site's Terms of Service and applicable law.

You might also like

Y Combinator Startups Scraper

automation-lab/ycombinator-scraper

Extract Y Combinator startup data: company names, websites, descriptions, team sizes, batches, industries, and hiring status. Filter by batch (W24, S23), status, industry, or tags. Uses the official YC API โ€” no proxy needed. Export as JSON, CSV, or Excel.

๐Ÿ‘ User avatar

Stas Persiianenko

49

Y Combinator Scraper with Founders & Emails

fatihtahta/y-combinator-directory-scraper

Scrape the Y Combinator directory and get rich company profiles with socials, founder details + emails, hiring status/job links, and news mentions. Perfect for lead gen, market mapping, recruiting, and competitor tracking.

185

2.7

Y Combinator Scraper

michael.g/y-combinator-scraper

Extract startup leads, founder emails, LinkedIn profiles, hiring data, and more from YC companies and founders. Export scraped data, schedule via API, and integrate with other tools or AI workflows.

1.4K

5.0

Y Combinator Jobs Scraper

artemlazarevm/yc-jobs-scraper

Scrape Y Combinator companies and job listings. 2,500+ startups, 2,400+ jobs, 3,300+ founders. Free dataset: https://www.kaggle.com/datasets/lazarun/y-combinator-jobs-enriched (scraped with this API).

106

Crunchbase Scraper

parseforge/crunchbase-scraper

Extract company data from Crunchbase profiles. Get funding rounds, investor lists, employee details, social links, operating status, and more from any company URL. No Crunchbase subscription needed. Process hundreds of profiles in a single run and export structured data as JSON, CSV, or Excel.

LinkedIn Easy Apply Bot โ€” Auto-Apply with AI Filters

sunny_spade/linkedin-easy-apply-bot

Automatically applies to LinkedIn Easy Apply jobs matching your profile. Filters by language, role relevance, and Language fluency requirements. Fills all form fields using your profile data. Requires a valid LinkedIn session cookie.

Y Combinator Scraper - 5000+ Startups & 8000+ Founders

clearpath/ycombinator-api-scraper

Extract complete Y Combinator ecosystem data - 5000+ companies, 8000+ founders, 3500+ jobs. Perfect for VCs, recruiters, and researchers. Get startup intelligence, funding trends, team data, and job listings. Reliable Python scraper with proxy support. Start at $3.50.

329

4.3

Y Combinator Companies Scraper

jungle_synthesizer/y-combinator-scraper

Extract company profiles from the Y Combinator startup directory. Covers 5,700+ funded startups across all YC batches. Returns name, website, one-liner, description, team size, location, industry, batch, stage, hiring status. Filter by batch, industry, region, or hiring status.

๐Ÿ‘ User avatar

BowTiedRaccoon

2

Y Combinator Jobs Scraper

parsebird/yc-jobs-scraper

Scrape Y Combinator startup job listings with salary, equity, visa sponsorship, founder data, and full descriptions. Filter by role and location. Job-centric output ready for recruiting, salary benchmarking, and startup research.

261

3.9

YCombinator Companies Scraper | 5,900+ YC Startup Directory

haketa/ycombinator-companies-scraper

Y Combinator companies scraper & API: export the YC startup directory by batch, industry & status โ€” company name, description, website, batch, team size, location, founders, tags and YC profile URL. Startup intelligence, VC research and B2B lead lists โ€” fast, no login.