VOOZH about

URL: https://apify.com/khadinakbar/ycombinator-scraper

โ‡ฑ Y Combinator Scraper ยท Apify


๐Ÿ‘ Y Combinator Scraper โ€” Companies, Founders & Jobs avatar

Y Combinator Scraper โ€” Companies, Founders & Jobs

Pricing

from $5.00 / 1,000 result scrapeds

Go to Apify Store

Y Combinator Scraper โ€” Companies, Founders & Jobs

Scrape the YC ecosystem in one actor: 5,700+ companies, 8,000+ founders with socials, 3,000+ jobs with salary/equity/visa, news & launches. Paste any YC URL or filter by batch, industry, region, hiring. MCP-ready, $0.005/result.

Pricing

from $5.00 / 1,000 result scrapeds

Rating

0.0

(0)

Developer

๐Ÿ‘ Khadin Akbar

Khadin Akbar

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

9 days ago

Last modified

Share

Y Combinator Scraper โ€” Companies, Founders, Jobs & All Data Points

Scrape the entire Y Combinator ecosystem from one Apify Actor. 5,700+ companies (W05 to the latest cohort), 8,000+ founders with social URLs, 3,000+ open jobs with salary, equity, visa, and skills. Plus news mentions, YC Launch posts, app/demo-day videos, and the YC application Q&A when public.

Paste any YC URL โ€” /companies?...filters, /companies/{slug}, /companies/{slug}/jobs/{slug}, /jobs?role=... โ€” or pick from the filter UI. The actor auto-routes each URL to the right extractor.

What you get

SurfaceData points
Companyid, slug, name, batch (W26/S26/F25/etc.), status, one-liner, long description, website, team size, year founded, industry + sub-industry, regions, HQ city/country, location string, tags, top-company flag, hiring flag, nonprofit flag, launched-at date, logo URLs, LinkedIn / Twitter / Facebook / Crunchbase / GitHub URLs, app video URL, demo-day video URL, app answers (when public), primary YC partner
FounderuserId, full name, title, bio, LinkedIn URL, Twitter URL, avatar, hasEmail flag, isActive flag
Open Job (per company)id, title, jobUrl, applyUrl, location, type (full-time/part-time/contract/intern), role + roleSpecificType + prettyRole, salaryRange, equityRange, visa sponsorship, skills[], minExperience, createdAt, lastActive
Job (jobs mode)All open-job fields plus full long-form description + hiring manager when scrapeJobDescriptions is on
News & Launch postsNews title + URL + date; YC Launch post title + tagline + permalink
Run summaryCounts, errors, duration, mode

Why this actor

  • One actor, every surface. No need to chain three scrapers โ€” companies, founders, jobs, news come back in one record per company.
  • MCP-native. Single auto-routing input, structured agent-friendly output, predictable per-event pricing. Drop into Claude / GPT / Apify MCP and it answers questions like "find YC fintech startups in Europe hiring engineers" in one tool call.
  • Reliable. Uses YC's public Algolia index for company discovery (no proxy needed, no rate limit) + residential-proxy HTML enrichment with session pool + circuit-breaker. Target: 95%+ run success rate.
  • Premium-tier completeness, fair price. $0.005 per record (between the cheap $0.001/1K actors that drop fields and the premium leader at $0.01โ€“0.15). Founders and jobs are billed separately ($0.001 each) so you only pay for what you turn on.

Pricing

EventPrice
Actor start$0.00005 (per GB RAM)
Result (one company OR one job)$0.005
Enrichment (one founder OR one embedded job per company)$0.001

Typical run cost: 100 companies + founders enrichment + 3 founders avg/company โ‰ˆ 100 ร— $0.005 + 300 ร— $0.001 = $0.80.

Pay-Per-Usage is also enabled for power users running multi-million-record jobs โ€” compute units + proxy passthrough are billed directly by Apify.

Quick start

Get the Spring 2026 batch with founders

{
"mode":"companies",
"batches":["Spring 2026"],
"scrapeFounders":true,
"maxResults":100
}

Get all hiring B2B startups in Europe

{
"mode":"companies",
"industries":["B2B"],
"regions":["Europe","United Kingdom"],
"isHiring":true,
"scrapeFounders":true,
"scrapeOpenJobs":true,
"maxResults":200
}

Get every open software-engineer job in San Francisco

{
"mode":"jobs",
"jobRole":"eng",
"jobLocation":"san-francisco",
"scrapeJobDescriptions":true,
"maxResults":500
}

Paste any YC URL

{
"startUrls":[
"https://www.ycombinator.com/companies?batch=Spring%202026&industry=Fintech",
"https://www.ycombinator.com/companies/stripe",
"https://www.ycombinator.com/jobs?role=design"
],
"scrapeFounders":true
}

Daily incremental "what's new" run

{
"mode":"jobs",
"jobRole":"eng",
"monitoringMode":true,
"maxResults":0
}

The actor persists every emitted job id in the STATE key-value collection. On the next scheduled run, only IDs not yet seen are emitted. Perfect for cron-driven email alerts and pipelines.

Code examples

Python

from apify_client import ApifyClient
client = ApifyClient("YOUR_APIFY_TOKEN")
run = client.actor("khadinakbar/y-combinator-scraper").call(run_input={
"mode":"companies",
"batches":["Spring 2026","Winter 2026"],
"industries":["B2B"],
"scrapeFounders":True,
"maxResults":50,
})
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
print(item["name"], item["batch"],len(item.get("founders",[])))

Node.js

import{ ApifyClient }from'apify-client';
const client =newApifyClient({token:'YOUR_APIFY_TOKEN'});
const run =await client.actor('khadinakbar/y-combinator-scraper').call({
mode:'companies',
batches:['Spring 2026'],
scrapeFounders:true,
scrapeOpenJobs:true,
maxResults:50,
});
const{ items }=await client.dataset(run.defaultDatasetId).listItems();
for(const item of items){
console.log(item.name, item.founders?.length, item.openJobs?.length);
}

MCP (Claude / Cursor / Continue)

Use the apify--y-combinator-scraper tool name through Apify MCP at https://mcp.apify.com.

Find me 20YC fintech startups in Europe that are currently hiring,
and include each founder's LinkedIn URL.

The agent calls apify--y-combinator-scraper with industries=["Fintech"], regions=["Europe"], isHiring=true, scrapeFounders=true, maxResults=20.

Filters reference

Companies mode

FilterTypeExample
querystring"developer tools", "AI agents"
batchesarray["Spring 2026", "Winter 2026"]
industriesarray["B2B", "Fintech"]
regionsarray["Europe", "Remote"]
topCompanybooleantrue = curated Top Companies only
isHiringbooleantrue = currently hiring only
nonprofitbooleantrue = YC nonprofits only
minTeamSize / maxTeamSizeinteger10 / 50

Jobs mode

FilterTypeExample
querystring"react", "machine learning"
jobRoleenumeng, design, product, ops, marketing, sales, support, recruiting-hr, science
jobLocationstringsan-francisco, new-york, london, remote, india

Enrichment toggles

ToggleWhenCost
scrapeFoundersCompanies mode$0.001 per founder added
scrapeOpenJobsCompanies mode$0.001 per job added
scrapeNewsAndLaunchesCompanies modefree (same HTTP fetch)
scrapeJobDescriptionsJobs modefree (covered by base $0.005/job)

Output shape

Each company record looks like:

{
"itemType":"company",
"id":271,
"slug":"airbnb",
"name":"Airbnb",
"batch":"W09",
"batchName":"Winter 2009",
"status":"Public",
"oneLiner":"Book accommodations around the world.",
"website":"http://airbnb.com",
"teamSize":6132,
"industries":["Consumer"],
"regions":["United States of America"],
"tags":["Marketplace","Travel"],
"topCompany":true,
"isHiring":false,
"linkedinUrl":"https://www.linkedin.com/company/airbnb/",
"twitterUrl":"https://twitter.com/airbnb",
"githubUrl":"https://github.com/airbnb",
"logoUrl":"https://...",
"ycUrl":"https://www.ycombinator.com/companies/airbnb",
"primaryPartner":"Garry Tan",
"founders":[
{
"fullName":"Brian Chesky",
"title":"Founder/CEO",
"linkedinUrl":"https://www.linkedin.com/in/brianchesky/",
"twitterUrl":"https://twitter.com/bchesky",
"founderBio":"...",
"hasEmail":true,
"isActive":true
}
]
}

Each job record looks like:

{
"itemType":"job",
"id":47049,
"title":"Frontend Engineer, Identity",
"jobUrl":"https://www.ycombinator.com/companies/stripe/jobs/jdBhPmD-frontend-engineer-identity",
"applyUrl":"https://...workatastartup.com/application?...",
"location":"United States / Remote",
"jobType":"Full-time",
"prettyRole":"Engineering",
"roleSpecificType":"Frontend",
"salaryRange":"",
"equityRange":"",
"visa":"Will sponsor",
"skills":["JavaScript"],
"minExperience":"6+ years",
"companyName":"Stripe",
"companyBatch":"S09",
"companyOneLiner":"Economic infrastructure for the internet.",
"createdAt":"over 4 years",
"lastActive":null
}

Reliability notes

  • Companies discovery uses YC's public Algolia index (YCCompany_production). No proxy, no rate limit. We re-acquire the dynamic API key from ycombinator.com/companies once at run start.
  • HTML enrichment (/companies/{slug}, /jobs, /jobs/{slug}) uses Apify residential proxy by default, with Crawlee session pool (max 20 sessions, max 30 uses each, retire on error score 3+).
  • Retries: 5 per request with exponential backoff and jitter. Sessions retire on 403/429.
  • Graceful failure: when a target URL fails after retries, the URL + reason is recorded in the run summary errors[] array and the run continues. Partial success is success.

Related actors

  • LinkedIn Jobs Scraper โ€” broader job-market signal beyond the YC ecosystem (salary + skills extraction).
  • Indeed Job Scraper โ€” Indeed jobs with full details to compare YC roles against the wider market.
  • B2B Lead Finder & Enrichment โ€” find decision-makers at the YC companies you just scraped.
  • GitHub Scraper โ€” repos, issues, PRs & contributors for YC dev-tool companies' open-source footprint.
  • Hugging Face Scraper โ€” models, datasets & Spaces to map AI YC startups against ML community activity.

FAQ

Q: Is this legal? Y Combinator publishes the entire company directory + jobs board for public, unauthenticated viewing. This actor reads only those public pages and the public Algolia index that YC's own frontend uses. We do not log in, do not bypass any auth wall, and do not access Work At A Startup's authenticated areas.

Q: How fresh is the data? Every run hits YC live โ€” there's no cached snapshot. Algolia is updated in near real-time by YC's content team. Company HTML and jobs pages are server-rendered fresh.

Q: Why are some applyUrl values pointing to account.ycombinator.com/authenticate? YC routes apply links through their authentication system on the way to Work At A Startup. That's their design โ€” the URL is correct, and a human user who clicks it from a browser will be prompted to sign in once, then forwarded to the actual job application form.

Q: Why is salaryRange empty on some jobs? Many YC startups don't publish salary bands on their YC job posting. That's a YC company choice, not a scraper limitation. When present, it's returned verbatim ("$180K - $230K").

Legal disclaimer

This actor scrapes only publicly accessible pages on ycombinator.com. By using this actor you agree to:

  • Comply with Y Combinator's Terms of Service.
  • Use the data for legitimate purposes (research, hiring, BD, journalism).
  • Respect the personal data of founders (do not spam, comply with GDPR/CCPA, honor opt-outs).
  • The actor author is not responsible for misuse.

You might also like

Y Combinator Scraper

solidcode/ycombinator-scraper

[๐Ÿ’ฐ $2.0 / 1K] Extract YC-funded companies, founders, and open jobs from the Y Combinator startup directory. Filter by batch, industry, region, status, team size, or hiring activity โ€” or paste any directory URL with filters pre-applied.

YC Companies Scraper | $5/1K | Founders, Socials, Batches

apivault_labs/yc-companies-scraper

Scrape Y Combinator companies directory: name, batch, industry, team size, website, LinkedIn, Twitter, founders with bios. 5000+ startups. Official public data source.

4

Y Combinator Scraper - YC Companies Directory

logiover/y-combinator-companies-directory-scraper

Scrape Y Combinator companies without login. Export the YC startup directory to CSV/JSON, an unofficial API alternative with founders and LinkedIn.

Y Combinator Companies Scraper

parseforge/y-combinator-scraper

Extract company profiles, founders, and open job listings from the Y Combinator directory. Filter by batch, industry, subindustry, region, and hiring status. Covers 5,700+ funded startups from W05 to the latest YC cohort. Includes growth stage, equity ranges, salary data, and contact emails.

YC Startup & Jobs Scraper โ€” Companies, Jobs & Founders

scrapepilot/yc-startup-jobs-scraper----companies-jobs-founders

Scrape YC-backed startup data from workatastartup.com. Returns company details, active jobs with salary ranges, founder LinkedIn profiles, team size and YC batch. Filter by industry, batch, remote-only and hiring status. Perfect for recruiting, sales and investor research.

Y Combinator Companies Scraper. Batch, Industry, Hiring

seemuapps/yc-companies-scraper

Scrape every Y Combinator startup with batch, industry, region, status, hiring, team size, founders' description, website, and YC profile URL. Filter by W24/S24/X25 batches or any criteria.

Y Combinator ยท YC ยท Only $1๐Ÿ’ฐ ยท Jobs & Companies scraper

memo23/y-combinator-scraper

๐Ÿ’ฐ $1/1K One actor for Y Combinator jobs (Work at a Startup) and companies (Startup Directory). Paste any YC URL โ€” auto-routed โ€” or use filters. Companies via Algolia: no proxy, clean schema. Optional founder enrichment: LinkedIn/Twitter URLs, company socials, open jobs. Full batch history to 2005.

๐Ÿ‘ User avatar

Muhamed Didovic

80

5.0

YC Intelligence Pro

genz-coder/yc-intelligence-pro

YC Scraper - Extract 40+ Fields from Y Combinator Companies

YCombinator Companies Scraper | 5,900+ YC Startup Directory

haketa/ycombinator-companies-scraper

Y Combinator companies scraper & API: export the YC startup directory by batch, industry & status โ€” company name, description, website, batch, team size, location, founders, tags and YC profile URL. Startup intelligence, VC research and B2B lead lists โ€” fast, no login.