VOOZH about

URL: https://apify.com/nexgendata/sequoia-portfolio-scraper

โ‡ฑ Sequoia Capital Portfolio Scraper ยท Apify


Pricing

from $750.00 / 1,000 portfolio companies

Go to Apify Store

Sequoia Capital Portfolio Scraper

The only structured Sequoia + Peak XV portfolio feed (~710 companies).

Pricing

from $750.00 / 1,000 portfolio companies

Rating

0.0

(0)

Developer

๐Ÿ‘ NexGenData

NexGenData

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

11 days ago

Last modified

Share

The only structured feed of the full Sequoia + Peak XV portfolio. Returns ~710 companies across Sequoia Capital (US, ~406) and Peak XV Partners (former Sequoia India / Southeast Asia, ~304). One dataset row per portfolio company, with the fields VC sourcing analysts actually look up: sector, partnered year, founded year, current status (Active / Acquired / Public), partner name, founders, website, twitter, linkedin, geography.

Why this exists (and why no one else has it)

Sequoia's public site at https://www.sequoiacap.com/our-companies/ shows only ~52 curated highlights out of a 400+ company portfolio. Same on Peak XV. There is no public portfolio export โ€” no CSV, no API, no downloadable list.

This scraper consumes the canonical XML sitemap (/company-sitemap.xml) that Sequoia uses to feed search engines, then visits each /companies/{slug}/ detail page in parallel. Result: a clean, structured, complete portfolio you can import into a CRM, an outbound sourcing pipeline, or a CB Insights / PitchBook replacement.

Premium positioning: $0.75 per company. That is roughly 5โ€“10ร— the entry-tier VC scrapers because (a) Sequoia is a top-three brand, (b) data scarcity is real, (c) the analyst on the buyer side earns this price back the first time they don't pay a $5K/seat database subscription for the same lookup.

What you get

Per portfolio company:

  • name โ€” company name
  • slug โ€” URL-safe identifier (matches the Sequoia detail page slug)
  • url โ€” Sequoia / Peak XV detail page URL
  • description โ€” company one-liner
  • sector / sectors[] โ€” Sequoia's own sector taxonomy (Consumer, Enterprise, Healthcare, Financial Services, GTM, Bio, Crypto, AI, Infrastructure, FinTech, HealthTech, EdTech, D2C, etc.)
  • founded_year โ€” year the company was founded
  • partnered_year / year_invested โ€” year Sequoia first invested
  • acquired_year โ€” year of acquisition (if exited)
  • ipo_year โ€” year listed publicly (if IPO'd)
  • status โ€” derived label: Active (Private) / Acquired (YYYY) / Public/IPO (YYYY)
  • partner / partners[] โ€” Sequoia investment partner(s) on the deal (Bryan Schreier, Andrew Reed, Jim Goetz, Shailendra Singh, etc.)
  • founders[] โ€” founder names (when surfaced on the detail page)
  • website โ€” company website
  • twitter_url, linkedin_url โ€” official social profiles
  • geography โ€” US (Sequoia Capital), India, SEA, or India/SEA (Peak XV)
  • source_firm โ€” Sequoia Capital or Peak XV (Sequoia India/SEA)
  • source_sitemap_lastmod โ€” when Sequoia last updated this company's page
  • scraped_at โ€” UTC ISO timestamp of this run

Input

FieldTypeDescription
geographyFilterenumGlobal (default) / US / India / SEA. Picks which sitemap to crawl.
sectorFilterstring[]Optional substring filter on sector pills (e.g. ["FinTech", "AI"]).
stageFilterstring[]Optional substring filter on derived status (e.g. ["Acquired"] for M&A leads, ["Public"] for IPOs).
yearFromToobjectInclusive bounds on partnered_year, e.g. {"from": 2020, "to": 2026} for recent vintage.
maxResultsintegerCap on dataset rows (1โ€“1000, default 100).

Use cases

  • VC sourcing & competitive intel. Pull the full Sequoia + Peak XV roster, dedupe against your CRM, find overlap with YC, a16z, Founders Fund, Bessemer, Greylock, Lightspeed.
  • M&A target lists. Filter stageFilter=["Acquired"] to study Sequoia's exit playbook, or stageFilter=["Active"] with a vintage window to find growth-stage targets approaching their exit window.
  • Founder enrichment. Cross-reference founders[] and partners[] with LinkedIn / Crunchbase enrichment actors to build a deal-team map.
  • BD prospecting. Sequoia-backed companies are pre-qualified buyers for many enterprise SaaS categories. Filter by sector, send a sequence.
  • Market mapping. Group by sector + partnered_year to see where Sequoia leaned in each vintage.

Companion actors

  • YC Companies Directory โ€” Y Combinator alumni; large overlap with Sequoia's early-stage portfolio (Stripe, Airbnb, DoorDash were both YC and Sequoia).
  • a16z Portfolio Scraper โ€” direct competitor portfolio for cross-firm comparison.
  • Startup Funding Tracker โ€” recent rounds + valuations to enrich Sequoia-backed companies.
  • 500 Global Companies Directory โ€” accelerator alumni at the early-stage entry point.

Combine any of these to build the most complete sourcing pipeline on the market.

Pricing

Pay-per-event:

  • Per company โ€” $0.75 (primary event, charged on every dataset row)
  • Actor start โ€” $0.00005 (negligible)

You only pay for the rows you actually receive. Filtering by sector / status / year does not cost extra.

Technical notes

  • Static HTML parsing. No JS rendering, no headless browser.
  • Apify residential proxy used by default for IP rotation.
  • Concurrent fetch (8-way) โ€” full Global crawl (~710 companies) completes in ~3โ€“5 minutes.
  • Detail-page selectors handle both the Sequoia clist__item markup and the Peak XV company__milestone markup.
  • Robust to either site reordering the milestones / partners / categories blocks.

Limitations

  • Sequoia China (HongShan) is not in scope. Sequoia US, Sequoia China, and Peak XV (India/SEA) split into three independent firms in 2023; HongShan runs separate infrastructure that does not expose a comparable public sitemap. This actor covers Sequoia US + Peak XV only.
  • The original stage-invested label (Series A vs B vs C at time of investment) is not surfaced on Sequoia detail pages and therefore not in the output. We expose partnered_year (first-investment year) and status (current state) instead โ€” sufficient for almost all sourcing use cases.
  • For companies that have been acquired but later acqui-spun-back-out, the acquired_year reflects what Sequoia shows on the page, which may lag real-world events by 0โ€“6 months.

You might also like

Hongshan (Sequoia China) Portfolio Scraper

jungle_synthesizer/hongshan-sequoia-china-portfolio-scraper

Scrapes the complete HongShan (formerly Sequoia Capital China) portfolio from hongshan.com โ€” the definitive map of PRC and South-East Asia high-growth tech companies. Outputs names, descriptions, sectors, websites and logos in both Chinese and English.

๐Ÿ‘ User avatar

BowTiedRaccoon

2

VC Portfolio Aggregator โ€” a16z, Sequoia, Greylock & More

jungle_synthesizer/a16z-sequoia-vc-portfolio-aggregator-scraper

Scrapes portfolio companies from top-tier VC firms including a16z, Sequoia, and Greylock. Returns name, website, description, sector, and VC source in one unified dataset.

๐Ÿ‘ User avatar

BowTiedRaccoon

2

Techstars Portfolio Companies Scraper

scraped/techstars-portfolio-companies-scraper

Scrape all Portfolio Companies from Techstars

VC Portfolio Jobs Aggregator

parseforge/vc-portfolio-jobs-aggregator-scraper

Aggregate startup jobs from a16z, Sequoia, Greylock, Kleiner Perkins, Accel, Index, YC and 6 more top VC portfolio job boards. Pull role, company, location, remote flag and direct apply links. Built for startup recruiters and engineers eyeing VC backed roles.

500 Global Portfolio Scraper

automation-lab/500-global-portfolio-scraper

Extract public 500 Global portfolio companies with websites, industries, locations, stages, batches, and investment metadata.

๐Ÿ‘ User avatar

Stas Persiianenko

2

China Tier-1 VC Portfolio Aggregator

jungle_synthesizer/china-tier1-vc-portfolio-aggregator-scraper

Aggregates portfolio companies from 9 top-tier Chinese VC firms: HongShan, IDG Capital, ZhenFund, Sinovation Ventures, Qiming, Hillhouse, 5Y Capital, Matrix Partners China, and Legend Capital. Returns unified records with company names, sector, stage, website, and investment metadata.

๐Ÿ‘ User avatar

BowTiedRaccoon

2

Cometa VC Portfolio & Partners Scraper

jungle_synthesizer/cometa-vc-portfolio-partners-scraper

Scrapes the full Cometa VC portfolio from cometa.vc/portfolio. Extracts company name, sector, category, description, founders, investment year, country, website, and LinkedIn URL for every partner company in the Spanish-speaking LATAM fund's portfolio.

๐Ÿ‘ User avatar

BowTiedRaccoon

2

TinySeed Portfolio Scraper

automation-lab/tinyseed-portfolio-scraper

๐ŸŒฑ Scrape TinySeed portfolio companies with cohorts, locations, categories, descriptions, logos, and websites for startup research.

๐Ÿ‘ User avatar

Stas Persiianenko

2

Kaszek Ventures LATAM Portfolio Companies Scraper

jungle_synthesizer/kaszek-ventures-latam-portfolio-companies-scraper

Scrape the full Kaszek Ventures portfolio โ€” LATAM's leading VC firm. Extracts company name, tagline, description, founders, location, website, LinkedIn, sector, investment status and more from each portfolio company's profile page.

๐Ÿ‘ User avatar

BowTiedRaccoon

2