VOOZH about

URL: https://apify.com/nexgendata/crunchbase-news-scraper?fpr=2ayu9b

โ‡ฑ Crunchbase News Scraper โ€” VC Funding, M&A & Startup News API ยท Apify


๐Ÿ‘ ๐Ÿ“ฐ Crunchbase News Scraper โ€” Daily Funding & M&A Headlines avatar

๐Ÿ“ฐ Crunchbase News Scraper โ€” Daily Funding & M&A Headlines

Pricing

from $100.00 / 1,000 news articles

Go to Apify Store

๐Ÿ“ฐ Crunchbase News Scraper โ€” Daily Funding & M&A Headlines

Daily VC funding rounds, M&A, IPO, startup news from Crunchbase News. Structured JSON with entity extraction, funding amount parsing, round-type classification. Bloomberg / Reuters Eikon / Refinitiv / Mergermarket / TechCrunch alternative. Pay-per-article.

Pricing

from $100.00 / 1,000 news articles

Rating

0.0

(0)

Developer

๐Ÿ‘ NexGenData

NexGenData

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

7 hours ago

Last modified

Categories

Share

VC funding rounds. Acquisitions. IPOs. Quarterly recaps. Startup launches. One Actor, the full firehose from news.crunchbase.com โ€” the editorial side of Crunchbase, not the locked-down data platform. Pay-per-article, no $30K/seat Bloomberg or $20K/year Mergermarket contract.

๐Ÿ“Š Sample Output

๐Ÿ‘ ๐Ÿ“ฐ Crunchbase News Scraper โ€” Daily Funding & M&A Headlines sample output โ€” ๐Ÿ“ฐ Crunchbase News Scraper โ€” Daily Funding & M&A Headlines, premium API, JSON output, NexGenData premium dataset for analysts,

โ–ถ๏ธ Try this Actor โ†’ โ€” first article on us, $0.10 per article after.


๐ŸŽฏ Who this is for

  • VC associates running daily intel โ€” every Series A/B/C ann, every M&A whisper, in one structured JSON feed.
  • Sales / BD teams chasing founders who just raised. Trigger an outreach the morning the round is announced.
  • Corporate development tracking acquisitions in their sector โ€” see who Adobe, Salesforce, Cisco, Snowflake just bought.
  • Investor relations running competitive intel for their portfolio companies.
  • Journalists & analysts building newsletters, Substacks, sector reports.
  • Quant funds wiring "headline alpha" feeds into event-driven strategies.

If you've ever paid $1,800/year for Term Sheet, $20K/year for Mergermarket, or $24K/seat for Reuters Eikon just to get the same headlines that Crunchbase News publishes for free โ€” this Actor turns that page into structured rows with entity extraction, funding amount parsing, and round-type classification.


๐Ÿ’ธ Pricing

What you payWhen
$0.01 actor startOnce per run (covers compute kickoff)
$0.10 per articleEach article record pushed to dataset (the primary unit)

A typical daily intel run pulling the last 24 hours across all categories (~15 articles) is $1.51. A weekly bulk run pulling 100 articles is $10.01. A monthly back-fill of 500 articles is $50.01.

That's <1% of what a Bloomberg Terminal seat costs to surface the same M&A and funding stories โ€” and ours is structured, queryable, and merges cleanly into your data warehouse.


๐Ÿ“ฅ Inputs

FieldTypeDescription
limitintMax articles to return (1โ€“500). Default 25.
categoryarrayRestrict to Crunchbase News sections (venture, ma, startups, business, public-markets, ai, fintech, cybersecurity, crypto, climate-sustainability, health-wellness-biotech, quarterly-and-annual-reports, sales-marketing). Empty = ALL.
date_rangestringtoday (last 24h), week (last 7d), month (last 30d), or all. Default week.
keyword_filterarrayCase-insensitive substrings matched against title + excerpt. Examples: ["Series A","Series B"], ["IPO","SPAC"], ["acquires","acquired"].
include_full_textbooleanIf true, body_markdown contains the full article body and full_content=true. Default true.

All fields are optional. An empty input runs the default "last 7 days, 25 articles, with full text" preset.


๐Ÿ“ค Output schema

Each article record (one dataset row):

FieldTypeExample
titlestring"Exclusive: Xpanner Lands $18M To Offer 'Automation As A Service' To Construction Sites"
slugstring"xpanner-automation-as-a-service-for-construction-sites-..."
urlstringhttps://news.crunchbase.com/real-estate-property-tech/xpanner-...
published_atISO 8601"2026-05-14T14:00:23+00:00"
authorstring"Marlize van Romburgh"
categorystring"Artificial intelligence"
tagsarray["Manufacturing","Real estate & property tech","Robotics","Startups","unicorn"]
excerptstringFirst 200 chars of body text
mentioned_companiesarray["Xpanner"] โ€” entity-extracted from anchor links to crunchbase.com/organization/*
mentioned_investorsarray["Korea Investment Partners","KB Investment Co."] โ€” entity-extracted using investor heuristics
funding_amount_usdint/null18000000 โ€” parsed from headline + first 400 chars
round_typestring"Series B" โ€” Pre-Seed / Seed / Series Aโ€“G / Bridge / IPO / M&A
full_contentbooleantrue if body_markdown is populated
body_markdownstringFull article body in markdown (paragraphs, links, headings)
data_sourcestring"news.crunchbase.com/feed (RSS)" or "news.crunchbase.com (HTML)"

๐Ÿ†š Crunchbase News vs the alternatives

ToolCostFunding roundsM&AIPOEntity extractionStructured JSONAPI access
This Actor$0.10/articleโœ… Dailyโœ… Dailyโœ…โœ… Yesโœ… Yesโœ… Apify
Bloomberg Terminal~$30K/yr/seatโœ…โœ…โœ…โœ…โŒ Terminal-onlyLimited
Reuters Eikon / LSEG~$24K/yr/seatโœ…โœ…โœ…โœ…โŒ Terminal-onlyLimited
Refinitiv Workspace~$22K/yr/seatโœ…โœ…โœ…โœ…โŒ Terminal-onlyLimited
Mergermarket~$20K/yr/seatPartialโœ…โœ…โœ…โœ…โœ…โŒ Web-onlyโŒ
TechCrunch (free)$0โœ…โœ…โœ…โŒ UnstructuredโŒ HTML onlyโŒ
This Actor$0.10/articleโœ…โœ…โœ…โœ…โœ… JSONโœ… REST

You pay per useful row. No per-seat licence, no annual lock-in, no terminal install. Wire it into your warehouse on day one.


๐Ÿงช Quick-start examples

Daily morning brief โ€” every story in the last 24h (~12 articles, ~$1.21):

{"limit":25,"date_range":"today"}

M&A radar โ€” weekly acquisitions (~30 articles, ~$3.01):

{"category":["ma","business"],"date_range":"week","keyword_filter":["acquires","acquired","acquisition","merger"],"limit":50}

Series A/B watcher (~40 articles, ~$4.01):

{"keyword_filter":["Series A","Series B"],"date_range":"week","limit":50}

AI funding firehose (~60 articles, ~$6.01):

{"category":["ai"],"date_range":"month","include_full_text":true,"limit":100}

Headlines-only monitoring (no body โ€” cheapest, ~$2.51 for 25 rows):

{"limit":25,"date_range":"today","include_full_text":false}

Monthly backfill โ€” entire month, all categories (~150 articles, ~$15.01):

{"limit":500,"date_range":"month"}

๐Ÿงฑ How extraction works

  • Funding amount: regex over title + first 400 chars of body, matching patterns like $5M, $1.2 billion, $500K. Converted to USD integers ($18M โ†’ 18000000).
  • Round type: pattern-matched against Pre-Seed, Seed, Series A through Series G, Bridge, IPO, M&A. First match wins, ordered by specificity.
  • Mentioned companies vs investors: anchor links inside the body pointing at crunchbase.com/organization/* are classified by sentence context โ€” anchors near led by, backed by, invested, or whose link text contains Ventures / Capital / Partners / Fund are tagged as investors; everything else is the operating company.
  • Categories & tags: pulled from the WordPress taxonomy attached to each post (multiple per article โ€” an AI fintech story might be tagged AI, Fintech, Startups).
  • Date filtering: applied against the published-at timestamp from RSS / <meta property="article:published_time">. Articles without parseable dates are kept (rather than silently dropped).

๐Ÿชœ Source strategy & rate-limit posture

  1. Primary โ€” news.crunchbase.com/feed/ RSS. Server-rendered, 10 freshest articles, full body in content:encoded. Zero anti-bot, polite User-Agent.
  2. Pagination โ€” when limit > 10, we walk /sections/<slug>/page/N/ HTML pages, harvest article URLs, fetch each one individually with a 400ms delay.
  3. Headers โ€” desktop Chrome User-Agent, no JS execution required (WordPress is server-rendered).
  4. Failure modes โ€” if a category index 404s, we fall back to the all-category feed and filter client-side. Articles with missing fields keep best-effort partial records (no silent drops).

Crunchbase News is a WordPress site, not the locked-down Crunchbase data platform. The data platform requires a $999+/mo enterprise contract and aggressive anti-bot. The news site is editorially free, publicly indexable, and the legal grey-zone risk is the same as scraping any open WordPress blog.


๐Ÿ”— Sister actors in the NexGenData fleet

If Crunchbase News headlines are useful to you, these adjacent Actors are the natural pair:

  • Startup Funding Tracker โ€” round-by-round funding events with lead investor, valuation, post-money. The #1 companion to this Actor: news headlines tell you what happened, this tells you how much and who led.
  • YC Companies Directory Scraper โ€” Y Combinator alumni, every batch since S05 (5,000+ companies). When a YC company shows up in Crunchbase News, this Actor gives you the rest of their profile.
  • Techstars Companies Directory โ€” Techstars accelerator alumni (5,591 companies, 128 programs). Same play as YC: enrich news mentions with full founder/program/cohort context.
  • SEC Form 8-K Material Events Scraper โ€” real-time SEC 8-K filings (acquisitions, departures, material events). Crunchbase News covers private-market events; 8-K covers the public-market regulatory disclosures.
  • IPO Tracker โ€” upcoming + recent IPOs with lockup expirations, pricing, valuations. Crunchbase News announces the IPO; this Actor gives you the structured pricing data.
  • Finance MCP Server โ€” Claude / ChatGPT MCP server bundling the entire NexGenData finance fleet so an LLM can call any of these Actors as a tool.

Stack four of these and you've replicated a Bloomberg + PitchBook + CB Insights workflow for under $100/month.


๐Ÿค Affiliate / support

Built and maintained by NexGenData โ€” leave a star or a review on Apify Console.

Need a custom slice (Crunchbase News + Form D + LinkedIn enrichment for outbound)? Email scrapers@thenextgennexus.com โ€” bulk pricing, white-label feeds, and webhook delivery available.

โ–ถ๏ธ Try this Actor โ†’ โ€” pay-per-article, $0.10 each.


๐Ÿ“ฐ The NexGenData Newswire & News Suite

Don't monitor one wire โ€” cover them all. Pair this with the rest of the suite for complete PR, press-release, and news coverage from a single vendor with one consistent output schema.

Press-release wires

News & headlines

  • AP News โ€” Associated Press breaking news & articles
  • BBC News โ€” global BBC headlines & articles
  • Google News โ€” aggregated headlines & trending topics
  • Hacker News โ€” tech & startup stories and discussion
  • Crunchbase News โ€” funding rounds, M&A & startup headlines (โ† you are here)

Regional / regulatory

About NexGenData

NexGenData publishes 220+ buyer-intent actors covering SEC filings, YC alumni, Delaware DOC, global stock screeners across 30+ exchanges, IPO calendars, IP and patent intelligence, FDA approvals, B2B lead generation, and more. Every actor is pay-per-result with no seat licensing.

Apify affiliate program โ€” free credits + 30% off

Sign up to Apify via our referral link and you'll get:

  • Free starter credits to test this actor and the rest of our 220+ actor fleet
  • 30% off platform fees for the life of your account

Browse the full NexGenData catalog and sign up here โ€” same Apify, same actors, just cheaper for you.

Built and maintained by NexGenData.

You might also like

Crunchbase News Scraper

crawlerbros/crunchbase-news-scraper

Extract startup, funding, M&A, and tech news articles from news.crunchbase.com like title, content, author, date, categories, tags, featured image. Uses the public WordPress REST API. No proxy required.

10

Fundraising Scraper: TechCrunch, Crunchbase & FinSMEs Tracker

complex_intricate_networks/fundraising-and-startup-funding-scraper

Track new startup funding rounds daily. Scrapes company names, amounts, and rounds from TechCrunch, Crunchbase News, and FinSMEs for B2B lead generation.

Crunchbase Scraper [$8๐Ÿ’ฐ] โ€” Companies + Funding Rounds

memo23/crunchbase-scraper

Crunchbase scraper returning ONE clean structured row per company โ€” funding, people, M&A, tech stack, traffic, IT spend, growth/IPO predictions โ€” not a 1,500-line raw blob. Now also reads Discover saved-search URLs for funding-round signals. Cloudflare bypass built in, no token. $8/1k.

๐Ÿ‘ User avatar

Muhamed Didovic

24

Crunchbase Company Scraper โ€” Profiles, Funding & Investors

bovi/crunchbase-scraper

Scrape Crunchbase company profiles: name, description, categories, founding date, headquarters, headcount, website, total funding, funding rounds, investors, and more. Uses the official Crunchbase REST API v4 โ€” structured, reliable data. Free API key required (data.crunchbase.com). Pay per result.

๐Ÿ‘ User avatar

Vitalii Bondarev

2

Crunchbase Scraper - Funding Rounds, Companies & Investors

jungle_synthesizer/crunchbase-pro-companies-scraper

Scrape Crunchbase funding rounds, companies, and investors. Three modes: row-per-round (lead investors, amounts, valuations), row-per-company, row-per-investor. Filter by category, location, round type, amount, and date.

๐Ÿ‘ User avatar

BowTiedRaccoon

41

๐Ÿš€ Startup Funding Tracker โ€” SEC Filings, TechCrunch & YC

nexgendata/startup-funding-tracker

Track startup funding rounds from SEC EDGAR filings, TechCrunch, and Y Combinator. Crunchbase alternative using 100% public/legal data. Filter by amount, industry, date. Competitive intelligence for VCs, sales teams & researchers.

Venture Capital & Startup News Intelligence

visita/venture-capital-startup-intelligence

๐Ÿš€ Turn noisy news into structured Deal Flow. Scrapes top sources (TechCrunch, Sifted) and uses AI to extract Deal Size ๐Ÿ’ฐ, Round Stage, and Investors ๐Ÿค. Perfect for tracking funding rounds and M&A activity.

๐Ÿ‘ User avatar

Visita Intelligence

48