VOOZH about

URL: https://apify.com/parseforge/beehiiv-newsletter-scraper

โ‡ฑ Beehiiv Newsletter Scraper ยท Posts, Authors, Word Counts ยท Apify


๐Ÿ‘ Beehiiv Newsletter Archive Scraper avatar

Beehiiv Newsletter Archive Scraper

Pricing

from $8.25 / 1,000 items

Go to Apify Store

Beehiiv Newsletter Archive Scraper

Pull every public post from one or many Beehiiv newsletters: title, description, image, publish date, author, word count, and excerpt. Discover via the public sitemap, fan across multiple newsletters, filter by keyword. Export to JSON, CSV, or Excel for newsletter research and content trends.

Pricing

from $8.25 / 1,000 items

Rating

0.0

(0)

Developer

๐Ÿ‘ ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

7

Total users

2

Monthly active users

25 days ago

Last modified

Share

๐Ÿ‘ ParseForge Banner

๐Ÿ Beehiiv Newsletter Scraper

๐Ÿš€ Pull every public post from one or many Beehiiv newsletters. Multi-source fanout reaches 100+ posts. No login, no API key, no manual scrolling.

๐Ÿ•’ Last updated: 2026-05-01 ยท ๐Ÿ“Š 10 fields per post ยท ๐Ÿ 50K+ newsletters on Beehiiv ยท โšก multi-newsletter fanout ยท ๐Ÿ†“ sitemap-based discovery

The Beehiiv Newsletter Scraper discovers post URLs via each newsletter's public sitemap and returns title, description, image, publish date, author, word count, excerpt, slug, canonical URL, and scrape timestamp per post. Provide one or many newsletters and the scraper fans across all of them, parallelizing fetches to reach 100+ posts in under two minutes.

Beehiiv powers more than 50,000 newsletters from creators, media brands, and SaaS companies. The platform's growth has put it second only to Substack in the creator-economy newsletter space. This Actor exposes the full post history of any Beehiiv-hosted newsletter as clean structured data for content research, cadence analysis, and competitive benchmarking.

๐ŸŽฏ Target Audience๐Ÿ’ก Primary Use Cases
Newsletter operators, content strategists, marketers, founders, content writersNewsletter research, content gap analysis, competitive benchmarking, audience discovery

๐Ÿ“‹ What the Beehiiv Newsletter Scraper does

Four filtering workflows in a single run:

  • ๐Ÿ Multi-newsletter fanout. Submit an array of newsletter URLs and the Actor walks each sitemap.
  • ๐Ÿ“‘ Sitemap discovery. Each newsletter's /sitemap.xml lists all public posts; the Actor filters to real post URLs.
  • ๐Ÿ” Keyword filter. Substring match on post URL to narrow by topic across all newsletters.
  • โšก Parallel fetch. Up to 8 concurrent post fetches with retry logic to keep total run time low.

Each row reports the post URL, slug, title from og:title, description from og:description, image from og:image, publish date from article:published_time, author from article:author, word count, 600-character excerpt, canonical URL, and scrape timestamp.

๐Ÿ’ก Why it matters: Beehiiv newsletters are a fast-growing slice of independent media. Top writers cross 100k subscribers and influence narratives in finance, tech, and AI. The platform exposes a clean public sitemap on every newsletter, which makes structured discovery practical without browser automation.


๐ŸŽฌ Full Demo

๐Ÿšง Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded dataset.


โš™๏ธ Input

InputTypeDefaultBehavior
maxItemsinteger10Posts to return. Free plan caps at 10, paid plan at 1,000,000.
newsletterUrlsarray of strings6 default Beehiiv newslettersNewsletter homepages. The Actor reads each sitemap.
keywordFilterstringemptySubstring filter on post URL slug. Case-insensitive.

Example: 100 posts from a single newsletter.

{
"maxItems":100,
"newsletterUrls":["https://www.therundown.ai"]
}

Example: 100 posts about AI across multiple newsletters.

{
"maxItems":100,
"newsletterUrls":[
"https://www.therundown.ai",
"https://newsletter.theaibreak.com",
"https://aiweekly.com"
],
"keywordFilter":"ai"
}

โš ๏ธ Good to Know: post body text on Beehiiv is paywall-aware; paid posts return only the free preview portion. Posts are URL-encoded by the publisher; the Actor extracts metadata from OG tags rather than relying on platform-specific JSON.


๐Ÿ“Š Output

Each post record contains 10 fields. Download as CSV, Excel, JSON, or XML.

๐Ÿงพ Schema

FieldTypeExample
๐Ÿ”— urlstring"https://www.therundown.ai/ai-for-marketers"
๐Ÿ”– slugstring"ai-for-marketers"
๐Ÿ“ฐ titlestring"AI for Marketers | The Rundown AI"
๐Ÿ“ descriptionstring | null"Get the latest AI news..."
๐Ÿ–ผ๏ธ imagestring | null"https://media.beehiiv.com/cdn-cgi/..."
๐Ÿ“… publishedAtISO 8601 | null"2026-04-15T13:00:00.000Z"
โœ๏ธ authorstring | null"Rowan Cheung"
๐Ÿ“Š wordCountinteger1842
๐Ÿ’ฌ excerptstring | nullFirst 600 chars of body text
๐Ÿ”— canonicalstring | null"https://www.therundown.ai/ai-for-marketers"
๐Ÿ•’ scrapedAtISO 8601"2026-05-01T01:38:21.144Z"

๐Ÿ“ฆ Sample records


โœจ Why choose this Actor

Capability
๐Ÿ†“No login. Reads public sitemaps and post HTML, no auth.
๐ŸMulti-newsletter fanout. Submit many newsletters, get aggregated results in one run.
โšกParallel fetch. Up to 8 concurrent fetches with retry.
๐Ÿ”Keyword filter. Cross-newsletter substring match on slug.
๐Ÿ“ŠWord count and excerpt. Quick sense of post length and tone.
๐Ÿš€Sub-2-minute runs. Typical 100-post pull from 6 newsletters finishes in around 73 seconds.
๐Ÿท๏ธOG metadata. Title, description, image pulled from standard meta tags.

๐Ÿ“Š In a single 73-second run the Actor returned 100 posts across 6 default Beehiiv newsletters.


๐Ÿ“ˆ How it compares to alternatives

ApproachCostCoverageRefreshFiltersSetup
Manual subscribe + scrollFreeLimited per sessionOne-shotNoneAccount per newsletter
RSS readersFreeLatest 20 onlyLiveNonePer-feed setup
Generic web scrapers$$ subscriptionBrittle CSSDailyNoneEngineer hours
โญ Beehiiv Newsletter Scraper (this Actor)Pay-per-eventFull sitemapLiveKeyword, multi-sourceNone

Same sitemap and per-post HTML Beehiiv publishes for search engines, exposed as structured rows.


๐Ÿš€ How to use

  1. ๐Ÿ†“ Create a free Apify account. Sign up here and get $5 in free credit.
  2. ๐Ÿ” Open the Actor. Search for "Beehiiv Newsletter" in the Apify Store.
  3. โš™๏ธ Add newsletter URLs. One or many Beehiiv newsletter homepages.
  4. โ–ถ๏ธ Click Start. A 100-post run typically completes in 60 to 90 seconds.
  5. ๐Ÿ“ฅ Download. Export as CSV, Excel, JSON, or XML.

โฑ๏ธ Total time from sign-up to first dataset: under five minutes.


๐Ÿ’ผ Business use cases

๐Ÿ“ฐ Newsletter operators

  • Reverse-engineer top newsletter cadence and word count
  • Compare image and headline strategies
  • Track competitor launch announcements
  • Build editorial calendars from real archives

๐Ÿ“Š Content strategy

  • Mine high-engagement post angles for inspiration
  • Map topic mix per newsletter over time
  • Identify gap topics audiences ask for
  • Benchmark your own newsletter against operators in the same niche

๐Ÿ“ฐ Journalism

  • Track newsletter consolidation and migrations
  • Cite specific posts with stable canonical URLs
  • Pull background research from creator archives
  • Cross-reference posts with public reactions

๐Ÿค– AI training

  • Build domain corpora from creator content
  • Train style models on author-specific archives
  • Generate retrieval datasets for newsletter chat agents
  • Power summarization tools across publications

๐ŸŒŸ Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

๐ŸŽ“ Research and academia

  • Empirical datasets for papers, thesis work, and coursework
  • Longitudinal studies tracking changes across snapshots
  • Reproducible research with cited, versioned data pulls
  • Classroom exercises on data analysis and ethical scraping

๐ŸŽจ Personal and creative

  • Side projects, portfolio demos, and indie app launches
  • Data visualizations, dashboards, and infographics
  • Content research for bloggers, YouTubers, and podcasters
  • Hobbyist collections and personal trackers

๐Ÿค Non-profit and civic

  • Transparency reporting and accountability projects
  • Advocacy campaigns backed by public-interest data
  • Community-run databases for local issues
  • Investigative journalism on public records

๐Ÿงช Experimentation

  • Prototype AI and machine-learning pipelines with real data
  • Validate product-market hypotheses before engineering spend
  • Train small domain-specific models on niche corpora
  • Test dashboard concepts with live input

๐Ÿ”Œ Automating Beehiiv Newsletter Scraper

Run this Actor on a schedule, from your codebase, or inside another tool:

Schedule daily, weekly, or monthly runs from the Apify Console. Pipe results into Google Sheets, S3, BigQuery, or your own webhook with the built-in integrations.


๐Ÿค– Ask an AI assistant about this scraper

Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:


โ“ Frequently Asked Questions

๐Ÿ What newsletters does this support?

Any Beehiiv-hosted newsletter, whether it sits on slug.beehiiv.com or a custom domain. The Actor reads the public /sitemap.xml exposed for search engines.

๐Ÿ” How do I find the right newsletter URL?

Use the homepage URL of the newsletter as it appears in the browser. Custom domains and beehiiv.com subdomains both work.

๐Ÿ“ฆ How many newsletters can I submit at once?

The newsletterUrls array has no hard cap. The Actor walks each sitemap in order until maxItems is reached.

๐Ÿ“… Why do some posts have no publishedAt?

Beehiiv only emits article:published_time on actual newsletter posts. Topic pages and course chapters often omit it. The Actor returns null rather than guessing.

๐Ÿ“ Does it return full body content?

The excerpt field gives the first 600 characters of body text. Full body extraction would require a different parsing strategy and is not the primary goal of this Actor.

๐Ÿ”  What is the difference between url, slug, and canonical?

url is the URL the Actor visited. slug is the path portion of that URL. canonical is the URL Beehiiv declares as canonical via <link rel="canonical">. They usually match.

๐Ÿ›ก๏ธ Does this work with paid Beehiiv tiers?

The Actor reads only the public preview HTML. Paid posts return their free preview text in the excerpt. Subscriber-only full content is not exposed by the Actor.

๐Ÿ’ผ Can I use this for commercial work?

Yes. The Actor reads the public sitemap and the public OG tags Beehiiv emits. Always honor each newsletter's terms when republishing content.

๐Ÿ’ณ Do I need a paid Apify plan?

The free plan returns up to 10 posts per run. Paid plans return up to 1,000,000.

โš ๏ธ What if a newsletter sitemap returns nothing?

A small number of Beehiiv newsletters disable their sitemap or move to a non-standard path. Confirm /sitemap.xml opens in a browser. Open a contact form with the newsletter URL if a real one returns empty.

๐Ÿ” How fresh is the data?

Live. Each run hits each newsletter's sitemap and post HTML at run time.

โš–๏ธ Is this legal?

Yes. The Actor reads sitemaps explicitly published for search engine indexing and OG meta tags emitted by every public post.


๐Ÿ”Œ Integrate with any app

  • Make - drop run results into 1,800+ apps.
  • Zapier - trigger automations off completed runs.
  • Slack - post run summaries to a channel.
  • Google Sheets - sync each run into a spreadsheet.
  • Webhooks - notify your own services on run finish.
  • Airbyte - load runs into Snowflake, BigQuery, or Postgres.

๐Ÿ”— Recommended Actors

๐Ÿ’ก Pro Tip: browse the complete ParseForge collection for more pre-built scrapers and data tools.


๐Ÿ†˜ Need Help? Open our contact form and we'll route the question to the right person.


Beehiiv is a registered trademark of Beehiiv, Inc. This Actor is not affiliated with or endorsed by Beehiiv. It reads only the public sitemap and OG meta tags every Beehiiv newsletter exposes for search engines.

You might also like

Beehiiv Newsletter Scraper

jungle_synthesizer/beehiiv-newsletter-scraper

Scrape posts from any beehiiv-powered newsletter. Input publication domains โ€” the actor discovers post URLs via sitemap and extracts title, author, publish date, excerpt, cover image, tags, and word count. Supports multi-newsletter fan-out in a single run.

๐Ÿ‘ User avatar

BowTiedRaccoon

2

Beehiiv Newsletter Discovery Scraper

crawlerbros/beehiiv-newsletter-scraper

Discover and scrape newsletters from Beehiiv's public directory. Browse the full newsletter catalog, get detailed newsletter profiles by URL or subdomain, or extract recent posts from any Beehiiv newsletter. No login required

Beehiiv Newsletter Scraper - Posts & Authors

elliotpadfield/beehiiv-newsletter-scraper

Scrape public Beehiiv newsletters by publication URL, custom domain, sitemap, or post URL. Extract posts, authors, full text, HTML, markdown, images, outbound links, sponsor links, and publication metadata.

๐Ÿ‘ User avatar

Elliot Padfield

1

Beehiiv Newsletter Scraper

scraper_guru/beehiiv-scraper

Extract complete data from Beehiiv newsletters including posts, authors, engagement metrics, and full article HTML/text. Fast native API discovery & PerimeterX bypass

๐Ÿ‘ User avatar

LIAICHI MUSTAPHA

15

Newsletter Intelligence โ€“ Substack & Beehiiv

conceivable_extension/newsletter-intelligence

Monitors Substack and Beehiiv newsletters by keyword or author, extracts post metadata and engagement signals, detects advertising slots and sponsorship mentions, and exports structured data for competitor analysis and media buying.

2

Newsletter Scraper โ€” Substack, Beehiiv, Ghost Archives

benthepythondev/newsletter-scraper

Extract newsletter archives from Substack, Beehiiv, and Ghost platforms. Get full content in markdown format, complete metadata, embedded images, word counts, and AI-ready token counts. Perfect for content research, competitive analysis, and training AI models.

InboxReads Newsletter Directory Scraper

jungle_synthesizer/inboxreads-newsletter-directory-scraper

Scrape the InboxReads newsletter directory โ€” 6,600+ cross-platform newsletters (Substack, beehiiv, Ghost, Mailchimp). Extracts name, description, topics, pricing, subscriber count, open rate, click rate, and signup URL. Filter by topic or scrape the full directory.

๐Ÿ‘ User avatar

BowTiedRaccoon

3

Substack Scraper โ€” Posts, Authors & Newsletters

cryptosignals/substack-scraper

Extract Substack newsletter content. Get post titles, authors, publish dates, paywall status, subscriber counts, and full article text. Ideal for newsletter research and content monitoring. PPE pricing โ€” pay only for results.

27

Substack Publication Scraper

parseforge/substack-publication-scraper

Pull every public post from any Substack publication with title, subtitle, body preview, author, publish date, podcast URL, audience type, comment count, and reactions. Filter by post type and date range. Export to JSON, CSV, or Excel for newsletter research and competitive intelligence.