Beehiiv Newsletter Scraper

Pricing

Pay per event

Beehiiv Newsletter Scraper

Scrape posts from any beehiiv-powered newsletter. Input publication domains — the actor discovers post URLs via sitemap and extracts title, author, publish date, excerpt, cover image, tags, and word count. Supports multi-newsletter fan-out in a single run.

Pricing

Pay per event

Rating

0.0

(0)

Developer

👁 BowTiedRaccoon

BowTiedRaccoon

Maintained by Community

Actor stats

Bookmarked

Total users

Monthly active users

17 days ago

Last modified

What it does

The actor accepts a list of beehiiv publication domains (e.g. readthepeak.com, discover.beehiiv.com) and for each domain:

Fetches <domain>/sitemap.xml to discover all public post URLs matching the /p/<slug> pattern.
Crawls each post page and extracts structured data from the embedded JSON-LD Article schema.
Yields one record per post with all metadata fields.

Publications that sit behind Cloudflare or other anti-bot measures are gracefully skipped with a warning. Free posts are scraped; paywalled posts (where isAccessibleForFree: false in JSON-LD) are automatically skipped.

Input

Parameter	Type	Description
`domains`	array	List of publication domains. Accepts bare domains (`readthepeak.com`), subdomains (`mybrand.beehiiv.com`), or full URLs (`https://readthepeak.com`).
`maxItems`	integer	Maximum posts to scrape per publication (0 = unlimited). Default: 10.

Example input:

{
"domains":["readthepeak.com","discover.beehiiv.com"],
"maxItems":50
}

Output

Each record contains:

Field	Description
`publication_domain`	Input domain (e.g. `readthepeak.com`)
`publication_name`	Newsletter name from JSON-LD publisher
`post_url`	Canonical post URL
`post_title`	Post headline
`post_subtitle`	Post subtitle / description
`author`	Author name
`publish_date`	ISO 8601 publish timestamp
`excerpt`	Short description (up to 300 chars)
`cover_image_url`	Cover image URL
`word_count`	Estimated word count of post body
`tags`	Comma-separated tags
`full_text`	Full post body text (empty unless `include_full_text` is set)
`scraped_at`	ISO 8601 scrape timestamp

Example output record:

{
"publication_domain":"readthepeak.com",
"publication_name":"The Peak",
"post_url":"https://www.readthepeak.com/p/canadian-universities-are-falling-behind",
"post_title":"Canadian universities are falling behind",
"post_subtitle":"Canada's post-secondary schools are losing their edge.",
"author":"Lucas Arender",
"publish_date":"2026-06-02T10:00:00.000Z",
"excerpt":"Canada's post-secondary schools are losing their edge.",
"cover_image_url":"https://beehiiv-images-production.s3.amazonaws.com/...",
"word_count":291,
"tags":"Water Cooler, Perspectives",
"full_text":"",
"scraped_at":"2026-06-02T20:39:48.116Z"
}

Limitations

Publications behind Cloudflare or PerimeterX (e.g. some high-traffic custom domains) will return a warning and be skipped. Use a different domain format if the publication has a *.beehiiv.com subdomain that is not CF-walled.
Paywalled posts (subscriber-only) are detected via JSON-LD and automatically skipped.
Publications without a sitemap.xml or with no /p/ posts in their sitemap are skipped.
full_text extraction is best-effort — post body selectors may vary slightly across beehiiv themes.

👁 Beehiiv Newsletter Archive Scraper avatar

Beehiiv Newsletter Archive Scraper

parseforge/beehiiv-newsletter-scraper

Pull every public post from one or many Beehiiv newsletters: title, description, image, publish date, author, word count, and excerpt. Discover via the public sitemap, fan across multiple newsletters, filter by keyword. Export to JSON, CSV, or Excel for newsletter research and content trends.

👁 User avatar

ParseForge

👁 Beehiiv Newsletter Discovery Scraper avatar

Beehiiv Newsletter Discovery Scraper

crawlerbros/beehiiv-newsletter-scraper

Discover and scrape newsletters from Beehiiv's public directory. Browse the full newsletter catalog, get detailed newsletter profiles by URL or subdomain, or extract recent posts from any Beehiiv newsletter. No login required

👁 User avatar

Crawler Bros

👁 Buttondown Newsletter Archive Scraper avatar

Buttondown Newsletter Archive Scraper

jungle_synthesizer/buttondown-newsletter-archive-scraper

Scrape posts from any Buttondown newsletter publication. Input a list of publication usernames and get back every post with title, date, excerpt, cover image, tags, and optional full body text. Supports multi-publication fan-out. No login required.

👁 User avatar

BowTiedRaccoon

👁 Beehiiv Newsletter Scraper - Posts & Authors avatar

Beehiiv Newsletter Scraper - Posts & Authors

elliotpadfield/beehiiv-newsletter-scraper

Scrape public Beehiiv newsletters by publication URL, custom domain, sitemap, or post URL. Extract posts, authors, full text, HTML, markdown, images, outbound links, sponsor links, and publication metadata.

👁 User avatar

Elliot Padfield

Substack Newsletter Scraper

cloud9_ai/substack-scraper

Scrape posts from any Substack newsletter publication. Returns post titles, URLs, publish dates, authors, and content previews via RSS feed.

👁 User avatar

cloud9

👁 Beehiiv Newsletter Scraper avatar

Beehiiv Newsletter Scraper

scraper_guru/beehiiv-scraper

Extract complete data from Beehiiv newsletters including posts, authors, engagement metrics, and full article HTML/text. Fast native API discovery & PerimeterX bypass

👁 User avatar

LIAICHI MUSTAPHA

👁 Substack Newsletter Scraper avatar

Substack Newsletter Scraper

boundary/substack-newsletter-scraper

Scrape Substack newsletter posts — titles, content, reactions, comments, tags, and author data. Supports custom domains. No login needed.

👁 User avatar

Boundary

👁 Newsletter Scraper — Substack, Beehiiv, Ghost Archives avatar

Newsletter Scraper — Substack, Beehiiv, Ghost Archives

benthepythondev/newsletter-scraper

Extract newsletter archives from Substack, Beehiiv, and Ghost platforms. Get full content in markdown format, complete metadata, embedded images, word counts, and AI-ready token counts. Perfect for content research, competitive analysis, and training AI models.

👁 User avatar

ben

Newsletter Archiver: Substack, Beehiiv, Ghost, RSS

aitoolbreakdown/atb-newsletter-archiver

Point it at a newsletter's public RSS feed. Returns every post as structured JSON: title, date, full HTML + plaintext, author, URL. Works with Substack, Beehiiv, Ghost, and any Atom/RSS 2.0 feed.

👁 User avatar

AI Tool Breakdown

Substack Newsletter Scraper

red.cars/substack-newsletter-scraper

Extract newsletter content, subscriber data, and author insights from any Substack publication - no API key required!

👁 User avatar

AutomateLab

1.0

URL: https://apify.com/jungle_synthesizer/beehiiv-newsletter-scraper

⇱ Beehiiv Newsletter Scraper · Apify

Beehiiv Newsletter Scraper

What it does

Input

Output

Limitations

You might also like

Beehiiv Newsletter Archive Scraper

Beehiiv Newsletter Discovery Scraper

Buttondown Newsletter Archive Scraper

Beehiiv Newsletter Scraper - Posts & Authors

Substack Newsletter Scraper

Beehiiv Newsletter Scraper

Substack Newsletter Scraper

Newsletter Scraper — Substack, Beehiiv, Ghost Archives

Newsletter Archiver: Substack, Beehiiv, Ghost, RSS

Substack Newsletter Scraper