Substack Newsletter Scraper

Pricing

Pay per event

Substack Newsletter Scraper

Extract comprehensive Substack newsletter data including author profiles, subscriber counts, social media links, and contact information for B2B outreach and market research.

Pricing

Pay per event

Rating

4.0

(1)

Developer

👁 Akram

Akram

Maintained by Community

Actor stats

Bookmarked

Total users

Monthly active users

12 hours

Issues response

a month ago

Last modified

Substack Newsletter Subscriber & Engagement Scraper

A comprehensive Apify Actor for extracting Substack publication details, subscriber counts (when visible), and author contact information. Perfect for B2B outreach to newsletter creators, market research, and competitive analysis.

🚀 Key Features

📊 Publication Metadata: Extract title, description, author info, author ID, categories, and subscriber counts (when publicly visible)
📧 Email Discovery: Automatically find author contact emails from About pages and other content
🔗 Social Media Links: Extract Twitter, LinkedIn, Facebook, Instagram, and YouTube profiles
📝 Flexible Post Scraping: Choose from 3 modes - no posts, post metadata only, or full article content
📈 Engagement Metrics: Extract likes, comments, and word counts for each post
⚡ Batch Processing: Process multiple newsletters in a single run
💰 Pay-per-Event: Charged per newsletter processed (successful or failed)

📋 Input Parameters

Required

newsletterUrls (array): List of Substack newsletter URLs to scrape
- Example: [{"url": "https://platformer.substack.com"}, {"url": "https://lennysnewsletter.com"}]
- Supports both *.substack.com URLs and custom domains
- Process multiple newsletters in a single run

Optional

postScrapingMode (string, default: "information_and_content"): Controls what post data to extract
- "none" - Newsletter metadata only (fastest, minimal data usage)
- "information" - Newsletter metadata + post info (title, date, engagement, preview)
- "information_and_content" - Newsletter metadata + post info + full article content
maxPostsPerNewsletter (number, default: 12, max: 12): Number of recent posts to scrape per newsletter (only applies when postScrapingMode is not "none")
delayBetweenRequests (number, default: 3000, range: 500-10000): Delay in milliseconds between HTTP requests to avoid rate limiting

💡 Example Input

{
"newsletterUrls":[
{"url":"https://platformer.substack.com"},
{"url":"https://lennysnewsletter.com"},
{"url":"https://andrewsullivan.substack.com"}
],
"postScrapingMode":"information_and_content",
"maxPostsPerNewsletter":12,
"delayBetweenRequests":3000
}

📊 Output Structure

The output structure varies based on the postScrapingMode setting:

Mode 1: `"none"` - Newsletter Metadata Only

{
"author_id":241262,
"author_name":"Casey Newton",
"author_handle":"platformer",
"author_bio":"Casey Newton is the founder and editor of Platformer...",
"author_photo_url":"https://substack-post-media.s3.amazonaws.com/...",
"author_follower_count":209469,
"email":"casey@platformer.news",
"author_twitter_url":"https://twitter.com/CaseyNewton",
"author_linkedin_url":null,
"author_facebook_url":"https://www.facebook.com/...",
"author_instagram_url":"https://instagram.com/crumbler",
"website_url":"https://www.platformer.news",
"publication_id":7976,
"publication_name":"Platformer",
"publication_subdomain":"platformer",
"publication_url":"https://www.platformer.news",
"publication_description":"News at the intersection of Silicon Valley and democracy...",
"publication_logo_url":"https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984...",
"publication_custom_domain":"www.platformer.news",
"publication_created_at":"2019-03-29T13:28:21.009Z",
"publication_subscriber_count":176000,
"publication_subscriber_count_string":"176K+",
"publication_subscriber_count_visible":true,
"publication_is_paid":false,
"publication_payments_enabled":false,
"publication_founding_plan_name":"Mystery Tier",
"publication_has_posts":true,
"author_is_active":true,
"author_has_likes":true,
"author_profile_set_up_at":"2021-04-22T18:51:48.648Z",
"author_profile_disabled":false,
"author_is_verified":false,
"scraped_at":"2025-11-07T19:19:13.110957+00:00"
}

Mode 2: `"information"` - Newsletter + Post Information

Includes all fields from Mode 1, plus a posts array:

{
// ... all newsletter metadata from Mode 1 ...
"posts":[
{
"url":"https://www.platformer.news/trump-won-heres-what-comes-next/",
"title":"Trump won. Here's what comes next.",
"subtitle":"What a second Trump administration could mean for the internet",
"date":"2024-11-07",
"author":"Casey Newton",
"likes":13,
"comments_count":43,
"content_preview":"Good morning, and wow. Donald Trump won a second term as president...",
"word_count":2051
}
// ... up to maxPostsPerNewsletter posts ...
]
}

Mode 3: `"information_and_content"` - Full Article Content

Includes all fields from Mode 2, plus full content for each post:

{
// ... all newsletter metadata ...
"posts":[
{
"url":"https://www.platformer.news/trump-won-heres-what-comes-next/",
"title":"Trump won. Here's what comes next.",
"subtitle":"What a second Trump administration could mean for the internet",
"date":"2024-11-07",
"author":"Casey Newton",
"likes":13,
"comments_count":43,
"content_preview":"Good morning, and wow. Donald Trump won a second term as president...",
"word_count":2051,
"content":"Good morning, and wow. Donald Trump won a second term as president...\n\n[Full 2000+ word article text extracted here]\n\n..."
}
// ... up to maxPostsPerNewsletter posts with full content ...
]
}

🔧 Use Cases

B2B Outreach: Build targeted lists of newsletter creators with contact information for partnership opportunities
Market Research: Analyze newsletter subscriber counts, engagement metrics, and publishing frequency
Competitive Analysis: Track competitor newsletters, their growth, and content strategies
Content Analysis: Extract full article content for competitive research, trend analysis, or content strategy insights
Lead Generation: Find potential customers or partners in specific niches based on newsletter topics and engagement
Training Data Collection: Gather high-quality written content for AI/ML applications (respect copyright and usage rights)
Publishing Patterns: Analyze posting frequency, optimal publishing times, and content length strategies
Engagement Research: Study which topics, titles, and content formats drive the most likes and comments

💰 Pricing

This Actor uses pay-per-event pricing:

You're charged per newsletter URL processed, whether extraction succeeds or fails
Failed attempts (invalid URLs, inaccessible newsletters, parsing errors) are charged the same rate — compute is consumed regardless of outcome
Validate your URLs before submitting to avoid charges on dead or invalid newsletters
Transparent usage-based billing

🏷️ Tags

Newsletter scraping, Substack, B2B outreach, lead generation, market research, email discovery, social media extraction

👁 Substack Leaderboard Scraper 📊 avatar

Substack Leaderboard Scraper 📊

easyapi/substack-leaderboard-scraper

Scrape detailed publication data from Substack leaderboards. Get comprehensive insights about top newsletters including subscriber counts, pricing, author details, and more. Perfect for newsletter research and market analysis.

👁 User avatar

EasyApi

👁 Substack Scraper — Posts, Authors & Newsletters avatar

Substack Scraper — Posts, Authors & Newsletters

cryptosignals/substack-scraper

Extract Substack newsletter content. Get post titles, authors, publish dates, paywall status, subscriber counts, and full article text. Ideal for newsletter research and content monitoring. PPE pricing — pay only for results.

👁 User avatar

Web Data Labs

👁 Substack Scraper | All-In-One avatar

Substack Scraper | All-In-One

fatihtahta/substack-scraper

Get full articles, user profiles, and search results with All-in-One Substack Scraper. Extract rich data including titles, bios, subscriber counts, social links and engagement metrics. ideal for market research, creator discovery, trend tracking, and audience analysis.

👁 User avatar

Fatih Tahta

136

👁 Substack Posts Scraper 📚 avatar

Substack Posts Scraper 📚

easyapi/substack-posts-scraper

Scrape Substack posts and articles by keywords. Extract comprehensive post data including title, author, publication details, podcast information, reactions, and more. Perfect for content analysis and research.

👁 User avatar

EasyApi

172

1.9

👁 Substack Scraper avatar

Substack Scraper

qpayre/substack-scraper

The Substack Author Scraper is a powerful Apify actor that makes it easy for content creators to scrape and retrieve all posts from their favorite Substack authors. With structured data presented in a user-friendly format, analyzing and processing valuable information has never been easier.

👁 User avatar

QPS

453

👁 Substack Scraper avatar

Substack Scraper

scraper_guru/substack-scraper

Extract complete data from Substack newsletters including posts, authors, engagement metrics, and article text. 13 fields per post. Fast and reliable.

👁 User avatar

LIAICHI MUSTAPHA

2.6

👁 YouTube Video Details Scraper avatar

YouTube Video Details Scraper

maged120/youtube-video-details

Extract full metadata from any YouTube video or Short — title, views, likes, comments, subtitles, chapters, tags, and more. No YouTube API key needed.

👁 User avatar

Maged

👁 Substack Scraper avatar

Substack Scraper

automation-lab/substack-scraper

Scrape Substack newsletters — posts, comments, publication metadata. Full archive depth with no caps. Export to JSON, CSV, Excel, or connect via API.

👁 User avatar

Stas Persiianenko

189