VOOZH about

URL: https://apify.com/scraper_guru/substack-scraper

โ‡ฑ Substack Data Extractor ยท Apify


Pricing

from $0.35 / 1,000 posts

Go to Apify Store

Extract complete data from Substack newsletters including posts, authors, engagement metrics, and article text. 13 fields per post. Fast and reliable.

Pricing

from $0.35 / 1,000 posts

Rating

2.6

(2)

Developer

๐Ÿ‘ LIAICHI MUSTAPHA

LIAICHI MUSTAPHA

Maintained by Community

Actor stats

5

Bookmarked

43

Total users

2

Monthly active users

4.9 hours

Issues response

2 months ago

Last modified

Share

Substack Newsletter Scraper ๐Ÿ“ฐ

Extract complete data from Substack newsletters including posts, authors, engagement metrics, and full article text.

What does this Actor do? ๐ŸŽฏ

This actor scrapes Substack newsletters and extracts 13 data points per post:

Content Data

  • Post headline & subheading
  • Full article text (or preview for paid posts)
  • Post URL

Author Information

  • Author name
  • Author profile URL

Publishing Data

  • Publication date
  • Free vs Paid status

Engagement Metrics

  • Number of likes โค๏ธ
  • Number of comments ๐Ÿ’ฌ
  • Number of restacks ๐Ÿ”„

Perfect for ๐Ÿ’ก

  • Content Researchers - Analyze trends across newsletters
  • Competitive Analysis - Track what competitors publish
  • Data Scientists - Build training datasets
  • Writers - Research popular topics
  • AI Developers - Collect training data

Input

{
"substackUrls":[
"https://tedhope.substack.com",
"https://theankler.com"
],
"scrapingMethod":"sitemap",
"maxPostsPerSubstack":100,
"batchSize":20
}
FieldTypeRequiredDescription
substackUrlsArrayYesList of Substack URLs
scrapingMethodStringNo"sitemap" or "archive"
maxPostsPerSubstackNumberNoLimit posts (0 = unlimited)
batchSizeNumberNoSubstacks per batch

Output

{
"substack_url":"https://example.substack.com",
"post_url":"https://example.substack.com/p/post-title",
"headline":"Amazing Post Title",
"subheading":"Subtitle here",
"author_name":"John Doe",
"author_url":"https://substack.com/@johndoe",
"date":"December 10, 2024",
"free_or_paid":"Free",
"likes":156,
"comments":23,
"restacks":12,
"article_text":"Full article content...",
"content_type":"full"
}

Features โœจ

  • Two scraping methods - Sitemap (fast) or Archive (fallback)
  • Complete metadata - 13 fields per post
  • Engagement metrics - Likes, comments, restacks
  • Smart extraction - Handles paywalled content
  • Structured output - Clean JSON/CSV format

Pricing ๐Ÿ’ฐ

Memory is automatically adjusted based on your input โ€” no manual configuration needed.

ScaleApproximate CostTime
10 Substacks$0.01-0.052-5 min
100 Substacks$0.50-1.0030-60 min
1,000 Substacks$5-105-10 hours

Start with Apify's free tier - includes $5 monthly credit!

Tips ๐Ÿ’ช

  1. Test first - Try 2-3 Substacks initially
  2. Use sitemap - Faster and more reliable
  3. Set limits - Use maxPostsPerSubstack for large newsletters
  4. Schedule runs - Set up weekly/monthly automation

FAQ โ“

Q: Can I scrape paywalled content? A: You'll get previews for paid posts, not full content.

Q: How long does it take? A: ~10-30 seconds per Substack with 100 posts.

Q: What if sitemap doesn't work? A: Use "archive" method as fallback.

Q: How much memory does it use? A: Memory is allocated dynamically based on how many URLs you provide โ€” you don't need to configure anything manually. Typical runs use 512MBโ€“1GB.


Changelog ๐Ÿ“‹

v1.0.4 โ€” April 2, 2026 ๐Ÿ”ง

Bug Fixes & Performance Improvements

  • Fixed: Incomplete scraping results โ€” The scraper was mishandling Substack's nested sitemap indexes, causing runs to stop prematurely after just a few posts. It now correctly parses all sitemap levels and captures 100% of available posts.
  • Fixed: Memory misconfiguration โ€” Default memory was incorrectly set to 16GB, making runs unnecessarily expensive. Memory is now dynamically allocated based on your input (scales with number of URLs provided), with a maximum cap of 2GB. Typical single-URL runs now use ~1GB.
  • Improved: Cost efficiency โ€” For a standard single-newsletter scrape, costs are now reduced by ~94% compared to previous versions.

v1.0.0 โ€” Initial Release ๐Ÿš€

  • Full post scraping via Sitemap and Archive methods
  • 13 data fields per post including engagement metrics
  • Batch processing support
  • Paid/free post detection

Support ๐Ÿ“ง

About ๐Ÿ‘จโ€๐Ÿ’ป

Built by MUSTAPHA LIAICHI - Automation & Web Scraping Specialist


Happy Scraping! ๐Ÿš€

You might also like

Substack Scraper โ€” Posts, Authors & Newsletters

cryptosignals/substack-scraper

Extract Substack newsletter content. Get post titles, authors, publish dates, paywall status, subscriber counts, and full article text. Ideal for newsletter research and content monitoring. PPE pricing โ€” pay only for results.

27

Substack Posts Scraper ๐Ÿ“š

easyapi/substack-posts-scraper

Scrape Substack posts and articles by keywords. Extract comprehensive post data including title, author, publication details, podcast information, reactions, and more. Perfect for content analysis and research.

Substack Leaderboard Scraper ๐Ÿ“Š

easyapi/substack-leaderboard-scraper

Scrape detailed publication data from Substack leaderboards. Get comprehensive insights about top newsletters including subscriber counts, pricing, author details, and more. Perfect for newsletter research and market analysis.

Substack Scraper

automation-lab/substack-scraper

Scrape Substack newsletters โ€” posts, comments, publication metadata. Full archive depth with no caps. Export to JSON, CSV, Excel, or connect via API.

๐Ÿ‘ User avatar

Stas Persiianenko

189

Substack Notes Scraper ๐Ÿ”

easyapi/substack-notes-scraper

Extract notes and comments from Substack's search results with images, user info, and engagement metrics. Perfect for content analysis, user research, and tracking discussions around specific topics on Substack.

Substack Scraper

qpayre/substack-scraper

The Substack Author Scraper is a powerful Apify actor that makes it easy for content creators to scrape and retrieve all posts from their favorite Substack authors. With structured data presented in a user-friendly format, analyzing and processing valuable information has never been easier.

Substack Scraper | All-In-One

fatihtahta/substack-scraper

Get full articles, user profiles, and search results with All-in-One Substack Scraper. Extract rich data including titles, bios, subscriber counts, social links and engagement metrics. ideal for market research, creator discovery, trend tracking, and audience analysis.

136

Substack Newsletter Scraper

digispruce/substack-scraper

Extract comprehensive Substack newsletter data including author profiles, subscriber counts, social media links, and contact information for B2B outreach and market research.

YouTube Video Details Scraper

maged120/youtube-video-details

Extract full metadata from any YouTube video or Short โ€” title, views, likes, comments, subtitles, chapters, tags, and more. No YouTube API key needed.

Substack Scraper - Newsletters, Posts & Authors

logiover/substack-newsletter-scraper

Substack API alternative: scrape newsletters, posts & authors without login. Export Substack data to CSV/JSON. No key, no proxy.