VOOZH about

URL: https://apify.com/easyapi/substack-posts-scraper

โ‡ฑ Substack Posts Scraper ๐Ÿ“š ยท Apify


Pricing

from $4.99 / 1,000 results

Go to Apify Store

Substack Posts Scraper ๐Ÿ“š

Scrape Substack posts and articles by keywords. Extract comprehensive post data including title, author, publication details, podcast information, reactions, and more. Perfect for content analysis and research.

Pricing

from $4.99 / 1,000 results

Rating

1.9

(2)

Developer

๐Ÿ‘ EasyApi

EasyApi

Maintained by Community

Actor stats

8

Bookmarked

175

Total users

25

Monthly active users

2 months ago

Last modified

Share

Powerful scraper for extracting posts and articles from Substack based on keywords. Get detailed information about posts, publications, and authors with advanced search capabilities.

Features โœจ

  • ๐Ÿ” Search posts by keywords
  • ๐Ÿ“Š Extract comprehensive post metadata
  • ๐ŸŽ™๏ธ Support for podcast episodes data
  • ๐Ÿ‘ฅ Get author and publication details
  • โค๏ธ Capture engagement metrics (reactions, comments)
  • ๐Ÿ”„ Auto-scrolling for pagination
  • โšก High-performance with Puppeteer
  • ๐Ÿ›ก๏ธ Built-in anti-detection mechanisms

Output Data Structure ๐Ÿ“‹

The actor provides rich post data including:

  • Post title, subtitle, and description
  • Publication details
  • Author information
  • Podcast episode data (if applicable)
  • Cover images and media
  • Engagement metrics
  • Tags and categories
  • Publication timestamps
  • And much more!

Usage ๐Ÿ’ก

Simply provide:

  1. Keywords to search for
  2. Maximum number of items to scrape (optional)

The actor will automatically:

  • Search Substack for your keywords
  • Scroll through results
  • Extract detailed post information
  • Handle pagination
  • Export structured JSON data

Use Cases ๐ŸŽฏ

  • Content Research
  • Market Analysis
  • Topic Monitoring
  • Audience Engagement Analysis
  • Content Aggregation
  • Newsletter Analytics
  • Competitive Analysis

Limitations โš ๏ธ

  • Respects Substack's terms of service
  • Public posts only
  • Rate limiting applied for stability

Input Example

A full explanation of an input example in JSON.

{
"keywords":[
"ai"
],
"maxItems":50
}

Output sample

The results will be wrapped into a dataset which you can always find in the Storage tab. Here's an excerpt from the data you'd get if you apply the input parameters above:

And here is the same data but in JSON. You can choose in which format to download your data: JSON, JSONL, Excel spreadsheet, HTML table, CSV, or XML.

[
{
"keyword":"ai",
"id":156491923,
"editor_v2":false,
"publication_id":2270667,
"title":"New AI image models, free AI music generators, GPT can THINK now, new top AI models, DeepSeek Janus",
"social_title":null,
"search_engine_title":null,
"search_engine_description":null,
"type":"podcast",
"slug":"new-ai-image-models-free-ai-music",
"post_date":"2025-02-04T23:10:38.722Z",
"audience":"everyone",
"podcast_duration":2684.9436,
"video_upload_id":null,
"podcast_upload_id":"06e2c81a-16e8-4c32-a936-a1f89d596005",
"write_comment_permissions":"everyone",
"should_send_free_preview":false,
"free_unlock_required":false,
"default_comment_sort":null,
"canonical_url":"https://aisearch.substack.com/p/new-ai-image-models-free-ai-music",
"section_id":null,
"top_exclusions":[],
"pins":[],
"is_section_pinned":false,
"section_slug":null,
"section_name":null,
"reactions":{
"โค":0
},
"restacked_post_id":null,
"restacked_post_slug":null,
"restacked_pub_name":null,
"restacked_pub_logo_url":null,
"position":1,
"subtitle":"Welcome to the AI Search podcast. Here are the top highlights in AI this week.",
"cover_image":"https://substack-post-media.s3.amazonaws.com/public/images/1d231857-e4ee-468b-b626-0deb428ee7d6_1400x1400.png",
"cover_image_is_square":true,
"cover_image_is_explicit":false,
"podcast_episode_image_url":"https://substack-post-media.s3.amazonaws.com/public/images/1d231857-e4ee-468b-b626-0deb428ee7d6_1400x1400.png",
"podcast_episode_image_info":{
"url":"https://substack-post-media.s3.amazonaws.com/public/images/1d231857-e4ee-468b-b626-0deb428ee7d6_1400x1400.png",
"isDefaultArt":false,
"isDefault":false
},
"podcast_url":"https://api.substack.com/api/v1/audio/upload/06e2c81a-16e8-4c32-a936-a1f89d596005/src",
"videoUpload":null,
"podcastFields":{
"post_id":156491923,
"podcast_episode_number":null,
"podcast_season_number":null,
"podcast_episode_type":null,
"should_syndicate_to_other_feed":null,
"syndicate_to_section_id":null,
"hide_from_feed":false,
"free_podcast_url":null,
"free_podcast_duration":null
},
"podcast_preview_upload_id":null,
"podcastUpload":{
"id":"06e2c81a-16e8-4c32-a936-a1f89d596005",
"name":"news-24.mp3",
"created_at":"2025-02-04T23:09:13.865Z",
"uploaded_at":"2025-02-04T23:09:23.872Z",
"publication_id":2270667,
"state":"transcoded",
"post_id":156491923,
"user_id":191014175,
"duration":2684.9436,
"height":null,
"width":null,
"thumbnail_id":1,
"preview_start":null,
"preview_duration":null,
"media_type":"audio",
"primary_file_size":"42959560",
"is_mux":false,
"mux_asset_id":null,
"mux_playback_id":null,
"mux_preview_asset_id":null,
"mux_preview_playback_id":null,
"mux_rendition_quality":null,
"mux_preview_rendition_quality":null,
"explicit":false,
"copyright_infringement":null,
"src_media_upload_id":null,
"live_stream_id":null,
"transcription":{
"media_upload_id":"06e2c81a-16e8-4c32-a936-a1f89d596005",
"created_at":"2025-02-04T23:10:05.279Z",
"requested_by":191014175,
"status":"transcribed",
"modal_call_id":"fc-01JK9KMQDG2KRT327JTSR976DS",
"approved_at":"2025-02-04T23:12:43.876Z",
"transcript_url":"s3://substack-video/video_upload/post/156491923/06e2c81a-16e8-4c32-a936-a1f89d596005/1738710621/transcription.json",
"attention_vocab":null,
"speaker_map":null,
"captions_map":{
"en":{
"url":"s3://substack-video/video_upload/post/156491923/06e2c81a-16e8-4c32-a936-a1f89d596005/1738710621/en.vtt",
"language":"en",
"original":true
}
},
"cdn_url":"https://substackcdn.com/video_upload/post/156491923/06e2c81a-16e8-4c32-a936-a1f89d596005/1738710621/transcription.json?Expires=1739315665&Key-Pair-Id=APKAIVDA3NPSMPSPESQQ&Signature=kkyvVNWtOpL1VD3Y4n0LkIioiMU3r10tYrBXXlrNC927uzauCnLvzbp4j4-VU5UBwCm5HTayghaSnVeUPWfr6GPD-YTYgeHufNrkwCtmgqinTner3DwKh7z4EsvxbTkH58qXOAR82qLG8MHuu~iSTsXJ5CARuEeGPTW121bHK74poh6QH6jMT3iW-8qqRv4VP4aioSWL8OQyolUxoalTWSiejR6RE9RTxdRUMUbg8pk60GN3nzq3NTRff0qiZtnwJuvh~-A0L4FiTCiNtdFJsHOfYmcieyRydEDj7rHLsgY7yzuFXnsQx2qau9aoF79XAsJ5s4T1EySb~vg7fMTPNQ__",
"cdn_unaligned_url":"https://substackcdn.com/video_upload/post/156491923/06e2c81a-16e8-4c32-a936-a1f89d596005/1738710621/unaligned_transcription.json?Expires=1739315665&Key-Pair-Id=APKAIVDA3NPSMPSPESQQ&Signature=TpwJtOqaEBWXzOc2orXqEOjPumCJZzfg0wCNb22pdu~T8GQ2PCRcJaPAd6qZZzEPMFLsHbtiEvxC23ZIiD4D6ft94TvlQVYJV~TKKfTy0nC8Ut77ni9FJvzfRQhfYw1tUPdQ2mQEx2s5~TVCKlplUoaceWJ03B55xSURcT9apy4-8X2MjZk57O9Z-almjQ2QtkvxOUxNWvGiM1HN4RGKCPNfu211OpEn1rVMbmU~0WdZ3Sz7QaaXRnJbc3~tQKkev4MWXyq-E8lYa88lpNFj1LJnpF9piIAwfJUtkPym8SXdqiMLFHj0B7pBc4Th15eTUeL1MgdM145boHfnuAlP~w__",
"signed_captions":[
{
"language":"en",
"url":"https://substackcdn.com/video_upload/post/156491923/06e2c81a-16e8-4c32-a936-a1f89d596005/1738710621/en.vtt?Expires=1739315665&Key-Pair-Id=APKAIVDA3NPSMPSPESQQ&Signature=JpSxDJtOUrhIB698DRiCsap2PZTwPNbQaH1zBYxivUyFGmr8oTHIU80z0SELPqmnsaJmpRTgXjuAKJMbBBMeZlg7FwCNAPxijb9jhS5Oai-SZrDKH4jG21RJwh2TiF0Yg0yp9kF0z8xpK56RzqeS7JCHUJG8iVQmjqSNrrvCVtOGBGN3fBIHlr7Z3RKioxD0grIAatAkDbCwjkc49~~XWa9hd-awKhqByx2o2w6uHlNfuC5ZRRpwnaiX8Ju6rxU9MwW24QQCKCpjbND6S6kaGJ-Z~N90shdh-fD31FymCJ4quq6M1JaEyonZIjcwyUc6FduiKeQi25x-MvUIITtrow__",
"original":true
}
]
}
},
"podcastPreviewUpload":null,
"voiceover_upload_id":null,
"voiceoverUpload":null,
"has_voiceover":false,
"description":"Welcome to the AI Search podcast. Here are the top highlights in AI this week.",
"body_json":null,
"body_html":null,
"truncated_body_text":"INSANE AI news: OpenAI o3-mini, DeepSeek Janus-Pro, Qwen2.5-Max, Riffusion FUZZ, YuE AI music generator, Doubao 1.5 Pro, Google Daily Listen, Tulu 3",
"wordcount":38,
"postTags":[
{
"id":"06ec7467-035f-41d3-aa2a-f2dafd005ba2",
"publication_id":2270667,
"name":"research",
"slug":"research",
"hidden":false
},
{
"id":"134583b8-fd61-4289-83f8-2768f0e74637",
"publication_id":2270667,
"name":"machine learning",
"slug":"machine-learning",
"hidden":false
},
{
"id":"3b762592-4665-4885-b8c3-5c01a13fbd93",
"publication_id":2270667,
"name":"artificial intelligence",
"slug":"artificial-intelligence",
"hidden":false
},
{
"id":"aae204af-3183-4835-9184-ac27d860a342",
"publication_id":2270667,
"name":"science",
"slug":"science",
"hidden":false
},
{
"id":"d07870f1-c05d-4f6b-9a7e-c94b7e0ba2c5",
"publication_id":2270667,
"name":"tech",
"slug":"tech",
"hidden":false
},
{
"id":"e9d24f2a-02b0-4774-bc85-8b311ff1ab12",
"publication_id":2270667,
"name":"ai",
"slug":"ai",
"hidden":false
}
],
"teaser_post_eligible":true,
"postCountryBlocks":[],
"coverImagePalette":{
"Vibrant":{
"rgb":[
60,
180,
252
],
"population":3621
},
"DarkVibrant":{
"rgb":[
109,
52,
68
],
"population":14
},
"LightVibrant":{
"rgb":[
100,
196,
252
],
"population":5
},
"Muted":{
"rgb":[
164,
87,
108
],
"population":5
},
"DarkMuted":{
"rgb":[
86,
47,
61
],
"population":115
},
"LightMuted":{
"rgb":[
218,
174,
194
],
"population":99
}
},
"publishedBylines":[
{
"id":191014175,
"name":"AI Search",
"handle":"aisearch",
"previous_name":null,
"photo_url":"https://substack-post-media.s3.amazonaws.com/public/images/e1ef43b4-d382-41ad-8e0b-86080c6f0b2a_1400x1400.png",
"bio":"Stay up to date with AI news, tech, & research",
"profile_set_up_at":"2024-01-18T18:32:07.954Z",
"publicationUsers":[
{
"id":2288577,
"user_id":191014175,
"publication_id":2270667,
"role":"admin",
"public":true,
"is_primary":false,
"publication":{
"id":2270667,
"name":"AI Search",
"subdomain":"aisearch",
"custom_domain":null,
"custom_domain_optional":false,
"hero_text":"Welcome to the AI Search newsletter. We bring you the highlights in AI every week. No fluff, just the interesting stuff. \n\nSubscribe and get a FREE cheat sheet on the top 50 most useful AI tools!",
"logo_url":"https://substack-post-media.s3.amazonaws.com/public/images/2905d20a-608c-4fa8-9bdf-7af0c3792e1a_1280x1280.png",
"author_id":191014175,
"theme_var_background_pop":"#FF0000",
"created_at":"2024-01-18T18:32:47.729Z",
"rss_website_url":null,
"email_from_name":"AI Search",
"copyright":"AI Search",
"founding_plan_name":null,
"community_enabled":false,
"invite_only":false,
"payments_state":"disabled",
"language":null,
"explicit":false,
"is_personal_mode":false
}
}
],
"is_guest":false,
"bestseller_tier":null
}
],
"reaction":null,
"reaction_count":0,
"comment_count":0,
"child_comment_count":0,
"is_geoblocked":false,
"hasCashtag":false,
"scrapedAt":"2025-02-10T05:26:49.500Z"
},
...
]

Related Actors

You might also like

Substack Leaderboard Scraper ๐Ÿ“Š

easyapi/substack-leaderboard-scraper

Scrape detailed publication data from Substack leaderboards. Get comprehensive insights about top newsletters including subscriber counts, pricing, author details, and more. Perfect for newsletter research and market analysis.

Substack Scraper

scraper_guru/substack-scraper

Extract complete data from Substack newsletters including posts, authors, engagement metrics, and article text. 13 fields per post. Fast and reliable.

๐Ÿ‘ User avatar

LIAICHI MUSTAPHA

43

2.6

Substack Newsletter Scraper

digispruce/substack-scraper

Extract comprehensive Substack newsletter data including author profiles, subscriber counts, social media links, and contact information for B2B outreach and market research.

Substack Scraper โ€” Posts, Authors & Newsletters

cryptosignals/substack-scraper

Extract Substack newsletter content. Get post titles, authors, publish dates, paywall status, subscriber counts, and full article text. Ideal for newsletter research and content monitoring. PPE pricing โ€” pay only for results.

27

Substack Scraper

qpayre/substack-scraper

The Substack Author Scraper is a powerful Apify actor that makes it easy for content creators to scrape and retrieve all posts from their favorite Substack authors. With structured data presented in a user-friendly format, analyzing and processing valuable information has never been easier.

Substack Scraper | All-In-One

fatihtahta/substack-scraper

Get full articles, user profiles, and search results with All-in-One Substack Scraper. Extract rich data including titles, bios, subscriber counts, social links and engagement metrics. ideal for market research, creator discovery, trend tracking, and audience analysis.

135

Substack Notes Scraper ๐Ÿ”

easyapi/substack-notes-scraper

Extract notes and comments from Substack's search results with images, user info, and engagement metrics. Perfect for content analysis, user research, and tracking discussions around specific topics on Substack.

YouTube Video Details Scraper

maged120/youtube-video-details

Extract full metadata from any YouTube video or Short โ€” title, views, likes, comments, subtitles, chapters, tags, and more. No YouTube API key needed.

Substack Scraper

automation-lab/substack-scraper

Scrape Substack newsletters โ€” posts, comments, publication metadata. Full archive depth with no caps. Export to JSON, CSV, Excel, or connect via API.

๐Ÿ‘ User avatar

Stas Persiianenko

187

Bizquest [Only $1๐Ÿ’ฐ] Scraper

memo23/bizquest-scraper

๐Ÿ’ฐ$1 per 1K BizQuest businesses, asset sales & franchises from search or detail URLs: title, URL, asking price, location, industry, cash flow, EBITDA, gross income, inventory, employees, years operating, address, summary, broker contact, facilities, financing, flags nested details + raw API fields

๐Ÿ‘ User avatar

Muhamed Didovic

55

5.0