👁 Reddit Scraper - Posts, Comments, Subreddits & Users avatar

Reddit Scraper - Posts, Comments, Subreddits & Users

Pricing

from $1.50 / 1,000 results

👁 Reddit Scraper - Posts, Comments, Subreddits & Users

Reddit Scraper - Posts, Comments, Subreddits & Users

Fast, reliable Reddit scraper. Extract posts, comments, subreddits & users from any subreddit without Reddit API keys or login. AI-ready JSON for LLM training, sentiment analysis, lead generation. Export JSON/CSV/Excel.

Pricing

from $1.50 / 1,000 results

Rating

0.0

(0)

Developer

👁 deusex machine

deusex machine

Maintained by Community

Actor stats

Bookmarked

115

Total users

Monthly active users

18 days ago

Last modified

Reddit Scraper — Posts, Comments, Subreddits & Users API

Reddit Scraper is a fast, reliable Reddit data extraction tool that lets you scrape posts, comments, subreddits, and users from any subreddit or Reddit search query — without a Reddit API key, without login, and without rate limits. Extract structured JSON data ready for LLM pipelines, AI training, sentiment analysis, lead generation, market research, and brand monitoring.

Unlike the official Reddit API, this scraper has no OAuth setup, no app registration, no 60-requests-per-minute cap, and no 10-post limit per subreddit listing. Just give it a list of subreddits or a search query and it returns clean, normalized Reddit data in JSON, CSV, or Excel.

29 post fields · 10 comment fields · Nested comment threads · Image galleries · Flairs · Subreddit stats · Search any query · Export to JSON/CSV/Excel

✨ Why use this scraper

The Reddit API is powerful, but it's also:

Rate-limited: 60 requests per minute per OAuth client, 10 posts per listing page.
Authenticated: Requires app registration, OAuth flow, and token refresh logic.
Incomplete: Doesn't return all fields the public web UI shows (upvote ratios, galleries, media metadata).
Unpredictable: Terms-of-service changes have repeatedly broken third-party Reddit API consumers.

This Reddit scraper gives you:

✅ No Reddit API key needed — extracts Reddit data from public JSON endpoints.
✅ No login, no OAuth — scrape Reddit anonymously, no credentials to manage.
✅ High throughput — 1,000+ Reddit posts per run, concurrent subreddits, session persistence.
✅ Rich Reddit data — 29 fields per post, 10 fields per comment, nested replies up to depth 3.
✅ AI-ready JSON output — plug directly into LLM pipelines, RAG systems, or sentiment analysis models.
✅ Multiple export formats — JSON, CSV, Excel (XLSX), or direct API access via the Apify Dataset API.
✅ 99%+ success rate — automatic proxy rotation, retries, and session reuse keep your scraping jobs stable.

📤 Output fields

Every Reddit post is returned with 29 normalized fields covering text content, media, scoring, flairs, subreddit metadata, and timestamps. When comments are enabled, the scraper also returns the nested comment tree as structured JSON.

Post fields (29)

Field	Description
`id`	Reddit post ID (e.g. `t3_1s6e3dp`)
`subreddit`	Subreddit name (e.g. `technology`)
`title`	Post title
`author`	Reddit username of the post author
`score`	Net upvotes minus downvotes
`upvoteRatio`	Upvote ratio (e.g. `0.95` = 95% upvotes)
`numComments`	Total comment count
`url`	Reddit permalink to the post
`selftext`	Post body for text posts (up to 5,000 chars)
`thumbnail`	Thumbnail preview URL
`imageUrls`	All image URLs from galleries and image posts
`media`	Video URL + duration, or image URL
`created`	Post creation time (ISO 8601)
`edited`	Last edit timestamp, or `false`
`isVideo`	Video post flag
`isSelf`	Text post (`true`) vs link post (`false`)
`isGallery`	Multi-image gallery post
`domain`	Source domain (e.g. `youtube.com`, `self.technology`)
`linkUrl`	External URL for link posts
`flair`	Post flair (e.g. `Discussion`, `News`, `Privacy`)
`awards`	Total Reddit awards
`isNSFW`	NSFW flag
`isSpoiler`	Spoiler flag
`isPinned`	Pinned/stickied by mods
`numCrossposts`	Times crossposted to other subreddits
`subredditSubscribers`	Subreddit subscriber count
`postType`	Classification: `text`, `link`, `video`, `image`, `gallery`
`scrapedAt`	Scraping timestamp (ISO 8601)
`comments`	Array of comments (when enabled)

Comment fields (10)

Field	Description
`id`	Comment ID
`author`	Commenter Reddit username
`body`	Comment text (up to 2,000 chars)
`score`	Net upvotes
`created`	Timestamp (ISO 8601)
`depth`	Nesting level (0 = top-level, 1 = reply, 2 = reply-to-reply)
`isSubmitter`	Whether the commenter is the post author
`parentId`	Parent comment or post ID
`controversiality`	Controversy flag (0 or 1)
`replies`	Number of direct replies

🎯 Use cases

Reddit data powers some of the most valuable public datasets for AI, research, and market intelligence. Here's how teams use this Reddit scraper:

1. AI & LLM training data

Reddit posts and comments are a gold mine for training conversational AI, instruction-tuning LLMs, and building RAG systems. The scraper outputs clean JSON that drops straight into your embedding pipeline. Use the searchQuery input to filter for domain-specific Reddit data (e.g. medical, legal, finance).

2. Sentiment analysis & brand monitoring

Extract Reddit posts and comments mentioning your brand, product, or competitors. Feed them into VADER, RoBERTa, or an LLM and track sentiment trends over time. The scraper's comment threading means you capture full discussion context, not isolated quotes.

3. Lead generation

Search Reddit for people asking for solutions your product solves. The scraper supports filters like searchQuery: "best CRM for small business" or sort: top, timeFilter: month to find high-intent prospects. Combine with the author field to build contact lists.

4. Market research

Monitor entire subreddits (e.g. r/smallbusiness, r/saas, r/entrepreneur) for trending topics, pain points, and recurring questions. The scraper returns subreddit subscriber counts, post engagement, and time-based filters so you can segment by reach and recency.

5. Academic research

Researchers use Reddit data for computational social science, public health monitoring, and linguistic analysis. This scraper gives you reproducible, timestamp-stamped Reddit data without the friction of the Reddit API's OAuth flow.

6. Content discovery & trend spotting

Journalists, newsletter writers, and content marketers scrape Reddit to surface emerging topics before they hit mainstream media. Sort by rising or top/day to catch conversations at the right moment.

7. Competitor intelligence

Scrape Reddit discussions about competitor products to extract feature requests, complaints, and comparison threads. The flair, numComments, and upvoteRatio fields help you prioritize signal over noise.

🚀 How to use

Example 1 — Scrape hot posts from multiple subreddits

{
"subreddits":["technology","programming","webdev"],
"maxPosts":50,
"sort":"hot"
}

maxPosts applies per subreddit — this returns up to 150 Reddit posts total across the 3 subreddits.

Example 2 — Search across all of Reddit

{
"searchQuery":"best CRM for small business",
"maxPosts":100,
"sort":"top",
"timeFilter":"month"
}

This searches Reddit globally for posts matching your query, sorted by top scores from the past month.

Example 3 — Get Reddit posts with nested comments

{
"subreddits":["AskReddit"],
"maxPosts":25,
"sort":"top",
"timeFilter":"week",
"includeComments":true,
"maxCommentsPerPost":20
}

This returns 25 top posts from r/AskReddit this week, each with up to 20 nested comments (including replies up to depth 3).

Example 4 — Scrape Reddit from Python

from apify_client import ApifyClient
client = ApifyClient("<YOUR_APIFY_TOKEN>")
run_input ={
"subreddits":["MachineLearning","LocalLLaMA"],
"maxPosts":200,
"sort":"top",
"timeFilter":"week",
"includeComments":True,
"maxCommentsPerPost":25,
}
run = client.actor("makework36/reddit-scraper").call(run_input=run_input)
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
print(item["title"],"·", item["score"],"upvotes")

Example 5 — Scrape Reddit from Node.js

import{ ApifyClient }from'apify-client';
const client =newApifyClient({token:'<YOUR_APIFY_TOKEN>'});
const run =await client.actor('makework36/reddit-scraper').call({
searchQuery:'apify reddit scraper',
maxPosts:50,
sort:'relevance',
timeFilter:'all',
});
const{ items }=await client.dataset(run.defaultDatasetId).listItems();
console.log(`Scraped ${items.length} Reddit posts`);

Example 6 — Trigger a Reddit scraping run from cURL

curl-X POST "https://api.apify.com/v2/acts/makework36~reddit-scraper/run-sync-get-dataset-items?token=<YOUR_APIFY_TOKEN>"\
-H"Content-Type: application/json"\
-d'{
 "subreddits": ["news"],
 "maxPosts": 10,
 "sort": "new"
 }'

Request routing — each subreddit or search query becomes a seeded request with its own session cookie and proxy IP.
Public JSON endpoints — the scraper hits Reddit's public .json endpoints (e.g. /r/{sub}/hot.json) which return the same data the Reddit web UI consumes. No Reddit API key required.
Proxy rotation — datacenter IPs are blocked by Reddit; the scraper rotates residential proxies and retries up to 12 times per request.
Session persistence — working proxy + session cookie pairs are reused across subreddits to reduce block rates.
Comment tree reconstruction — comment replies are fetched recursively and linked via parentId so you can rebuild the full thread.
Schema normalization — Reddit's raw JSON is mapped to 29 clean fields with consistent types and ISO 8601 timestamps.

📥 Input

Parameter	Type	Default	Description
`subreddits`	array	`[]`	List of subreddit names to scrape (no `r/` prefix)
`searchQuery`	string	—	Search Reddit globally for this term
`maxPosts`	integer	`50`	Max posts per subreddit (1-500)
`sort`	string	`hot`	Sort order: `hot`, `new`, `top`, `rising`, `relevance`
`timeFilter`	string	`day`	Time range for `top` / `relevance`: `hour`, `day`, `week`, `month`, `year`, `all`
`includeComments`	boolean	`false`	Fetch comments for each Reddit post
`maxCommentsPerPost`	integer	`10`	Comments per post (1-100), includes nested replies up to depth 3

📋 Output example

Each item in the dataset is a single Reddit post as a JSON object. When includeComments: true, each post includes a comments array.

{
"id":"t3_1s6gkmj",
"subreddit":"pics",
"title":"About 100,000 attended the No Kings protest in St. Paul, Minnesota",
"author":"katotooo",
"score":24915,
"upvoteRatio":0.97,
"numComments":235,
"url":"https://www.reddit.com/r/pics/comments/1s6gkmj/about_100000_attended_the_no_kings_protest/",
"selftext":null,
"thumbnail":"https://preview.redd.it/oy94fh2hnvrg1.jpg?width=140&height=93",
"imageUrls":[
"https://preview.redd.it/oy94fh2hnvrg1.jpg?width=3024&format=pjpg",
"https://preview.redd.it/9pmbe5aqnvrg1.jpg?width=4032&format=pjpg"
],
"media":null,
"created":"2026-03-29T00:21:20.000Z",
"edited":false,
"isVideo":false,
"isSelf":false,
"isGallery":true,
"domain":"old.reddit.com",
"linkUrl":"https://www.reddit.com/gallery/1s6gkmj",
"flair":"Politics",
"awards":0,
"isNSFW":false,
"isSpoiler":false,
"isPinned":false,
"numCrossposts":2,
"subredditSubscribers":33336092,
"postType":"gallery",
"scrapedAt":"2026-03-29T08:05:31.904Z",
"comments":[
{
"id":"od1rtqc",
"author":"YJSubs",
"body":"When I see Bernie, I thought, did you just identify him as bald eagle?",
"score":1,
"created":"2026-03-29T00:21:22.000Z",
"depth":0,
"isSubmitter":false,
"parentId":"t3_1s6gkmj",
"controversiality":0,
"replies":1
},
{
"id":"od1s2fp",
"author":"rclonecopymove",
"body":"Same, he's bald but not very aquiline.",
"score":1,
"created":"2026-03-29T00:22:45.000Z",
"depth":1,
"isSubmitter":false,
"parentId":"t1_od1rtqc",
"controversiality":0,
"replies":0
}
]
}

Below are three real examples of Reddit data returned by this scraper (anonymized).

Sample 1 — tech discussion post

{
"id":"t3_abc123",
"subreddit":"programming",
"title":"Why I switched from Postgres to SQLite for my side project",
"author":"indie_dev_42",
"score":3284,
"upvoteRatio":0.94,
"numComments":412,
"flair":"Discussion",
"postType":"text",
"subredditSubscribers":5100000
}

Sample 2 — image gallery post

{
"id":"t3_def456",
"subreddit":"EarthPorn",
"title":"Sunrise over Torres del Paine, Patagonia",
"author":"nature_photog",
"score":18920,
"upvoteRatio":0.99,
"numComments":78,
"postType":"gallery",
"imageUrls":["https://preview.redd.it/....jpg"]
}

Sample 3 — video post with comments

{
"id":"t3_ghi789",
"subreddit":"nextfuckinglevel",
"title":"Robot parkour demo from Boston Dynamics",
"score":42018,
"upvoteRatio":0.96,
"media":{"videoUrl":"https://v.redd.it/....mp4","duration":47},
"postType":"video",
"comments":[{"author":"user1","body":"...","score":2100,"depth":0}]
}

⚡ Performance

Scenario	Cost	Typical runtime
1,000 Reddit posts, no comments	~$0.016	~5-10 min
1,000 Reddit posts + 10 comments each	~$0.05	~12-18 min
3 subreddits × 50 posts	~$0.003	~40 sec
Full subreddit scrape (500 posts + 50 comments)	~$0.08	~15-25 min

Pricing follows the Apify Compute Unit model — you pay only for the compute used. No Reddit API subscription, no proxy add-ons, no hidden fees.

Tips to reduce cost:

Set includeComments: false if you only need post metadata.
Use a tight timeFilter (day or week) instead of all to avoid paginating deep archives.
Scrape specific subreddits instead of broad search queries when possible.

❓ FAQ

Does this Reddit scraper need a Reddit API key? No. It extracts Reddit data from public JSON endpoints with a stealth browser, no Reddit API key or OAuth required.

How is this different from the official Reddit API? The Reddit API has strict rate limits (60 req/min), requires OAuth, and caps listings at 10 posts per page. This scraper has no such limits, supports search and comment extraction out of the box, and returns richer data.

What does maxPosts mean? It's per subreddit, not global. 3 subreddits × 50 maxPosts = up to 150 Reddit posts.

How deep do comments go? Up to 3 levels: top-level (depth 0), replies (depth 1), replies-to-replies (depth 2). Each comment has a parentId so you can rebuild the thread.

Can I scrape NSFW subreddits? Yes, but results may include adult content. Use the isNSFW field to filter.

Is this legal? Scraping public Reddit data for research, journalism, and business intelligence is generally allowed under fair-use principles. Consult your legal team for your specific use case and review Reddit's User Agreement.

Can I schedule recurring Reddit scrapes? Yes. Use Apify's Scheduler to run this scraper hourly, daily, or weekly. Pair it with the Apify Webhooks to push new Reddit data to your own database.

📊 Comparison

Choosing the right Reddit scraper depends on what Reddit data you need, how often you need it, and whether you're willing to deal with the Reddit API's OAuth flow and rate limits. Here's an honest comparison:

Feature	This Reddit Scraper	Official Reddit API	Reddit PRAW library	Generic web scrapers
Reddit API key required	❌ No	✅ Yes (OAuth)	✅ Yes (OAuth)	❌ No
Rate limit	None	60 req/min	60 req/min	Varies
Comment threads	✅ Nested up to depth 3	✅ Full tree	✅ Full tree	❌ Usually not
Search across Reddit	✅ Yes	✅ Yes	✅ Yes	❌ Manual
Subreddit scraping	✅ Multi-subreddit, parallel	✅ One at a time	✅ One at a time	❌ Manual
Export to JSON / CSV / Excel	✅ All three	❌ JSON only	❌ Python objects	Varies
Maintenance burden	Apify handles it	You handle OAuth + retries	You handle OAuth + retries	You handle everything
Cost for 10K Reddit posts	~$0.50	Free but slow	Free but slow	Free + your dev time
Setup time	<1 minute	30-60 minutes	15 minutes	Hours or days

When to use this Reddit scraper:

You need Reddit data fast, without setting up OAuth.
You want structured, normalized JSON without writing parsers.
You're feeding Reddit data into an AI / LLM pipeline.
You want to scrape multiple subreddits in parallel.
You need CSV / Excel export for non-engineers.

When to use the official Reddit API instead:

You're building a Reddit bot that posts, votes, or messages.
You need real-time WebSocket events.
You're fine with 60-requests-per-minute and OAuth setup.

💵 Pricing

The Reddit scraper is already among the cheapest Reddit data sources per post, but you can push it even lower:

Turn off comments when you don't need them. includeComments: false cuts cost by ~3x.
Scope timeFilter tightly. Scraping top/all pulls deep history; top/week is often enough.
Prefer subreddits over searchQuery when possible. Search pages are heavier than subreddit listings.
Schedule hourly instead of real-time. Reddit's new feed doesn't change fast enough to justify <15-min polling for most use cases.
Dedupe across runs. Store Reddit post IDs in Redis / DynamoDB and skip posts you've already scraped.
Cap maxCommentsPerPost smartly. 10-20 comments per post captures most of the signal; 100 is usually overkill.

📝 Changelog

1.0 — Initial public release. 29 post fields, 10 comment fields, search and subreddit modes, nested comment threads, AI-ready JSON output, JSON/CSV/Excel export.

👁 Reddit Scraper avatar

Reddit Scraper

prodiger/reddit-scraper

Extract posts, comments, user profiles, and search results from Reddit. Pure HTTP, no API key required.

👁 User avatar

Arnas

153

👁 Reddit Comments Scraper avatar

Reddit Comments Scraper

simpleapi/reddit-comments-scraper

Reddit Comments Scraper provides clean Reddit comment datasets for analysis. Track conversation flow, engagement signals, and user feedback across threads for research, insights, and data analytics use cases.

👁 User avatar

SimpleAPI

👁 Reddit Scraper - Posts, Comments & Users avatar

Reddit Scraper - Posts, Comments & Users

betterdevsscrape/reddit-scraper

Extract posts, comments, communities & user profiles from any subreddit at scale. Fetches all comments including hidden/collapsed ones. Breaks Reddit's 1000-post limit with date windowing. No login needed, no browser. $0.003 per result. Supports search, sorting, NSFW filtering & date filtering.

👁 User avatar

Better Devs Scrape

972

👁 Reddit Scraper Pro avatar

Reddit Scraper Pro

harshmaur/reddit-scraper-pro

Reddit Scraper Pro is a powerful, unlimited scraping for $20/mo for extracting data from Reddit. Scrape posts, users, comments, and communities with advanced search capabilities. Perfect for brand monitoring, trend tracking, and competitor research. Supports make, n8n integrations

👁 User avatar

Harsh Maur

2.5K

4.7

👁 Reddit Scraper - Posts, Comments & Subreddits avatar

Reddit Scraper - Posts, Comments & Subreddits

viralanalyzer/reddit-scraper

Extract Reddit posts, comments, subreddit data, and user profiles.

👁 User avatar

viralanalyzer

5.0

👁 Reddit Scraper avatar

Reddit Scraper

trudax/reddit-scraper

Unlimited Reddit web scraper to crawl posts, comments, communities, and users without login. Limit web scraping by number of posts or items and extract all data in a dataset in multiple formats.

👁 User avatar

Trudax

14K

2.5

👁 Reddit MCP Server — Claude, ChatGPT, Cursor, Codex avatar

Reddit MCP Server — Claude, ChatGPT, Cursor, Codex

makework36/reddit-mcp-server

Native Reddit MCP server for AI agents. 7 Reddit tools (search, subreddits, posts+comments, users, trending) over Streamable HTTP. Works with Claude Desktop, Cursor, ChatGPT, OpenAI Codex, Agents SDK, Windsurf. No Reddit API key. Pay per tool call.

👁 User avatar

deusex machine

👁 Reddit Comments Scraper avatar

Reddit Comments Scraper

mysteriousshadow/reddit-comments-scraper

Easily extract Reddit comments with customizable depth. Retrieve top-level comments, direct replies, or entire thread hierarchies in flat or nested formats for seamless analysis and research.

👁 User avatar

Mysterious Shadow

👁 Reddit Search Scraper — Posts, Comments & Users avatar

Reddit Search Scraper — Posts, Comments & Users

logiover/reddit-search-scraper

Scrape Reddit subreddit search with no API key or login. Export posts and comments to CSV/JSON — a Reddit API alternative for keyword monitoring.

👁 User avatar

Logiover

👁 Reddit Scraper Lite avatar

Reddit Scraper Lite

trudax/reddit-scraper-lite

Pay Per Result, unlimited Reddit web scraper to crawl posts, comments, communities, and users without login. Limit web scraping by number of posts or items and extract all data in a dataset in multiple formats.

👁 User avatar

Trudax

28K

4.6

URL: https://apify.com/makework36/reddit-scraper

⇱ Reddit Scraper API - Posts, Comments, Subreddits Data · Apify

Reddit Scraper - Posts, Comments, Subreddits & Users

Reddit Scraper — Posts, Comments, Subreddits & Users API

✨ Why use this scraper

📤 Output fields

Post fields (29)

Comment fields (10)

🎯 Use cases

1. AI & LLM training data

2. Sentiment analysis & brand monitoring

3. Lead generation

4. Market research

5. Academic research

6. Content discovery & trend spotting

7. Competitor intelligence

🚀 How to use

Example 1 — Scrape hot posts from multiple subreddits

Example 2 — Search across all of Reddit

Example 3 — Get Reddit posts with nested comments

Example 4 — Scrape Reddit from Python

Example 5 — Scrape Reddit from Node.js

Example 6 — Trigger a Reddit scraping run from cURL

📥 Input

📋 Output example

Sample 1 — tech discussion post

Sample 2 — image gallery post

Sample 3 — video post with comments

⚡ Performance

❓ FAQ

📊 Comparison

💵 Pricing

📝 Changelog

You might also like

Reddit Scraper

Reddit Comments Scraper

Reddit Scraper - Posts, Comments & Users

Reddit Scraper Pro

Reddit Scraper - Posts, Comments & Subreddits

Reddit Scraper

Reddit MCP Server — Claude, ChatGPT, Cursor, Codex

Reddit Comments Scraper

Reddit Search Scraper — Posts, Comments & Users

Reddit Scraper Lite