VOOZH about

URL: https://apify.com/sheshinmcfly/reddit-thread-scraper

⇱ Reddit Thread Scraper - Posts & Comments Β· Apify


Pricing

from $2.00 / 1,000 results

Go to Apify Store

Reddit Thread Scraper

Extract posts and top comments from any Reddit thread or subreddit. Returns post title, author, score, URL, body text, and top-voted comments with full metadata. Ideal for sentiment analysis, research, AI training datasets, and community monitoring. No API key required.

Pricing

from $2.00 / 1,000 results

Rating

0.0

(0)

Developer

πŸ‘ Sheshinmcfly

Sheshinmcfly

Maintained by Community

Actor stats

1

Bookmarked

2

Total users

1

Monthly active users

9 days ago

Last modified

Share

Extract posts and comments from any subreddit via Reddit's official public JSON API. No authentication required. Filter by sort order, time range, and number of comments.

Perfect for AI training datasets, sentiment analysis, market research, and trend monitoring.


What data does it extract?

Posts

FieldDescriptionExample
typeRecord type"post"
idReddit post ID"1sa4rlx"
subredditSubreddit name"MachineLearning"
titlePost title"New paper on LLM reasoning"
authorUsername"researcher123"
scoreUpvotes - downvotes1420
upvoteRatioUpvote ratio0.97
numCommentsTotal comment count83
selftextPost body text"We propose a new..."
urlLink URL"https://arxiv.org/..."
permalinkReddit post URL"https://reddit.com/r/..."
flairPost flair label"Research"
createdAtPost creation time"2026-04-21T10:00:00Z"
extractedAtExtraction timestamp"2026-04-21T12:00:00Z"

Comments

FieldDescriptionExample
typeRecord type"comment"
idComment ID"abc123"
postIdParent post ID"1sa4rlx"
authorUsername"user456"
bodyComment text"Great work, but..."
scoreUpvotes - downvotes342
depthNesting level (0 = top-level)0
permalinkDirect link to comment"https://reddit.com/..."
createdAtComment creation time"2026-04-21T10:05:00Z"

Use cases

  • AI training data: Clean text from expert communities for LLM fine-tuning
  • Sentiment analysis: Monitor brand mentions and user opinions
  • Market research: Track trends and discussions in niche communities
  • Competitive intelligence: See what problems users are discussing
  • RAG pipelines: Feed domain-specific knowledge into retrieval systems
  • Content research: Find top-performing posts for content strategy

How to use

  1. Open the actor and configure:
    • Subreddits: List subreddit names (e.g. MachineLearning, investing, python)
    • Sort: hot, new, top, or rising
    • Time filter: For "top" sort β€” day, week, month, year, all
    • Max posts: Cap per subreddit
    • Include comments: Also extract top comments
  2. Click Start
  3. Download results as JSON, CSV, or Excel

Example output (JSON)

[
{
"type":"post",
"id":"1sa4rlx",
"subreddit":"MachineLearning",
"title":"[D] New method achieves SOTA on reasoning benchmarks",
"author":"ml_researcher",
"score":1420,
"upvoteRatio":0.97,
"numComments":83,
"selftext":"We introduce a novel approach...",
"url":"https://arxiv.org/abs/2504.12345",
"permalink":"https://www.reddit.com/r/MachineLearning/comments/1sa4rlx/",
"flair":"Research",
"createdAt":"2026-04-21T10:00:00.000Z",
"extractedAt":"2026-04-21T12:00:00.000Z"
},
{
"type":"comment",
"id":"kxyz789",
"postId":"1sa4rlx",
"subreddit":"MachineLearning",
"author":"deep_learner",
"body":"Impressive results. Did you test on out-of-distribution benchmarks?",
"score":342,
"depth":0,
"permalink":"https://www.reddit.com/r/MachineLearning/comments/1sa4rlx/comment/kxyz789/",
"createdAt":"2026-04-21T10:05:00.000Z",
"extractedAt":"2026-04-21T12:00:00.000Z"
}
]

Pricing

This actor charges $0.002 USD per item extracted (posts and comments each count as one item). Extracting 100 posts with 10 comments each = 1,100 items β‰ˆ $2.20 USD.


Keywords

reddit scraper, subreddit posts extractor, reddit comments scraper, reddit data for AI, reddit sentiment analysis, reddit thread extractor, social media scraper, reddit API scraper, NLP training data, reddit market research


Legal Disclaimer

This actor extracts publicly available data only from Reddit using Reddit's official public JSON API (reddit.com/r/{subreddit}.json), in compliance with Chilean Law 19.628 on the Protection of Private Life (Ley 19.628 sobre ProtecciΓ³n de la Vida Privada).

What this actor does NOT collect:

  • Private messages or non-public posts
  • Email addresses or personal contact information
  • Data from private or restricted subreddits
  • Any data not freely visible to anonymous visitors

What this actor collects:

  • Post titles, body text, and metadata (public content)
  • Publicly visible usernames and comment text
  • Engagement metrics (score, upvotes, comment counts)

All data is publicly accessible without authentication via Reddit's JSON API. Users are solely responsible for ensuring their use of this data complies with applicable laws and Reddit's terms of service.

Other actors you may like

You might also like

Reddit Posts & Comments Scraper β€” Full Thread Extraction

maged120/reddit-scraper

Scrape Reddit posts and full comment threads from any post URL. Extract title, score, author, timestamp, and all nested comments without login.

Reddit Thread Details Scraper

ecomscrape/reddit-thread-details-scraper

Reddit Thread Details Scraper automates extraction of comprehensive thread metadata including post content, engagement metrics, author information, and moderation data. Efficiently collect detailed Reddit data for social listening, market research, sentiment analysis, and community insights.

ecomscrape

3

Reddit Comments Scraper

quakerish_joyride/reddit-comments-scraper

Extract comments from any Reddit post or subreddit. Returns structured JSON with author, score, timestamp, and nested replies. Fast, no API key required.

48

Reddit Scraper

fetch_cat/reddit-scraper

Scrape public Reddit posts and comments from subreddit, search, user, and thread RSS feeds without Reddit API credentials.

Reddit Posts Scraper β€” Comments, Replies & Subreddits

khadinakbar/reddit-posts-scraper

Extract Reddit posts, comments & subreddit data with no login required. Returns title, score, author, flair, body text, and dates MCP-ready. $2.00/1K.

Reddit Comment Scraper β€” Post Comments & Subreddit Monitoring

automly/reddit-comment-scraper

Extract comments from specific Reddit posts or from the top posts of any subreddit. Supports all Reddit comment sort modes. Residential proxy required for reliable access.

Reddit Api Scraper

scrapio/reddit-api-scraper

Extract structured Reddit data with the Reddit API Scraper. Collect posts, comments, usernames, upvotes, subreddit names, and timestamps directly through the Reddit API. Ideal for market research, sentiment analysis, and community monitoring.

Reddit Search Scraper β€” Posts, Comments & Users

logiover/reddit-search-scraper

Scrape Reddit subreddit search with no API key or login. Export posts and comments to CSV/JSON β€” a Reddit API alternative for keyword monitoring.