URL: https://apify.com/solid-scraper/reddit-comments-scraper

Reddit Comments Scraper · Apify


Pricing

from $2.99 / 1,000 results

Reddit Comments Scraper

🔎 Extract valuable Reddit comments with this Comments Scraper: fast, accurate, and built for research, sentiment, and community insights. 📊✨ Perfect for marketers, analysts, and data teams wanting actionable results.

Rating: 0.0 (0)

Developer: SolidScraper (Maintained by Community)

Actor stats

Bookmarked: 0
Total users: 2
Monthly active users: 1
Last modified: 6 days ago

Reddit Comments Scraper 📣

Reddit Comments Scraper automatically collects comments (including nested replies, when enabled) from one or more Reddit posts and returns a flat record per comment, complete with path and depth metadata. If you want to scrape Reddit comments, extract thread comments for analysis, or build a bulk comment-scraping workflow, this tool gives you structured comment data at scale, without manually copying threads one by one. Whether you're a marketer, data analyst, researcher, or developer, you can use this actor to speed up collection and save hours of manual work.


Why choose Reddit Comments Scraper?

Feature | Benefit
------- | -------
✅ Comments + Nested Replies Collection | Extracts top-level comments and (optionally) the full reply tree for each post
✅ All-in-One Batch Input | Lets you scrape comments from multiple post URLs in a single run
✅ Reliable Scraping with Fallback Logic | Includes retries and handles access challenges using a real browser session
✅ Proxy Support for Stability | Supports configurable proxy settings to improve scraping reliability
✅ Structured Flat Output | Returns one JSON record per comment with path/depth metadata for easy downstream processing
✅ Scales with Concurrency Controls | Uses configurable parallelism via maximum concurrent posts to fit your throughput needs

Key features

  • 📊 Flat comment data with tree metadata: Produces one record per comment with commentPath and commentDepth so you can analyze conversation structure.
  • 💬 Optional nested reply extraction: When enabled, replies to comments are also collected (full thread tree); when disabled, only top-level comments are returned.
  • Sort-controlled comment ordering: Supports top, best, new, controversial, old, and qa sorting to match your research needs.
  • 🧠 Top-level vs reply awareness: Adds isTopLevel and parentPath so you can distinguish roots from replies in your analysis.
  • 🛡️ Resilient runs with retries: Uses multiple attempts per post to reduce the chance of partial failures.
  • Post URL support: Accepts one or more Reddit post URLs and normalizes them for collection.
  • 💾 Dataset-ready results: Pushes extracted comment records to the Apify dataset as JSON (one list of items per successful post).
  • ⚙️ Concurrency controls: Uses maxConcurrentPosts so you can balance speed against memory usage.

Input

Provide input via an input.json file. Example structure:

{
  "postUrls": [
    "https://www.reddit.com/r/AskMec/comments/14990m6/les_applications_de_rencontres_fonctionnent_telles/"
  ],
  "maxComments": 500,
  "includeNestedReplies": true,
  "sortBy": "top",
  "maxConcurrentPosts": 2,
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}

Input Fields

Field | Required | Description
----- | -------- | -----------
postUrls | Yes | One or more Reddit post URLs to scrape comments from.
maxComments | No | Maximum number of comments to extract per post (counts nested replies too). Default is 500. Must be at least 1.
includeNestedReplies | No | When enabled, replies to comments are also extracted (the full thread tree). When disabled, only top-level comments are returned. Default is true.
sortBy | No | How Reddit should order the comments before they are collected. Options: top, best, new, controversial, old, qa. Default is top.
maxConcurrentPosts | No | How many posts to scrape in parallel. Each post runs its own browser, so higher values need more memory. Default is 2 (min 1, max 10).
proxyConfiguration | No | Proxy settings for the scraper. If provided, the actor uses your configuration; otherwise it creates a default proxy configuration with residential groups.
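
The documented defaults and limits can be checked before a run starts. The following is a minimal sketch, not part of the actor: `build_run_input` and `SORT_OPTIONS` are hypothetical names, and the checks only mirror the field descriptions listed above.

```python
from __future__ import annotations

from typing import Any, Optional

# Sort options as listed in the Input Fields table.
SORT_OPTIONS = {"top", "best", "new", "controversial", "old", "qa"}


def build_run_input(
    post_urls: list[str],
    max_comments: int = 500,
    include_nested_replies: bool = True,
    sort_by: str = "top",
    max_concurrent_posts: int = 2,
    proxy_configuration: Optional[dict[str, Any]] = None,
) -> dict[str, Any]:
    """Build an input dict, rejecting values outside the documented limits."""
    if not post_urls:
        raise ValueError("postUrls needs at least one Reddit post URL")
    if max_comments < 1:
        raise ValueError("maxComments must be at least 1")
    if sort_by not in SORT_OPTIONS:
        raise ValueError(f"sortBy must be one of {sorted(SORT_OPTIONS)}")
    if not 1 <= max_concurrent_posts <= 10:
        raise ValueError("maxConcurrentPosts must be between 1 and 10")

    run_input: dict[str, Any] = {
        "postUrls": list(post_urls),
        "maxComments": max_comments,
        "includeNestedReplies": include_nested_replies,
        "sortBy": sort_by,
        "maxConcurrentPosts": max_concurrent_posts,
    }
    # Leave the key out entirely so the actor falls back to its own default.
    if proxy_configuration is not None:
        run_input["proxyConfiguration"] = proxy_configuration
    return run_input
```

Writing the result with `json.dump` would give an input.json in the shape shown in the example structure.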

Output

For each successfully processed post, the actor pushes a list of JSON comment records to the Apify dataset (charged_event_name="result").

Example output record:

[
  {
    "postUrl": "https://www.reddit.com/r/.../comments/.../",
    "postTitle": "Example post title",
    "postAuthor": "example_author",
    "postScore": 12345,
    "subreddit": "examplesubreddit",
    "commentDepth": 0,
    "commentAuthor": "comment_author",
    "commentText": "This is a comment body.",
    "commentTimestamp": "2024-01-15T10:22:33.000Z",
    "commentPath": "0",
    "parentPath": null,
    "isTopLevel": true,
    "replyCount": 2,
    "scrapedAt": "2024-01-15T10:30:00.000Z"
  }
]

Output Fields

Field | Type | Description
----- | ---- | -----------
postUrl | string | The normalized Reddit post URL for which the comment was scraped.
postTitle | string | The post title.
postAuthor | string | The post author username.
postScore | number | The post score at the time of collection.
subreddit | string | The subreddit name.
commentDepth | number | Depth of the comment in the thread tree (top-level is 0).
commentAuthor | string | The comment author username.
commentText | string | The comment body text.
commentTimestamp | string | UTC timestamp (ISO-8601 with milliseconds and trailing Z) for when the comment was created.
commentPath | string | Encoded position of the comment within the tree (e.g., "0", "0/1", "0/1/0").
parentPath | string or null | The parent comment's commentPath (or null for top-level comments).
isTopLevel | boolean | true when commentDepth is 0; otherwise false.
replyCount | number | Count of direct replies to this comment.
scrapedAt | string | UTC timestamp (ISO-8601 with milliseconds and trailing Z) indicating when the scraping happened.
error_message | string | Not part of the dataset schema emitted by this actor. Failures are logged, and only posts that succeed push records.

You can export the resulting dataset from Apify as JSON or CSV (depending on your chosen export settings in the Apify UI).
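
Because each record carries commentPath, parentPath, and commentDepth, the thread can be rebuilt from the flat list. The sketch below assumes that path segments are integers separated by "/", as in the examples "0", "0/1", "0/1/0", and that all records belong to one post. `SAMPLE`, `group_replies`, and `render_thread` are hypothetical names, and the sample records are made up for illustration.

```python
from collections import defaultdict

# Made-up records holding only the fields this sketch needs.
SAMPLE = [
    {"commentPath": "0/1", "parentPath": "0", "commentDepth": 1,
     "commentAuthor": "carol", "commentText": "Second reply"},
    {"commentPath": "0", "parentPath": None, "commentDepth": 0,
     "commentAuthor": "alice", "commentText": "Top-level"},
    {"commentPath": "0/0", "parentPath": "0", "commentDepth": 1,
     "commentAuthor": "bob", "commentText": "First reply"},
    {"commentPath": "1", "parentPath": None, "commentDepth": 0,
     "commentAuthor": "dave", "commentText": "Another root"},
]


def path_key(record):
    """Turn "0/1/0" into [0, 1, 0] so paths sort numerically, not as text."""
    return [int(part) for part in record["commentPath"].split("/")]


def group_replies(records):
    """Map each parentPath (None for roots) to its direct replies' paths."""
    children = defaultdict(list)
    for record in records:
        children[record["parentPath"]].append(record["commentPath"])
    return dict(children)


def render_thread(records):
    """Return the thread as indented text, parents before their replies."""
    lines = []
    for record in sorted(records, key=path_key):
        indent = "  " * record["commentDepth"]
        lines.append(f'{indent}{record["commentAuthor"]}: {record["commentText"]}')
    return "\n".join(lines)
```

Sorting by the numeric path puts every comment directly before its replies, so `render_thread(SAMPLE)` lists alice, then bob and carol indented beneath her, then dave.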


How to use Reddit Comments Scraper (via Apify Console)

  1. Open Apify Console: Go to console.apify.com and log in.
  2. Find the actor: Search for Reddit Comments Scraper in the Actors marketplace and open the actor page.
  3. Open the INPUT panel: In the actor run screen, locate the INPUT section.
  4. Add your post URLs: Paste one or more Reddit post URLs into postUrls.
  5. Choose your comment limits and structure: Set maxComments (per post), enable or disable includeNestedReplies, and pick sortBy if you need a specific ordering.
  6. Set concurrency for your budget: Adjust maxConcurrentPosts (each parallel post uses its own browser, so higher values use more memory).
  7. Configure proxy (optional): If you have your own proxy settings, add them under proxyConfiguration; otherwise the actor creates a default residential proxy configuration.
  8. Run & monitor: Click Run. Watch logs for progress, extraction counts, and any retry attempts.
  9. Open the OUTPUT dataset: After completion, go to the dataset tab to preview the extracted Reddit comments and export them to JSON/CSV.

No coding required: get Reddit comment data in minutes.
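
For readers who would rather script the same steps, a run can probably also be started from Python. This sketch assumes the third-party apify-client package (`pip install apify-client`) and assumes the client returns the run as a dict with a `defaultDatasetId` key, as in the 1.x releases. The actor ID comes from the store URL; `run_scraper` is a hypothetical helper name.

```python
ACTOR_ID = "solid-scraper/reddit-comments-scraper"  # taken from the store URL


def run_scraper(token, run_input):
    """Start the actor, wait for it to finish, and return its dataset items."""
    # Imported here so the module loads even where apify-client is absent.
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    if run is None:
        raise RuntimeError("The actor run did not return any run details")
    # Assumption: the run is a dict exposing its default dataset ID.
    dataset = client.dataset(run["defaultDatasetId"])
    return list(dataset.iterate_items())
```

A call such as `run_scraper("<YOUR_API_TOKEN>", {"postUrls": ["<post URL>"]})` would then return the comment records as a list of dicts.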


Advanced features & SEO optimization

  • Engineered for comments-to-CSV pipelines: The actor is designed for pipelines that need a clean, flat structure for analysis and BI.
  • 🧩 Thread-aware output for conversation mining: Each comment includes commentPath, parentPath, commentDepth, replyCount, and isTopLevel, which makes mining Reddit comments and reconstructing threads much easier.
  • 🕒 Consistent UTC timestamps: Uses ISO-8601 scrapedAt and commentTimestamp values for reliable time-based analysis.
  • 🧰 Input-friendly sorting: With sortBy, you can align collection with your research question (for example, focusing on the most upvoted or most recent discussions).
  • Resilience for public web data: Includes retries and supports configurable proxy settings for stable scraping of publicly available data.
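
The timestamp format described above (ISO-8601 with milliseconds and a trailing Z) can be parsed with the Python standard library. This is a minimal sketch: `parse_ts` and `comment_age_at_scrape` are hypothetical names, and it relies on `%z` accepting a literal "Z", which holds from Python 3.7 onward.

```python
from datetime import datetime, timedelta

# Matches values such as "2024-01-15T10:22:33.000Z".
TS_FORMAT = "%Y-%m-%dT%H:%M:%S.%f%z"


def parse_ts(value: str) -> datetime:
    """Parse a commentTimestamp or scrapedAt string into an aware datetime."""
    return datetime.strptime(value, TS_FORMAT)


def comment_age_at_scrape(record: dict) -> timedelta:
    """How old the comment was at the moment it was scraped."""
    return parse_ts(record["scrapedAt"]) - parse_ts(record["commentTimestamp"])
```

For the example output record, whose timestamps are 10:22:33 and 10:30:00 on the same day, this yields a timedelta of 447 seconds.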

Best use cases

  • 📈 Marketing teams: Collect Reddit comments from multiple posts to find recurring themes and messaging angles for outreach campaigns.
  • 🧠 Researchers: Gather structured Reddit comment data for qualitative coding and for quantifying sentiment or discussion depth.
  • 💬 Community managers: Monitor how conversations evolve by scraping threads with sortBy and analyzing commentDepth distributions.
  • Data analysts: Build a conversation graph using commentPath, parentPath, and replyCount from a bulk scraping run.
  • 🧪 Product teams: Compare feedback across communities by scraping comments from posts in relevant subreddits and exporting to CSV.
  • 💻 Developer pipelines: Feed structured results into downstream systems (ETL, dashboards, or CRM enrichment steps) with predictable fields per comment.
  • 🎯 Content strategists: Scrape comments from posts to identify what users actually respond to, then iterate on your content based on real discussion threads.
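
As one illustration of the depth analysis mentioned in these use cases, the distribution of commentDepth values can be counted directly from the flat records. The function names are hypothetical; only the commentDepth and isTopLevel fields from the output schema are used.

```python
from collections import Counter


def depth_distribution(records):
    """Count comments at each commentDepth, ordered from shallow to deep."""
    counts = Counter(record["commentDepth"] for record in records)
    return dict(sorted(counts.items()))


def top_level_share(records):
    """Fraction of records that are top-level comments (0.0 when empty)."""
    if not records:
        return 0.0
    top_level = sum(1 for record in records if record["isTopLevel"])
    return top_level / len(records)
```

A thread dominated by depth 0 suggests many isolated reactions, while a long tail of deeper levels points to back-and-forth discussion.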

Technical specifications

  • Supported Input Formats

    • ✅ postUrls: array of Reddit post URLs
    • ✅ maxComments: integer (default 500, minimum 1)
    • ✅ includeNestedReplies: boolean (default true)
    • ✅ sortBy: string enum (top, best, new, controversial, old, qa)
    • ✅ maxConcurrentPosts: integer (default 2, range 1 to 10)
    • ✅ Optional proxyConfiguration
  • Proxy Support

    • ✅ Configurable proxy support via proxyConfiguration
    • ✅ Default residential proxy configuration when proxyConfiguration is not provided
  • Retry Mechanism

    • ✅ Retries are built in for each post (multiple attempts per post)
  • Dataset Structure

    • ✅ JSON records pushed to the dataset with one flat record per comment
    • ✅ Includes commentPath/parentPath/commentDepth for thread reconstruction
  • Rate Limits & Performance

    • ✅ Designed for batch processing with configurable concurrency using maxConcurrentPosts
    • ⚠️ Each concurrent post uses its own browser session, so higher concurrency can increase memory usage
  • Limitations

    • ❌ Mod/bot-pinned comments are skipped (stickied items are not included)
    • ❌ Only publicly accessible comment data from the provided posts is collected
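
To make the interplay between maxConcurrentPosts and per-post retries concrete, here is a generic sketch of bounded concurrency with retries. It is not the actor's implementation, which the page does not show: `scrape_all` and `fetch_post` are hypothetical, and the limit of three attempts is an assumption (the page only says "multiple attempts").

```python
import asyncio


async def scrape_all(post_urls, fetch_post, max_concurrent_posts=2, max_attempts=3):
    """Run fetch_post(url) for every URL, at most max_concurrent_posts at once.

    fetch_post is a caller-supplied coroutine function (hypothetical here).
    A post that fails on every attempt maps to None; other posts continue.
    """
    semaphore = asyncio.Semaphore(max_concurrent_posts)

    async def scrape_one(url):
        async with semaphore:  # wait for a free slot before starting
            for attempt in range(1, max_attempts + 1):
                try:
                    return await fetch_post(url)
                except Exception:
                    if attempt == max_attempts:
                        return None  # out of attempts for this post

    results = await asyncio.gather(*(scrape_one(url) for url in post_urls))
    return dict(zip(post_urls, results))
```

The semaphore is what keeps memory bounded: each slot corresponds to one browser session in the description above, so raising the limit trades memory for throughput.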

FAQ

What does Reddit Comments Scraper return?

✅ It returns a flat list of JSON records, one record per comment, with thread metadata like commentPath, parentPath, and commentDepth, plus comment content (commentText) and timestamps (commentTimestamp).

Can it scrape nested replies?

✅ Yes. With includeNestedReplies enabled, replies to comments are also extracted so you get the full thread tree. If you disable it, only top-level comments are returned.

How many comments can I extract from each post?

You control it with maxComments. It sets the maximum number of comments extracted per post and counts nested replies too.

Can I control the order of comments?

✅ Yes. Use sortBy to choose how comments are ordered before they are collected: top, best, new, controversial, old, or qa.

Does it support scraping multiple Reddit posts at once?

✅ Yes. Provide multiple links in postUrls. You can also control parallelism with maxConcurrentPosts to balance speed and resource usage.

Is there a dataset export format other than JSON?

Apify datasets can be exported after the run. The actor pushes JSON-formatted records to the dataset, and you can export to CSV from the Apify UI depending on your settings.
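
If you already hold the JSON records locally, a CSV can also be produced with the standard library. This is a sketch under the assumption that the column order follows the Output Fields table; `FIELDS` and `records_to_csv` are hypothetical names.

```python
import csv
import io

# Column order follows the Output Fields table.
FIELDS = [
    "postUrl", "postTitle", "postAuthor", "postScore", "subreddit",
    "commentDepth", "commentAuthor", "commentText", "commentTimestamp",
    "commentPath", "parentPath", "isTopLevel", "replyCount", "scrapedAt",
]


def records_to_csv(records) -> str:
    """Serialize comment records to CSV text with a header row.

    Missing keys and None values (such as parentPath on top-level
    comments) become empty cells; unknown keys are ignored.
    """
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=FIELDS, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()
```

Comment text containing commas or line breaks is quoted by the csv module, so multi-line comment bodies stay in a single cell.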

Do I need to use a proxy?

❌ You don't have to, but you can. If you provide proxyConfiguration, the actor will use it; otherwise it creates a default residential proxy configuration to improve scraping reliability.

Is this compliant with privacy rules?

✅ The actor only collects data from publicly accessible sources. You're responsible for using the results in accordance with applicable laws (including privacy and platform rules) for your specific use case.


Support & feature requests

If you're using Reddit Comments Scraper for Reddit comment scraping or data extraction workflows, we'd love to hear how it's working for you.

  • 💡 Feature Requests: Examples include additional export controls, more post-level metadata fields, or enhancements tailored for bulk Reddit comment mining pipelines.
  • 📧 Contact: For questions, support, or feedback, reach out at dataforleads@gmail.com.

Your feedback helps shape the roadmap for this tool.


Use Reddit Comments Scraper to collect Reddit comments as structured, thread-aware output, so you can scale analysis without the manual grind.


Disclaimer

This tool only accesses publicly accessible sources. It does not access private profiles, authenticated data, or password-protected content.

You are responsible for ensuring your use complies with applicable laws (for example, GDPR/CCPA), spam regulations, and the relevant platform terms of service. For data removal requests, contact dataforleads@gmail.com. Always use Reddit Comments Scraper responsibly, ethically, and for legitimate purposes.
