Hacker News Scraper - Stories, Comments & Search
Pricing
from $0.30 / 1,000 results
Hacker News Scraper - Stories, Comments & Search
Scrape Hacker News stories, comments, polls, jobs, and Ask/Show HN posts. Search by keyword and date range, pull a user's full activity, or fetch any HN list (front page, new, best, ask, show, job). No auth required. Built for AI/ML datasets, tech trend research, and news monitoring.
Pricing
from $0.30 / 1,000 results
Rating
0.0
(0)
Developer
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
a month ago
Last modified
Share
๐ Hacker News Scraper
Scrape Hacker News stories, comments, polls, jobs, and Ask/Show HN posts. Search by keyword + date range, pull full HN lists (front page, new, best, ask, show, jobs), or fetch any user's complete activity.
โก Uses HN's official Algolia + Firebase APIs. No login, no rate-limit nightmares, no scraping HTML.
โจ What you can do
- ๐ Keyword search with date range, tag filter, min-points threshold, and sort by relevance/date/points
- ๐ Pull any HN list โ top, new, best, Ask HN, Show HN, jobs
- ๐ค Get any user's full activity โ stories AND comments, by username
- ๐ฌ Optionally fetch full comment trees โ flattened with parent IDs (great for AI training)
๐ Quick start
- Click Try for free
- Pick mode:
search,list, oruser - Enter targets
- Click Start
๐ฅ Input examples
Search for "claude" stories from the last month with 50+ points
{"mode":"search","searchQueries":["claude"],"tags":"story","sortBy":"points","since":"2026-04-01","minPoints":50,"maxItems":500,"includeComments":true}
Pull current front page
{"mode":"list","listType":"topstories","maxItems":30,"includeComments":true,"maxCommentsPerStory":50}
Get pg's recent activity
{"mode":"user","users":["pg","dang"],"maxItems":100}
๐ค Output (per item)
{"type":"story","id":12345678,"title":"Show HN: A new way to scrape data","text":null,"author":"username","points":234,"numComments":87,"createdAt":"2026-04-15T12:34:56.000Z","url":"https://example.com/article","hnUrl":"https://news.ycombinator.com/item?id=12345678","tags":["story","front_page"],"comments":[{"type":"comment","id":12345679,"text":"Great post!","author":"commenter","points":12,"createdAt":"2026-04-15T13:00:00.000Z","parentId":12345678}]}
๐ฏ Use cases
| Who | Why |
|---|---|
| ๐ค AI / LLM teams | High-signal tech-discussion training data, filterable by topic and quality (points threshold) |
| ๐ฐ Tech journalists | Track what's trending in dev/startup community |
| ๐งโ๐ป Engineers | Dataset of "Show HN" launches, open-source releases, new tools |
| ๐ VCs / scouts | Monitor early-stage signals across founders posting on HN |
| ๐ SEO researchers | Track tech keywords surfacing in HN search trends |
โ๏ธ Tech notes
- Search uses HN's Algolia API โ fast, fielded, supports complex queries
- Lists & item details use HN's Firebase API โ official, real-time
- No auth required, no API key needed, no rate-limit (within reason)
- Comments are fetched recursively from Firebase for accuracy and full trees
โ FAQ
Will I get rate-limited? HN is generous โ the Algolia API is unmetered for reasonable use. Comment-tree scraping at large scale is slower because each item is a separate Firebase call.
Are deleted/dead items included? No โ they're filtered automatically.
Can I scrape historical data?
Yes. Use since and until in search mode to pull any historical date range. HN's archive goes back to 2007.
Schedule it? Yes โ set up an Apify Schedule for daily/hourly trend tracking.
