VOOZH about

URL: https://apify.com/nominated_tupelo/hacker-news-scraper

โ‡ฑ ๐Ÿ“ฐ Hacker News Scraper โ€” Stories, Comments, Users & Search ยท Apify


๐Ÿ‘ Hacker News Scraper: Stories, Comments, Users & Search avatar

Hacker News Scraper: Stories, Comments, Users & Search

Pricing

Pay per usage

Go to Apify Store

Hacker News Scraper: Stories, Comments, Users & Search

Scrape Hacker News stories, comments, user profiles, and search by keyword using the official HN Firebase API and Algolia search API. No auth required.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

๐Ÿ‘ kade

kade

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

10 days ago

Last modified

Share

What does Hacker News Scraper do?

Hacker News Scraper extracts stories, comments, user profiles, and full-text search results from Hacker News โ€” the legendary tech community run by Y Combinator. It uses the official Firebase API (no rate limits) and the Algolia search API (full archive search) to pull clean, structured JSON data. No API key required. No browser. No proxy needed.

Use it to monitor trending discussions, research historical topics, track user activity, analyze YC startup trends, or build datasets for LLM fine-tuning and sentiment analysis.

Why use Hacker News Scraper?

  • Research: Find every HN discussion about a technology, company, or topic across all time
  • Monitoring: Track daily top/new/best stories and alert on keywords
  • Data science: Build training datasets from high-quality technical discussion
  • Competitive intel: Monitor what the dev community says about your product or competitors
  • Hiring & HR: Find Ask HN job threads and talent signals
  • Podcast & newsletter: Auto-curate top HN content for weekly digests

How to use Hacker News Scraper

  1. Go to the Input tab and select a Scrape Mode
  2. For Top/New/Best/Ask/Show/Jobs: set Max Items and optional Minimum Score filter
  3. For Keyword Search: enter a query, choose sort (relevance or date), and optionally restrict to stories or comments
  4. For Thread: paste a story ID or HN URL to get the story + all comments
  5. For User: enter a username to get their profile and submissions
  6. Click Start and find your results in the Output tab as clean JSON

Input

FieldTypeDescription
scrapeModeenumtopStories, newStories, bestStories, askStories, showStories, jobStories, search, thread, user
searchQuerystringKeywords to search (Algolia full-text). Used with search mode
searchSortByenumrelevance or date (newest first)
searchTypeenumstory, comment, or all
storyIdstringStory ID or HN URL for thread mode
usernamestringHN username for user mode
maxItemsintegerMax items to return (0 = no limit, default: 100)
includeCommentsbooleanFetch full comment trees for each story in feed/search modes
maxCommentsPerStoryintegerMax comments per story (default: 50)
minScoreintegerFilter stories below this score (default: 0 = no filter)

Output

Each scraped item is a JSON object. You can download the dataset in JSON, CSV, HTML, or Excel format.

Story example

{
"type":"story",
"id":40123456,
"title":"Show HN: I built a tool that does X",
"by":"username",
"score":342,
"descendants":87,
"url":"https://example.com/article",
"text":null,
"createdAt":"2026-06-01T14:30:00+00:00",
"hnUrl":"https://news.ycombinator.com/item?id=40123456",
"commentIds":[40123457,40123458]
}

Comment example

{
"type":"comment",
"id":40123457,
"parentId":40123456,
"storyId":40123456,
"by":"commenter",
"text":"<p>This is really interesting because...</p>",
"depth":0,
"createdAt":"2026-06-01T14:45:00+00:00",
"hnUrl":"https://news.ycombinator.com/item?id=40123457"
}

User example

{
"type":"user",
"id":"pg",
"karma":184923,
"about":"<p>Co-founder of Y Combinator.</p>",
"createdAt":"2006-10-09T18:00:00+00:00",
"submittedCount":1245,
"hnUrl":"https://news.ycombinator.com/user?id=pg"
}

Data fields

FieldDescription
typestory, comment, user, or job
idHN item/user ID
titleStory headline
byAuthor username
scoreStory points (upvotes)
descendantsTotal comment count
urlExternal article URL
textSelf-post body or comment text (HTML)
createdAtISO 8601 timestamp
hnUrlDirect HN link
commentIdsIDs of top-level replies
depthComment nesting depth
karmaUser karma score (user items only)

Pricing

This actor uses the Pay Per Event model. You are charged per item scraped:

  • ~500 stories from top feed: ~$0.05
  • 1,000 search results: ~$0.05
  • Full thread with 200 comments: ~$0.02

Hacker News uses an open public API with no rate limits, so runs complete quickly and cheaply.

Tips

  • Monitoring: Use topStories with minScore: 50 to get only high-signal stories
  • Research: search mode with searchSortBy: date gives you chronological archives
  • Comment depth: includeComments: false with feed modes keeps costs minimal while capturing discussion
  • Thread analysis: thread mode gives the full discussion tree including nested replies

FAQ & disclaimers

Is this legal? Yes. This actor uses the official public Hacker News Firebase API provided by Y Combinator/Firebase and the public Algolia search API. No ToS violation occurs.

Can I get all HN data ever? HN has over 40 million items. The Algolia search API covers the full archive. For bulk exports use search mode with broad queries.

Something broken? Open an issue on the actor's Issues tab.

You might also like

Hacker News Stories, Comments & Users Scraper

crawlerbros/hacker-news-scraper

Scrape Hacker News - search stories and comments, fetch top/new/best stories, get user profiles and submission history. Uses the official Algolia HN Search API and Hacker News Firebase API.

Hacker News Scraper

moving_beacon-owner1/my-actor-76

A production-ready Apify Actor that scrapes Hacker News stories, comments, user profiles, and search results using the official Firebase API and Algolia HN Search API.

2

Hacker News Scraper

rupom888/hackernews-scraper

Scrape stories, jobs, comments, and polls from Hacker News using the official HN Firebase API. Get top/new/best/ask/show stories with comments, search by keyword via Algolia HN Search API. Reliable and no rate limiting.

๐ŸŸง Hacker News Scraper โ€” Stories, Comments & Search by Keyword

iskoren/hacker-news-scraper

Search and scrape Hacker News stories, comments, and polls by keyword โ€” points, authors, comment counts, dates, and links. Powered by the official HN API.

Hacker News Enhanced Scraper - Stories, Comments & Search

hata1234/hn-scraper

Scrape Hacker News stories, comments, and search results via official Firebase and Algolia APIs. No proxy needed. Supports top, best, new, Ask HN, Show HN, job stories, full-text search, comment extraction, and advanced filtering by points, date, and domain.

Hacker News Scraper โ€” Stories, Jobs, Comments & Users API

bovi/hacker-news-scraper

Scrape Hacker News stories, comments, jobs, and user profiles via the official Firebase and Algolia APIs. No proxy, no auth. Supports top/new/best/ask/show/job feeds, full-text search, comment trees, and user data. Pay per result.

๐Ÿ‘ User avatar

Vitalii Bondarev

1

Hacker News Scraper: Stories, Comments, Users & Search

perconey/hackernews-scraper

Scrape Hacker News via the official Firebase API + Algolia search. Top/new/best/ask/show/jobs stories, full comment trees, user profiles with karma, free-text search. No browser, no proxies, no auth. Pay only per result item.

Hacker News Scraper

cloud9_ai/hackernews-scraper

Scrape Hacker News stories, comments, and user profiles via official Firebase API. Get top, new, best, ask, show stories with scores, comments, and author data.