VOOZH about

URL: https://apify.com/iskoren/hacker-news-scraper

โ‡ฑ Hacker News Scraper โ€” Stories & Comments by Keyword ยท Apify


๐Ÿ‘ ๐ŸŸง Hacker News Scraper โ€” Stories, Comments & Search by Keyword avatar

๐ŸŸง Hacker News Scraper โ€” Stories, Comments & Search by Keyword

Pricing

from $0.01 / 1,000 results

Go to Apify Store

๐ŸŸง Hacker News Scraper โ€” Stories, Comments & Search by Keyword

Search and scrape Hacker News stories, comments, and polls by keyword โ€” points, authors, comment counts, dates, and links. Powered by the official HN API.

Pricing

from $0.01 / 1,000 results

Rating

0.0

(0)

Developer

๐Ÿ‘ Is Koren

Is Koren

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a day ago

Last modified

Share

Scrape Hacker News at scale with a fast, reliable Hacker News scraper built on the official Algolia HN Search API. Search Hacker News by keyword to pull matching stories, comments, and polls, or grab the current front page and newest items โ€” no API key, no login, and no anti-bot headaches. Every result is emitted as one clean, structured JSON record ready for analysis, dashboards, alerting, or AI pipelines.

Whether you are tracking what Hacker News says about your product, monitoring keywords like "artificial intelligence", or building a dataset of top stories, this Hacker News scraper gets you there in seconds.

โœจ Features

  • ๐Ÿ”Ž Keyword search across Hacker News stories, comments, and polls.
  • ๐Ÿ“ฐ Front page mode โ€” scrape the current HN front page without a query.
  • ๐Ÿ† Sort by relevance or date (newest first).
  • ๐ŸŽฏ Numeric filters โ€” only keep items above a minimum points or comment count.
  • ๐Ÿ“„ Automatic pagination up to the Algolia ~1000-result cap.
  • ๐Ÿงฑ Flat, structured output โ€” one record per result, ready for CSV/JSON/Excel export.
  • ๐Ÿ›ก๏ธ No anti-bot issues โ€” uses the public Algolia HN API, so runs are cheap and stable.

๐Ÿš€ Quick start

Paste this input to scrape the top 10 stories about artificial intelligence:

{
"query":"artificial intelligence",
"contentType":"story",
"sortBy":"relevance",
"maxItems":10
}

Scrape the current front page (no keyword needed):

{
"query":"",
"contentType":"front_page",
"sortBy":"relevance",
"maxItems":30
}

Find the newest highly-upvoted discussions about a topic:

{
"query":"rust programming",
"contentType":"story",
"sortBy":"date",
"minPoints":50,
"maxItems":100
}

โš™๏ธ Input

FieldTypeDefaultDescription
querystring"artificial intelligence"Keyword or phrase to search for. Leave empty to fetch the latest items / front page.
contentTypeselectstoryWhat to scrape: story, comment, poll, or front_page.
sortByselectrelevancerelevance (best match) or date (newest first).
maxItemsinteger50Maximum total results (1โ€“1000; Algolia caps near 1000).
minPointsintegerโ€”Only keep items with at least this many points.
minCommentsintegerโ€”Only keep items with at least this many comments.
proxyConfigurationproxy{ "useApifyProxy": true }Proxy settings. Datacenter proxies work fine here.

๐Ÿ“ค Output

Each result is pushed as one record to the dataset. Example story record:

{
"query":"artificial intelligence",
"objectID":"39038064",
"title":"The rise of artificial intelligence agents",
"url":"https://example.com/ai-agents",
"author":"pg",
"points":412,
"numComments":187,
"createdAt":"2026-01-12T09:33:00.000Z",
"createdAtTimestamp":1768210380,
"hnUrl":"https://news.ycombinator.com/item?id=39038064",
"storyText":null,
"tags":["story","author_pg","story_39038064"]
}

Comment records additionally include commentText, storyId, and parentId.

FieldDescription
queryThe search query used for the run.
objectIDUnique Hacker News item ID.
titleStory/poll title (null for comments).
urlExternal link (null for Ask/Show HN and text posts).
authorHacker News username of the author.
pointsScore / upvotes.
numCommentsNumber of comments on the item.
createdAtISO 8601 creation timestamp.
createdAtTimestampUnix creation timestamp.
hnUrlCanonical Hacker News discussion URL.
storyTextHTML-stripped self/Ask HN text (if any).
tagsAlgolia _tags array.
commentText(Comments only) HTML-stripped comment body.
storyId(Comments only) ID of the parent story.
parentId(Comments only) ID of the direct parent item.

โ“ FAQ

Do I need a Hacker News API key? No. This Hacker News scraper uses the free, public Algolia HN Search API โ€” no key or login.

How many results can I get? The Algolia HN API caps results at roughly 1000 per query. Set maxItems accordingly.

Why is url sometimes null? Ask HN, Show HN, and text posts have no external link, so url is null. Use hnUrl for the discussion page and storyText for the body.

Can I scrape only comments? Yes โ€” set contentType to comment. Records will include commentText, storyId, and parentId.

Will I get rate-limited or blocked? The Algolia HN API is very tolerant and has no anti-bot protection, so datacenter proxies are fine.

๐Ÿ’ก Tips

  • Use sortBy: "date" with minPoints to build a feed of fresh, already-popular discussions.
  • Combine query with contentType: "comment" to mine sentiment and opinions on a topic.
  • Leave query empty and set contentType: "front_page" to snapshot the HN front page on a schedule.
  • Schedule this actor to run hourly to monitor a keyword and feed results into Slack or a webhook.

You might also like

๐Ÿ”ด Reddit Scraper โ€” Posts, Comments & Data

nexgendata/reddit-scraper

Extract posts, comments & subreddit data from Reddit. Monitor brand mentions, track discussions & build social listening tools. Get upvotes, awards & comment threads. Pay per post.

Hacker News Scraper โ€” Stories, Comments & Jobs

cryptosignals/hackernews-scraper

Scrape Hacker News stories, comments, and user profiles โ€” extract title, URL, score, author, comment threads, and submission time. CSV/JSON output.

6

Hacker News Enhanced Scraper - Stories, Comments & Search

hata1234/hn-scraper

Scrape Hacker News stories, comments, and search results via official Firebase and Algolia APIs. No proxy needed. Supports top, best, new, Ask HN, Show HN, job stories, full-text search, comment extraction, and advanced filtering by points, date, and domain.

Hacker News Scraper

rupom888/hackernews-scraper

Scrape stories, jobs, comments, and polls from Hacker News using the official HN Firebase API. Get top/new/best/ask/show stories with comments, search by keyword via Algolia HN Search API. Reliable and no rate limiting.

HackerNews Insights Scraper โ€” Stories, Comments & Velocity

brilliant_gum/hackernews-insights-scraper

Hacker News stories, full comment trees, user karma and contact info, story velocity tracking, history deltas. Search all 3.7M stories with filters for points, karma, domain, dates, keywords. For VCs hunting Show HN, recruiters mining talent, journalists tracking tech, and AI/RAG pipelines.

๐Ÿ‘ User avatar

Yuliia Kulakova

2

Hacker News Scraper

gentle_cloud/hacker-news-scraper

Scrape Hacker News stories, comments, and user data. Supports top/new/best/ask/show/job story feeds and full-text keyword search via the Algolia API. Extract titles, URLs, scores, authors, comment counts, and timestamps.

59

Hacker News Intelligence Scraper

fascinating_lentil/hacker-news-intelligence-scraper

Scrape Hacker News stories, comments, jobs, Ask HN, Show HN, and keyword search results. Export clean JSON or CSV with scores, authors, URLs, dates, filters, and nested discussions. No login or API key required.

๐Ÿ‘ User avatar

Md Jakaria Mirza

2

Hacker News Scraper

nogards95/hacker-news-scraper

Scrape Hacker News stories, comments, jobs, Ask HN, and Show HN using Algolia Search API and HN Firebase API. Supports full-text search, date/points filters, and live feeds.

2

5.0

(1)