Hacker News Scraper & API - Export Stories, Comments, Data

Pricing

from $0.50005 / actor start

Extract top stories, trending posts, points, comment counts, and authors from the Hacker News front page. Real-time data export to JSON/CSV. Monitor tech trends, analyze viral content, and track HN activity. Fast Playwright scraper.

Developer

Brennan Crawford

Maintained by Community

Last modified

3 months ago

Hacker News Scraper for Apify

A production-ready Apify actor that scrapes stories from the Hacker News front page using Playwright.

🚀 Features

Scrapes Hacker News front page stories
Extracts comprehensive story data:
- Title and URL
- Points (upvotes)
- Author username
- Number of comments
- Time posted
- Story rank
- Hacker News discussion URL
Configurable number of stories to scrape
Option to include/exclude job posts
Built with Playwright for reliable scraping
Production-ready for Apify platform
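
The extraction listed above could look roughly like the following sketch. It is not the actor's actual `hackernews_scraper.py`: the function names are illustrative, and the CSS selectors (`tr.athing`, `span.titleline`, `span.score`, `a.hnuser`, `span.age`) are assumptions about the Hacker News markup that may need adjusting.

```python
import re
from typing import Optional

HN_BASE = "https://news.ycombinator.com/"


def first_int(text: Optional[str]) -> int:
    """Return the first integer in text such as '342 points', or 0 if none."""
    match = re.search(r"\d+", text or "")
    return int(match.group()) if match else 0


def comment_count(link_text: Optional[str]) -> int:
    """Return the count from '127 comments'; 0 for 'discuss' or an age link."""
    return first_int(link_text) if "comment" in (link_text or "") else 0


def scrape_front_page(max_stories: int = 30) -> list:
    """Load the front page once and return up to max_stories story dicts."""
    # Imported lazily so the helpers above work without Playwright installed.
    from playwright.sync_api import sync_playwright

    stories = []
    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        page = browser.new_page()
        page.goto(HN_BASE)  # single page load per run
        for row in page.query_selector_all("tr.athing")[:max_stories]:
            item_id = row.get_attribute("id")
            link = row.query_selector("span.titleline > a")
            rank = row.query_selector("span.rank")
            # Assumed layout: points, author and comments sit in the next row.
            sub = page.query_selector(f'tr[id="{item_id}"] + tr')
            score = sub.query_selector("span.score") if sub else None
            user = sub.query_selector("a.hnuser") if sub else None
            age = sub.query_selector("span.age") if sub else None
            links = sub.query_selector_all("a") if sub else []
            stories.append({
                "rank": first_int(rank.inner_text() if rank else None),
                "title": link.inner_text() if link else "",
                "url": link.get_attribute("href") if link else "",
                "points": first_int(score.inner_text() if score else None),
                "author": user.inner_text() if user else None,
                "comments": comment_count(links[-1].inner_text() if links else None),
                # Assumption: the title attribute carries the posting timestamp.
                "timeAgo": age.get_attribute("title") if age else None,
                "hackerNewsUrl": f"{HN_BASE}item?id={item_id}",
            })
        browser.close()
    return stories
```

Rows without a score or author (job posts, under the assumed markup) come back with `points` set to 0 and `author` set to `None`, which is one way an include/exclude option could tell them apart.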

📁 Project Structure

hackernews-scraper/
├── .actor/
│ ├── actor.json # Actor metadata and configuration
│ └── dataset_schema.json # Output data schema
├── apify_actor.py # Main actor entry point
├── hackernews_scraper.py # Core scraper implementation
├── Dockerfile # Docker configuration for Apify
├── requirements.txt # Python dependencies
├── INPUT_SCHEMA.json # Input configuration schema
└── README.md # This file

🔧 Local Testing

Prerequisites

Python 3.11+
pip

Installation

Install dependencies:

$ pip install -r requirements.txt

Install Playwright browsers:

$ playwright install chromium

Test the scraper locally:

$ python hackernews_scraper.py

🌐 Deploy to Apify

Prerequisites

Create an Apify account
Install Apify CLI: npm install -g apify-cli
Login: apify login

Deployment Steps

Navigate to project directory:

$ cd hackernews-scraper

Deploy to Apify:

$ apify push

Access your actor in the Apify Console

Running on Apify

Navigate to your actor in the Apify Console
Click "Run"
Configure input options (optional)
Click "Start" to run the actor
View results in the "Dataset" tab
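
The same run can also be started from code instead of the Console. A minimal sketch, assuming the `apify-client` Python package is installed, an API token is stored in the `APIFY_TOKEN` environment variable, and the actor ID is `fresh_cliff/hackernews-scraper` (taken from this page's URL). `build_run_input` and `run_actor` are illustrative names, not part of the actor.

```python
import os

ACTOR_ID = "fresh_cliff/hackernews-scraper"  # assumed from the actor page URL


def build_run_input(max_stories: int = 30, include_job_posts: bool = False) -> dict:
    """Build the actor input using the documented field names."""
    return {"maxStories": max_stories, "includeJobPosts": include_job_posts}


def run_actor(run_input: dict) -> list:
    """Start the actor, wait for it to finish, and return its dataset items."""
    # Imported lazily so build_run_input works without the client installed.
    from apify_client import ApifyClient  # pip install apify-client

    client = ApifyClient(os.environ["APIFY_TOKEN"])
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

Calling `run_actor(build_run_input(max_stories=10))` would then return the same items shown in the "Dataset" tab, as a list of dictionaries.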

⚙️ Input Configuration

Field	Type	Default	Description
`maxStories`	integer	30	Maximum number of stories to scrape (1-100)
`includeJobPosts`	boolean	false	Include "Who is hiring?" job posts

Example Input

{
  "maxStories": 30,
  "includeJobPosts": false
}
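
The table above gives a default and a range for each field. One way those rules could be applied to raw input is sketched below. `normalize_input` is a hypothetical helper, and clamping out-of-range values (rather than rejecting them) is an assumption, not documented behavior of this actor.

```python
from typing import Optional

DEFAULT_MAX_STORIES = 30
MIN_STORIES, MAX_STORIES = 1, 100


def normalize_input(raw: Optional[dict]) -> dict:
    """Fill in documented defaults and clamp maxStories to the 1-100 range."""
    raw = raw or {}
    try:
        max_stories = int(raw.get("maxStories", DEFAULT_MAX_STORIES))
    except (TypeError, ValueError):
        # Non-numeric or null values fall back to the default.
        max_stories = DEFAULT_MAX_STORIES
    max_stories = max(MIN_STORIES, min(MAX_STORIES, max_stories))
    # Only a real boolean true enables job posts; anything else means false.
    include_job_posts = raw.get("includeJobPosts", False) is True
    return {"maxStories": max_stories, "includeJobPosts": include_job_posts}
```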

📊 Output Format

Each story is returned as a JSON object with the following structure:

{
  "rank": 1,
  "title": "Show HN: I built a tool for...",
  "url": "https://example.com/article",
  "points": 342,
  "author": "username",
  "comments": 127,
  "timeAgo": "2024-01-15T10:30:00.000Z",
  "hackerNewsUrl": "https://news.ycombinator.com/item?id=12345678"
}

Output Fields

Field	Type	Description
`rank`	number	Story position on front page
`title`	string	Story title
`url`	string	Link to the story/article
`points`	number	Number of upvotes
`author`	string	Username who posted the story
`comments`	number	Number of comments
`timeAgo`	string	Timestamp when story was posted
`hackerNewsUrl`	string	URL to Hacker News discussion
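
Because every record has the same flat set of fields, converting results to CSV is straightforward. The sketch below uses only the Python standard library and the field names from the table above. `stories_to_csv` is an illustrative helper, not something the actor ships.

```python
import csv
import io

# Column order follows the Output Fields table.
FIELDS = [
    "rank", "title", "url", "points",
    "author", "comments", "timeAgo", "hackerNewsUrl",
]


def stories_to_csv(stories: list) -> str:
    """Serialize story dicts to CSV text, one row per story."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=FIELDS, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    for story in stories:
        # Missing fields become empty cells instead of raising an error.
        writer.writerow({field: story.get(field, "") for field in FIELDS})
    return buffer.getvalue()
```

Unknown keys are ignored and missing ones are written as empty cells, so the function tolerates records such as job posts that may lack points or an author.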

🛠️ Built With

Python 3.11 - Programming language
Playwright - Browser automation
Apify SDK - Actor framework
Following Apify best practices and patterns

📝 Use Cases

Monitor trending tech stories
Track specific topics on HN
Build custom HN readers/aggregators
Research what content performs well
Create HN analytics dashboards
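
As an illustration of topic tracking, the sketch below filters scraped records by keyword and a minimum score. `filter_stories` and its parameters are hypothetical. It assumes records shaped like the output format above.

```python
def filter_stories(stories: list, keywords: list, min_points: int = 0) -> list:
    """Keep stories whose title contains any keyword and has enough points."""
    wanted = [keyword.lower() for keyword in keywords]
    matches = []
    for story in stories:
        title = (story.get("title") or "").lower()
        points = story.get("points") or 0  # treat missing points as 0
        if points >= min_points and any(keyword in title for keyword in wanted):
            matches.append(story)
    return matches
```

Matching is a case-insensitive substring test on the title only, which is simple but will also match keywords inside longer words.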

🔒 Rate Limiting

The scraper is designed to be respectful of Hacker News:

Single page load per run
No aggressive pagination
Configurable limits on stories scraped

📄 License

This actor is provided as-is for use on the Apify platform.

🤝 Support

For issues or questions:

Check the Apify documentation
Open an issue in the repository
Contact via Apify platform

Ready to deploy in under 10 minutes! 🎉

URL: https://apify.com/fresh_cliff/hackernews-scraper