URL: https://apify.com/oneary/news-scraper

News Scraper - Monitor News Articles & Headlines · Apify


๐Ÿ‘ News Scraper โ€” Monitor News Articles & Headlines avatar

News Scraper โ€” Monitor News Articles & Headlines

Pricing

$12.00 / 1,000 results

Scrape news articles from major publishers. Extract headlines, full content, authors, publish dates, and images for media monitoring.

Rating: 0.0 (0)

Developer: Luan M. (Maintained by Community)

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 3 days ago

News & Monitoring Scraper - Apify Actor

Scrapes news articles from specified sources using Playwright (headless browser) and Crawlee, with content extraction via Mozilla Readability. Supports keyword filtering, deduplication, date range filtering, and optional full-text extraction.

Features

  • Playwright-powered: handles JavaScript-rendered pages
  • Smart extraction: article title, author, date, category, content summary, and optional full text
  • Keyword filtering: only keep articles matching one or more keywords
  • Date range filtering: restrict by dateFrom / dateTo
  • Deduplication: URL-based dedup within a single run
  • Keyword auto-extraction: extracts relevant keywords from article content
  • Proxy support: integrates with Apify proxy for reliable scraping
  • Multiple output formats: JSON, CSV, or JSON array
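
The filtering and deduplication features above can be pictured with a short sketch. It is not the actor's own code: the helper names (matchesKeywords, inDateRange, dedupeByUrl, applyFilters) are hypothetical, keyword matching is assumed to be a plain substring test on the title and summary, and articles without a detected date are assumed to pass the date filter.

```javascript
// Hypothetical helpers illustrating the filters described above.
// None of these names come from the actor's source code.

// Case-insensitive substring match; an empty keyword list keeps everything.
function matchesKeywords(article, keywords) {
  if (keywords.length === 0) return true;
  const haystack = `${article.title} ${article.contentSummary ?? ''}`.toLowerCase();
  return keywords.some((kw) => haystack.includes(kw.toLowerCase()));
}

// Inclusive date range check; empty bounds are ignored.
// Articles with no detected date are kept (an assumption of this sketch).
function inDateRange(article, dateFrom, dateTo) {
  if (!article.publishedDate) return true;
  const day = article.publishedDate.slice(0, 10); // "YYYY-MM-DD"
  if (dateFrom && day < dateFrom) return false;
  if (dateTo && day > dateTo) return false;
  return true;
}

// URL-based deduplication within one run: the first occurrence wins.
function dedupeByUrl(articles) {
  const seen = new Set();
  return articles.filter((article) => {
    if (seen.has(article.url)) return false;
    seen.add(article.url);
    return true;
  });
}

// Applies deduplication first, then the keyword and date filters.
function applyFilters(articles, { keywords = [], dateFrom = '', dateTo = '' } = {}) {
  return dedupeByUrl(articles)
    .filter((article) => matchesKeywords(article, keywords))
    .filter((article) => inDateRange(article, dateFrom, dateTo));
}
```

Comparing the first ten characters of an ISO timestamp as strings works because "YYYY-MM-DD" sorts lexicographically in date order.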

Input

Field               Type     Required  Default         Description
startUrls           array    yes       (none)          One or more news article / index URLs to scrape
maxArticles         integer  no        100             Max articles to scrape (0 = unlimited)
proxyConfiguration  object   no        Apify proxy on  Proxy settings
keywords            array    no        []              Case-insensitive keyword whitelist (empty = keep all)
sources             array    no        []              Descriptive source names (defaults to domain)
dateFrom            string   no        ""              ISO date filter (e.g. "2025-01-01")
dateTo              string   no        ""              ISO date filter (e.g. "2025-12-31")
includeFullText     boolean  no        false           Extract full article text via Readability
outputFormat        string   no        "json"          json, csv, or jsonArray
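
As an illustration of the input fields, the sketch below merges a user-supplied object with the documented defaults and rejects values the table does not allow. The helper buildInput and the example URL are hypothetical, and startUrls entries are written as plain strings here; the actor may instead expect objects such as { url: "..." }, which the listing does not specify.

```javascript
// Defaults copied from the input table; proxyConfiguration is left out so
// the actor's own default (Apify proxy on) applies.
const INPUT_DEFAULTS = {
  maxArticles: 100,
  keywords: [],
  sources: [],
  dateFrom: '',
  dateTo: '',
  includeFullText: false,
  outputFormat: 'json',
};

// Hypothetical helper: fills in defaults and validates the documented constraints.
function buildInput(userInput) {
  const input = { ...INPUT_DEFAULTS, ...userInput };
  if (!Array.isArray(input.startUrls) || input.startUrls.length === 0) {
    throw new Error('startUrls is required and must be a non-empty array');
  }
  if (!Number.isInteger(input.maxArticles) || input.maxArticles < 0) {
    throw new Error('maxArticles must be a non-negative integer (0 = unlimited)');
  }
  if (!['json', 'csv', 'jsonArray'].includes(input.outputFormat)) {
    throw new Error(`Unsupported outputFormat: ${input.outputFormat}`);
  }
  return input;
}

// Example: up to 50 election-related articles published since 2025-01-01.
const exampleInput = buildInput({
  startUrls: ['https://example.com/news'], // placeholder URL
  keywords: ['election'],
  dateFrom: '2025-01-01',
  maxArticles: 50,
});
```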

Output

Each dataset item contains:

Field           Type                   Description
title           string                 Article title
url             string                 Source URL
publishedDate   string (ISO) | null    Detected publish date
author          string | null          Detected author
contentSummary  string                 First 500 chars or meta description
fullText        string | null          Full article text (if includeFullText enabled)
source          string                 Source name or domain
category        string | null          Detected category/section
keywords        array                  Auto-extracted keywords
scrapedAt       string (ISO)           Timestamp of scrape
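
To show how the nullable and array fields might be handled downstream, here is a minimal sketch that flattens dataset items into CSV text on the consumer side. It is independent of the actor's built-in csv output format, whose exact layout the listing does not describe; the names FIELDS, csvCell, and toCsv are hypothetical, and joining keywords with "; " is an arbitrary choice.

```javascript
// Column order follows the output table above.
const FIELDS = [
  'title', 'url', 'publishedDate', 'author', 'contentSummary',
  'fullText', 'source', 'category', 'keywords', 'scrapedAt',
];

// Converts one value to a CSV cell: null becomes empty, arrays are joined,
// and cells containing quotes, commas, or line breaks are quoted.
function csvCell(value) {
  if (value === null || value === undefined) return '';
  const text = Array.isArray(value) ? value.join('; ') : String(value);
  return /[",\n\r]/.test(text) ? `"${text.replace(/"/g, '""')}"` : text;
}

// Builds a header row followed by one row per dataset item.
function toCsv(items) {
  const rows = items.map((item) => FIELDS.map((field) => csvCell(item[field])).join(','));
  return [FIELDS.join(','), ...rows].join('\n');
}
```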

Usage

Local development

npm install
npm start

Environment variables

  • APIFY_TOKEN: Apify API token (for proxy & dataset access)
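
Beyond local development, the same token could be used to start the published actor over HTTP. The sketch below is an assumption-laden example rather than documented usage: it targets Apify's general "run-sync-get-dataset-items" endpoint, derives the actor ID oneary~news-scraper from the page URL, and relies on the global fetch available in Node.js 18 and later. The function names buildRunRequest and runScraper are hypothetical.

```javascript
// Builds the HTTP request for a synchronous run that returns dataset items.
// In Apify API paths the "/" in "oneary/news-scraper" is written as "~".
function buildRunRequest(token, input) {
  if (!token) throw new Error('APIFY_TOKEN is not set');
  return {
    url: 'https://api.apify.com/v2/acts/oneary~news-scraper/run-sync-get-dataset-items',
    options: {
      method: 'POST',
      headers: {
        Authorization: `Bearer ${token}`,
        'Content-Type': 'application/json',
      },
      body: JSON.stringify(input),
    },
  };
}

// Sends the request and resolves with the parsed dataset items.
// Not invoked here, because it needs a valid token and network access.
async function runScraper(input) {
  const { url, options } = buildRunRequest(process.env.APIFY_TOKEN, input);
  const response = await fetch(url, options);
  if (!response.ok) throw new Error(`Apify API returned ${response.status}`);
  return response.json();
}
```

Sending the token in the Authorization header keeps it out of URLs and logs.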

Technical stack

  • Node.js (ESM)
  • Crawlee: crawling & request management
  • Playwright: headless browser
  • @mozilla/readability: article extraction
  • jsdom: DOM parsing for metadata
  • Apify SDK: proxy, dataset, platform integration

License

Apache-2.0

You might also like

Google News Scraper - Headlines, Articles & News Data

oneary/google-news-scraper

Extract the latest Google News articles by keyword. Get headlines, publishers, snippets, publish dates, and article URLs. Perfect for media monitoring, news aggregation, and trend tracking.

Google News Scraper

piotrv1001/google-news-scraper

Scrapes news articles from Google News, extracting titles, sources, publication dates, and links. Search by keywords, browse by topic, or get top headlines with multi-language and region support. Ideal for news monitoring, media analysis, and content aggregation.

Bloomberg Category News Scraper

piotrv1001/bloomberg-category-news-scraper

The Bloomberg Category News Scraper extracts news articles from Bloomberg by category, capturing headlines, authors, publish dates, images, and article links. Ideal for news aggregation, market analysis, and trend monitoring.

65

5.0

Google News Scraper

parseforge/google-news-scraper

Monitor the news automatically with our Google News scraper. Track articles by keyword or topic with flexible date filtering and multi language support. Access structured data including headlines, publishers, links, and more. Built for teams that need reliable news insights without manual work.

Google News Scraper - Articles by Keyword & Topic

fascinating_lentil/google-news-scraper

Scrape Google News articles by search keyword, topic, or top headlines. Extract titles, sources, links, publish dates, and snippets. No login or API key needed.

๐Ÿ‘ User avatar

Md Jakaria Mirza

2

Google News Scraper

scrapebase/google-news-scraper

Stay on top of breaking stories with this Google News scraper 📰⚡ Extract headlines, sources, publish dates, snippets, links, and more from Google News results. Perfect for trend tracking, media monitoring, research, and content planning. Get fresh news data fast 🚀