VOOZH about

URL: https://apify.com/futurizerush/google-news-scraper

⇱ Google News Scraper Β· Apify


Pricing

$20.00 / 1,000 results

Go to Apify Store

Google News Scraper

Google News Search Scraper - Real-time news aggregation from Google News. Features smart article enrichment with full content extraction. Perfect for market research, trend analysis, and content monitoring.

Pricing

$20.00 / 1,000 results

Rating

5.0

(1)

Developer

πŸ‘ Rush

Rush

Maintained by Community

Actor stats

4

Bookmarked

106

Total users

8

Monthly active users

a month ago

Last modified

Share

Google News Scraper - Real-time News Data Extraction

Extract news articles from Google News RSS feeds with multi-language support, date filtering, and browser enrichment for actual article URLs and content.

What This Actor Does

Google News Scraper searches Google News RSS feeds by keyword, extracts article metadata, then uses a browser to follow Google News redirect links to resolve actual article URLs and extract content from the source websites.

Key Features

  • Keyword Search - Search for any topic across Google News
  • 40+ Languages - Support for English, Chinese (Simplified/Traditional), Japanese, Korean, Arabic, and more
  • 60+ Regions - Target news from specific countries
  • Date Filtering - Filter by past hour, 24 hours, week, or month
  • Browser Enrichment - Resolves actual article URLs, extracts images and article text
  • Resume on Migration - Saves progress and resumes if the Actor is migrated

Use Cases

  • Market Intelligence - Track industry trends and competitor news
  • Brand Monitoring - Monitor brand mentions across news sources worldwide
  • Content Aggregation - Build news dashboards and content platforms
  • Research - Collect news data for sentiment analysis and academic research
  • Media Monitoring - Track coverage of specific topics or events

Input Configuration

ParameterTypeDefaultDescription
searchQueries (required)array-Keywords to search for (1-50 queries, max 200 chars each)
regionstring"us"Country code for news results (e.g., "us", "uk", "tw", "jp", "de")
languagestring"en"Language code (e.g., "en", "zh-TW", "zh-CN", "ja", "ko", "es")
dateFilterstring"1d"Time period: "1h" (past hour), "1d" (24 hours), "1w" (week), "1m" (month), "" (all time)
maxResultsinteger20Maximum articles per query (10-200)

Output Data

Each article in the dataset contains:

FieldTypeDescription
titlestringArticle headline
googleNewsUrlstringOriginal Google News URL
articleUrlstring or nullActual article URL (resolved via browser enrichment)
sourcestringPublisher name (e.g., "TechCrunch")
websiteNamestring or nullWebsite name from meta tags
websiteUrlstring or nullPublisher's homepage URL
imageUrlstring or nullArticle image URL
pubDatestringPublication date from RSS feed
timestampstringISO 8601 timestamp
descriptionstringArticle summary from RSS
excerptstringExtended description (from enrichment or RSS)
articleContentobject or nullExtracted article text with character and token counts
enrichmentTimenumber or nullTime taken for browser enrichment in milliseconds
guidstringUnique article identifier
searchQuerystringThe keyword that found this article
regionstringRegion used for this search
languagestringLanguage used for this search
scrapedAtstringISO 8601 timestamp when the article was scraped

Note: Fields marked "or null" may be null when browser enrichment cannot access the target website (e.g., paywalled sites, geographic restrictions, or sites that block automated access).

When available, articleContent contains:

  • content β€” Article text (max 2,000 characters)
  • characterCount β€” Total character count of the content
  • tokenCount β€” Token count: each CJK character counts as 1 token, non-CJK text is split by spaces

Example Output

{
"title":"Major AI Breakthrough Announced by Research Team",
"googleNewsUrl":"https://news.google.com/rss/articles/...",
"articleUrl":"https://techcrunch.com/2026/02/27/ai-breakthrough",
"source":"TechCrunch",
"websiteName":"TechCrunch",
"websiteUrl":"https://techcrunch.com",
"imageUrl":"https://techcrunch.com/wp-content/uploads/...",
"pubDate":"Thu, 27 Feb 2026 10:30:00 GMT",
"timestamp":"2026-02-27T10:30:00.000Z",
"description":"Researchers announce breakthrough in artificial intelligence...",
"excerpt":"A team of researchers has announced a significant breakthrough...",
"articleContent":{
"content":"Full article text extracted from the website...",
"characterCount":1850,
"tokenCount":312
},
"enrichmentTime":4520,
"guid":"CBMi...",
"searchQuery":"artificial intelligence",
"region":"us",
"language":"en",
"scrapedAt":"2026-02-27T10:35:00.000Z"
}

Quick Start Examples

Monitor Technology News

{
"searchQueries":["artificial intelligence","ChatGPT","technology"],
"language":"en",
"region":"us",
"dateFilter":"1d",
"maxResults":50
}

Track Business News in Taiwan

{
"searchQueries":["台積電","η§‘ζŠ€ζ₯­","θ‚‘εΈ‚"],
"language":"zh-TW",
"region":"tw",
"dateFilter":"1w",
"maxResults":100
}

Real-time Breaking News

{
"searchQueries":["breaking news"],
"language":"en",
"region":"us",
"dateFilter":"1h",
"maxResults":20
}

Japanese Tech News

{
"searchQueries":["AI","γƒ†γ‚―γƒŽγƒ­γ‚ΈγƒΌ"],
"language":"ja",
"region":"jp",
"dateFilter":"1d",
"maxResults":30
}

How to Use

On Apify Platform

  1. Click Start on this Actor's page
  2. Enter your search keywords in the input form
  3. Select language, region, and time filter
  4. Click Run to start scraping
  5. View and export results in JSON, CSV, or Excel

Via API

const{ ApifyClient }=require('apify-client');
const client =newApifyClient({token:'YOUR_API_TOKEN'});
const run =await client.actor('YOUR_ACTOR_ID').call({
searchQueries:["your keywords"],
language:"en",
region:"us",
dateFilter:"1d",
maxResults:20
});
const{ items }=await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

Cost and Performance

  • Pricing: Pay-per-event β€” you are charged per Actor run and per result
  • Speed: RSS extraction is fast; browser enrichment adds ~5-10 seconds per article
  • Memory: Optimized for low memory usage with periodic browser restarts
  • Rate Limiting: Built-in rate limiting with exponential backoff to prevent blocking

Tips to Control Costs

  • Start with fewer maxResults (10-20) to test your keywords
  • Use specific keywords for more relevant results
  • Use dateFilter: "1d" to limit to recent articles
  • Fewer search queries per run = lower cost

FAQ

Why are some article URLs null? Some news websites block automated access, use paywalls, or have geographic restrictions. When the browser cannot follow the Google News redirect, the article URL and enriched content will be null. The basic article data (title, source, description) from the RSS feed is always available.

How do I choose the right language and region? Match the language to your target audience's reading language, and the region to the country whose news you want. For example, use language: "zh-TW" and region: "tw" for Traditional Chinese news from Taiwan.

Can I schedule this Actor to run automatically? Yes. On the Apify platform, you can set up schedules to run this Actor at regular intervals (e.g., every hour for breaking news monitoring).

What is the maximum number of articles I can get? Up to 200 articles per query, with up to 50 queries per run. Google News RSS feeds typically return 20-100 articles depending on the topic and time filter.

Does this Actor use proxies? No. This Actor accesses Google News RSS feeds directly, which are publicly available. No proxy or residential IP is needed.

You might also like

Google News Scraper

crawlerbros/google-news-scraper

Scrape Google News in real-time. Supports keyword search, date filters, full-text article extraction, and image extraction.

140

5.0

Google News Realtime Scraper

devisty/google-news

Provide real-time news and articles sourced from Google News

Google News Scraper

epctex/google-news-scraper

Unlock timely news insights with our Google News data retrieval tool. Get the latest news on any news at any time, and more. Effortless and powerful. πŸ“°πŸ” #NewsData

Google News Scraper

easyapi/google-news-scraper

Powerful Google News scraper, collect up to 5000 news articles with flexible search options, language support. Perfect for news aggregation, market research, and sentiment analysis. πŸ“°πŸ”

1.8K

3.8

News Website Crawler & Article Extractor

xtech/news-source-crawler

Scrape all articles from any news website. Extract full text, metadata, keywords, and summaries. Ideal for content analysis, research, and news aggregation.

Free Google News API β€” Search News by Keyword + Country

s-r/google-news

Free Google News scraper β€” get clean structured news results for any query, country, and language. Use it as a Google News API for brand monitoring, topic alerts, news clipping, and bulk article URL harvesting.

Article Extractor & News Scraper

web.harvester/article-extractor-news-scraper

Extract articles from any news site, blog, or webpage. Get title, full text, author, date, images & metadata using 7 extraction engines (Newspaper4k, Trafilatura, Goose3). Anti-bot bypass, proxy rotation, automatic fallback. Perfect for news monitoring, NLP datasets & content aggregation.

50

5.0

Google News Scraper

lhotanova/google-news-scraper

Gets featured articles from Google News with title, link, source, publication date and image.

πŸ‘ User avatar

KristΓ½na LhoΕ₯anovΓ‘

3.1K

4.6

Ultimate News API

glitch_404/Ultimate-News-Scraper

Scrape up to 10000 news articles from over 4500 news sources in less than 20 minutes, news from over 20 categories, e.g., Crypto news, World News, Latest News, Celebrities, and a lot more. You can find news on websites such as Fox News, BBC News, CNN, and Cryptocurrency-Related News Sources.

255

1.0

Feedly Scraper

mscraper/feedly-scraper

Feedly Scraper is a specialized web scraping tool designed to extract news from Feedly. The scraper exports the accumulated data to various formats like JSON, XML, CSV, or Excel