VOOZH about

URL: https://apify.com/khadinakbar/google-news-scraper

⇱ Google News Scraper Β· Apify


πŸ‘ Google News Scraper β€” Articles & Sources avatar

Google News Scraper β€” Articles & Sources

Pricing

from $3.00 / 1,000 article scrapeds

Go to Apify Store

Google News Scraper β€” Articles & Sources

Extract Google News articles by keyword, topic, or region. Returns title, source, snippet, link, published date, and image. No login or API key. MCP-ready.

Pricing

from $3.00 / 1,000 article scrapeds

Rating

0.0

(0)

Developer

πŸ‘ Khadin Akbar

Khadin Akbar

Maintained by Community

Actor stats

0

Bookmarked

23

Total users

9

Monthly active users

9 days ago

Last modified

Share

Extract structured news article data from Google News by keyword, topic, or URL β€” no login, API key, cookies, or browser required. Built for developers, data teams, AI/LLM pipelines, and automated news monitoring workflows.

Runs on Apify β€” schedule it, call it via API, chain it with other actors, or integrate it directly into your MCP-compatible AI agent.


What It Does

This actor fetches Google News RSS feeds, parses them into clean structured records, and optionally visits article pages to extract full body text. Each run produces a consistent JSON dataset you can export to CSV, Excel, JSON, or push to any downstream system.

Output fields per article: title, source_name, source_url, google_news_url, published_at, description, image_url, search_query, topic, full_text, word_count, scraped_at


Key Features

No login or API key required β€” works out of the box with no authentication setup.

Bulk keyword search β€” pass multiple search queries in one run. Each query gets its own RSS feed and result set. Run 50 queries in a single call.

Built-in topic sections β€” monitor World, Technology, Business, Science, Health, Sports, Entertainment, or Nation headlines with a single click.

50+ region/language editions β€” scrape Google News in any edition: US English, German, French, Japanese, Arabic, Spanish, and 45+ more. Every language correctly sets hl, gl, and ceid parameters.

Google search operators supported β€” use quotes for exact phrases, minus to exclude terms, OR for alternatives, and site: to filter by publisher:

OperatorExampleEffect
"exact phrase""interest rate hike"Match exact wording
-keywordapple -fruitExclude term
ORAI OR "machine learning"Either term
site:site:reuters.com earningsPublisher filter

Time range filtering β€” filter by past hour, 24 hours, 7 days, 30 days, or 1 year.

Full article text extraction β€” optionally visit each article URL and extract the full body text. Produces full_text and word_count fields. Designed for AI/RAG pipelines, sentiment analysis, and NLP workloads.

Deduplication β€” across multiple queries and topics, duplicate articles are removed automatically.

Custom topic URLs β€” paste any Google News section URL to extract niche topic feeds not covered by built-in sections.

Structured output schema β€” fully defined dataset schema with typed fields, descriptions, and examples. MCP-compatible: field names and descriptions are optimized for LLM tool-use routing.


Input Options

FieldDescription
searchQueriesArray of keywords/phrases. Supports Google operators.
topicsBuilt-in sections: WORLD, TECHNOLOGY, BUSINESS, etc.
topicUrlsCustom Google News section URLs (HTML or RSS format).
startUrlsRaw RSS feed URLs for advanced use.
maxResultsPerQuery1–100 articles per query/topic (default: 100).
regionLanguageEdition code like US:en, DE:de, JP:ja (50+ options).
timeRange1h, 1d, 7d, 30d, 1y, or any.
extractFullTextVisit article pages and extract body text.
decodeUrlsAttempt to resolve real article URLs from Google redirects.
deduplicateResultsRemove duplicate articles across queries (default: true).

Output Schema

Each article record has these fields:

FieldTypeDescription
titlestringArticle headline, cleaned (source name suffix removed)
source_namestringPublisher name (e.g. "Reuters", "BBC News")
source_urlstring|nullReal article URL (when resolvable)
google_news_urlstringGoogle News redirect URL (always present)
published_atstringISO 8601 publication datetime
descriptionstring|nullArticle snippet/summary
image_urlstring|nullArticle thumbnail image URL
search_querystring|nullThe query that found this article
topicstring|nullBuilt-in topic section (if applicable)
full_textstring|nullFull article body text (requires extractFullText)
word_countinteger|nullWord count of full text (requires extractFullText)
scraped_atstringISO 8601 extraction timestamp

Usage Examples

Monitor a brand or topic daily

{
"searchQueries":["OpenAI","Anthropic","Google DeepMind"],
"maxResultsPerQuery":20,
"timeRange":"1d",
"deduplicateResults":true
}

Collect top headlines across sections

{
"topics":["TECHNOLOGY","BUSINESS","SCIENCE","HEALTH"],
"maxResultsPerQuery":10
}

AI/RAG pipeline β€” full text extraction

{
"searchQueries":["large language models","AI regulation EU"],
"maxResultsPerQuery":15,
"extractFullText":true,
"timeRange":"7d"
}

Monitor non-English news

{
"searchQueries":["KΓΌnstliche Intelligenz","Bundesliga"],
"regionLanguage":"DE:de",
"maxResultsPerQuery":25
}

Bulk competitive intelligence

{
"searchQueries":[
"Tesla earnings","Ford EV","GM electric",
"Rivian news","Lucid Motors","NIO stock",
"BYD sales","Volkswagen EV","BMW electric","Mercedes EV"
],
"maxResultsPerQuery":10,
"timeRange":"7d",
"deduplicateResults":true
}

Performance & Cost

ModeArticles/minCost estimate
Metadata only~200$0.003 per article
With full text~30–60$0.003 + $0.005 per article

Example costs:

  • 100 articles (metadata): $0.30
  • 100 articles (with full text): $0.80
  • 1,000 articles (metadata only): $3.00
  • Daily monitoring, 10 queries Γ— 20 articles: $0.60/day

Google News RSS returns up to 100 articles per feed. A single run with 10 search queries can collect up to 1,000 articles.


API & MCP Integration

Call via Apify API

curl-X POST "https://api.apify.com/v2/acts/khadinakbar~google-news-scraper/runs?token=YOUR_TOKEN"\
-H"Content-Type: application/json"\
-d'{"searchQueries": ["artificial intelligence"], "maxResultsPerQuery": 10}'

Use in an AI agent (MCP)

This actor is optimized for use via the Apify MCP server. When an LLM agent calls call-actor with khadinakbar/google-news-scraper, the structured input schema and output schema enable precise tool-use routing without hallucination.

Typical LLM agent prompt:

"Get the latest 10 news articles about AI regulation from the past week using Google News"

Agent call:

{
"searchQueries":["AI regulation"],
"maxResultsPerQuery":10,
"timeRange":"7d"
}

Comparison vs. Competitors

FeatureThis Actordata_xplorereasyapiscrapestorm
Output schema (typed)βœ…βŒβŒβŒ
MCP-optimized fieldsβœ…βŒβŒβŒ
Bulk keywords (1 run)βœ…βœ…βŒβœ…
Custom topic URLsβœ…βŒβŒβŒ
Full text extractionβœ…βŒβŒβŒ
Deduplicationβœ…βŒβŒβŒ
50+ regionsβœ…βœ…βœ…βœ…
Dataset schemaβœ…βŒβŒβŒ
Limited permissionsβœ…βŒβŒβŒ

Notes

  • source_url may be null for some articles. Google News encodes article URLs using a Base64 scheme that requires JavaScript to decode. The decodeUrls option attempts HTTP redirect following but cannot resolve all URLs. The google_news_url field is always populated and can be used to visit the article in a browser.
  • full_text requires source_url to be resolvable. Articles where source_url is null will have full_text: null even when extractFullText is enabled.
  • Rate limits: Google News RSS feeds are public and rate-limit tolerant. This actor runs within safe request limits.
  • No blocked content: This actor only fetches publicly available RSS data and article pages. It does not bypass paywalls.

Frequently Asked Questions

How many articles can I scrape with Google News Scraper?

There is no hard cap. Each run is bounded by your maxResults input and the news available for the query, topic, or URL you provide. For very large monitoring jobs, split queries by topic, country, or time window and orchestrate them via the Apify API or scheduler.

Can I integrate Google News Scraper with other apps?

Yes. The actor pushes structured JSON to an Apify dataset, which integrates natively with Zapier, Make, n8n, Google Sheets, Airtable, and webhooks. You can also export CSV/JSON/XLSX for BI tools, RAG pipelines, or warehouses like BigQuery and Snowflake.

Can I use Google News Scraper with the Apify API?

Yes. Start runs, fetch dataset items, and stream output via the Apify REST API or any official SDK (JavaScript, Python). The "API & MCP Integration" section above shows the exact endpoints and parameters.

Can I use Google News Scraper through an MCP Server?

Yes. The actor is MCP-ready and can be called from Claude, Cursor, or any MCP-compatible agent via the Apify Actors MCP server. Tool names and input schema follow the standard Apify MCP conventions.

Do I need proxies to scrape Google News?

No. The actor uses Google News RSS feeds and public article pages, so no proxy configuration is required for typical workloads. Heavy-volume users can still attach Apify Proxy via input if they want extra rotation.

Is it legal to scrape Google News data?

This actor only fetches publicly available RSS feeds and article URLs. You are responsible for complying with Google's Terms of Service, the source publishers' terms, copyright law, and any regulations that apply in your jurisdiction (e.g. GDPR, CCPA). Do not use the output to redistribute full article text in ways that infringe publisher rights.

Your feedback

Found an issue, missing field, or have a feature request? Open an Issue on the actor page β€” feedback directly shapes the next release.


Related actors

You might also like

Google News Scraper - Articles by Keyword & Topic

fascinating_lentil/google-news-scraper

Scrape Google News articles by search keyword, topic, or top headlines. Extract titles, sources, links, publish dates, and snippets. No login or API key needed.

πŸ‘ User avatar

Md Jakaria Mirza

2

Google News Scraper

rupom888/google-news-scraper

Scrape Google News articles by keyword, topic (Technology, Business, Sports, etc.), or site. No API key needed. Returns title, URL, source, date, description, and image.

Google News Scraper

lhotanova/google-news-scraper

Gets featured articles from Google News with title, link, source, publication date and image.

πŸ‘ User avatar

KristΓ½na LhoΕ₯anovΓ‘

3.1K

4.6

Google News Scraper

dami_studio/google-news-scraper

Search Google News by keyword or topic and get clean, structured articles with the REAL publisher URL (not Google's redirect), source, date, and snippet. Optional full article text and AI summary + sentiment. No key, no login.

3

5.0

Google News Scraper ( Fast And Cheap )

hung_pham_manh/google-news-scraper-fast-and-cheap

Gets featured articles from Google News with title, link, source, publication date and image.

Google News Scraper

automation-lab/google-news-scraper

Extract Google News articles by keyword or topic: headlines, sources, publication dates, snippets, and URLs. Pure HTTP, no browser or API key needed. Export to JSON, CSV, or Excel.

πŸ‘ User avatar

Stas Persiianenko

234

Google News Scraper β€” Search & Topics RSS API

bovi/google-news-scraper

Scrape Google News by keyword or topic. Returns title, source, published date, direct RSS link, and parse_confidence. No proxy needed. Pay per result.

πŸ‘ User avatar

Vitalii Bondarev

2