Guardian News Scraper

Pricing

from $2.00 / 1,000 results

Guardian News Scraper

Scrape full The Guardian articles with headline, body, authors, section, and tags. Supports `mode: latest` to get newest news via Guardian world RSS. HTTP-only.

Pricing

from $2.00 / 1,000 results

Rating

0.0

(0)

Developer

👁 Farhan Febrian Nauval

Farhan Febrian Nauval

Maintained by Community

Actor stats

Bookmarked

Total users

Monthly active users

11 days ago

Last modified

The Guardian Article Scraper

Extract full article text, author, publication date, section, and description from any theguardian.com article URL. The Guardian is one of the world's most-read English-language news sites with extensive international coverage across politics, culture, and science.

Why Use This Actor?

Academic research - Guardian long-form journalism is widely used in media studies and political research.
Content curation - aggregate Guardian articles by topic for newsletters or reading lists.
Sentiment and bias analysis - Guardian editorial stance makes it a reference in media bias research.
Open access - Guardian content is freely available globally with no paywall or geo-restriction.

How It Works

This actor uses only HTTP requests - no browser, no Selenium, no Playwright. Articles are extracted in seconds with RAM usage well under 256 MB.

Input

{
"url":"https://www.theguardian.com/world/2026/apr/13/example-article",
"urls":[
"https://www.theguardian.com/world/2026/apr/13/article-one",
"https://www.theguardian.com/technology/2026/apr/12/article-two"
]
,
"mode":"article",
"limit":10
}

Output

{
"url":"https://www.theguardian.com/world/2026/may/15/mali-airstrikes-rebel-alliance-separatists",
"source":"The Guardian",
"title":"Mali’s forces target rebel alliance in junta’s fight to keep power",
"description":"Army supported by Russian mercenaries launches airstrikes after offensive by coalition of Islamist extremists and Tuareg separatists",
"content":"Mali’s armed forces, supported by Russian mercenaries, have launched airstrikes targeting a rebel alliance of Islamist extremists and Tuareg separatists as the ruling junta struggles to maintain its hold on power in the unstable west African country. Earlier this week warplanes targeted the key northern town of Kidal,which was lostwhen the rebels launched a surprise offensive across much of Mali in late April....",
"image":"https://i.guim.co.uk/img/media/e6d26af1123d872554af9a427c5d33abf01bc499/650_22_3090_2473/master/3090.jpg?width=1200&height=630&quality=85&auto=format&fit=crop&precrop=40:21,offset-x50,offset-y0&overlay-align=bottom%2Cleft&overlay-width=100p&overlay-base64=L2ltZy9zdGF0aWMvb3ZlcmxheXMvdGctZGVmYXVsdC5wbmc&enable=upscale&s=46f9527d36a676fc922f988649bb5fe9",
"language":"en",
"word_count":847,
"published_date":"2026-05-15T14:57:35.000Z",
"modified_date":"",
"authors":[],
"categories":"",
"tags":""
}

Fetch Latest News

Set mode to "latest" to fetch the newest article URLs and titles from The Guardian instead of extracting a single article.

Input:

{
"mode":"latest",
"limit":10
}

Output - array of objects:

[
{
"url":"https://www.theguardian.com/world/2026/apr/20/madagascar-gen-z-protesters-fear-new-regime",
"title":"Arrests fuel fears among Madagascar’s gen Z protesters that new regime no better than one they overthrew",
"published_date":"Mon, 20 Apr 2026 04:00:02 GMT",
"source":"The Guardian"
}
//...
]

Source: https://www.theguardian.com/world/rss (RSS feed)

Cron Schedule: Auto-Fetch Newest Articles

Combine mode: "latest" and mode: "article" to keep a fresh feed running on autopilot:

Schedule a recurring run of this Actor with {"mode": "latest", "limit": 20} via Apify Schedules (UI ▸ Schedules ▸ Create new). A cron expression like */30 * * * * runs it every 30 minutes.
Webhook the dataset of the latest run into another Actor run with mode: "article" and the new URLs as input — Apify integrations let you chain runs via the "Actor finished" webhook without any glue code.
The article-mode run extracts the full body, image, authors, and metadata for each URL and appends to your master dataset.

Common cron expressions:

Frequency	Cron
Every 15 minutes	`/15 * * *`
Hourly	`0 * * * *`
Every 6 hours	`0 /6 * *`
Daily at 06:00 UTC	`0 6 * * *`

Notes

The Guardian rarely paywalls content; full article text is usually returned
For high-volume production use, register for The Guardian's free Content API

Other News Actors

Need a different news source? All actors in this collection:

Actor	Source
`aljazeera-scraper`	Al Jazeera
`apnews-scraper`	AP News
`bbc-scraper`	BBC News
`cnbc-scraper`	CNBC
`forbes-scraper`	Forbes
`fortune-scraper`	Fortune
`ft-scraper`	Financial Times
`guardian-scraper`	The Guardian
`msn-scraper`	MSN News
`nytimes-scraper`	New York Times
`reuters-scraper`	Reuters
`scmp-scraper`	South China Morning Post
`techcrunch-scraper`	TechCrunch
`upi-scraper`	UPI
`yahoo-finance-scraper`	Yahoo Finance
`smart-news-loader`	Any URL - adaptive HTTP loader
`bloomberg-scraper`	Bloomberg

All actors support mode: "latest" for fetching newest article URLs from each source.

Guardian Scraper

chimerical_quicklime/guardian-scraper

Scrape The Guardian articles via the open Content API: title, section, byline, publication date, trail text, thumbnail, and URL. Filter by query or section. Built for news monitoring and media datasets.

👁 User avatar

Khrystyna Skotte

👁 Guardian Singapore Reviews Scraper avatar

Guardian Singapore Reviews Scraper

hello.datawizards/Guardian-Singapore-Scraper

The Guardian Singapore Reviews Scraper extracts real customer reviews, ratings, and product insights from Guardian Singapore product pages in structured JSON. Ideal for market research, brand analysis, and consumer sentiment tracking with fast, accurate, and proxy-supported scraping.

👁 User avatar

datawizards

👁 The Guardian Article Search & Archive Scraper avatar

The Guardian Article Search & Archive Scraper

parseforge/guardian-content-search-scraper

Search The Guardian's full article archive (2.6M+ articles since 1999). Filter by query, section, tag, contributor, date, or production office. Returns headline, byline, body, tags, contributors, and publication metadata.

👁 User avatar

ParseForge

👁 The Guardian Scraper avatar

The Guardian Scraper

theo/the-guardian-scraper

Scrape news data from theguardian.com with this unofficial API. Extract articles, monitor their popularity and performance and automate the fight against fake news. Filter the results by authors, topics, categories, or publication dates. Preview or download the results in your preferred format.

👁 User avatar

Theo Vasilis

👁 Universal News Scraper avatar

Universal News Scraper

moving_beacon-owner1/my-actor-62

Universal News Scraper Scrapes BBC, CNN, Reuters, Al Jazeera, The Guardian, and NYT using RSS feeds + web scraping. No API keys or login needed.

👁 User avatar

Jamshaid Arif

Aljazeera News Scraper

flamboyant_liner/aljazeera-news-scraper

Scrape Al Jazeera English news articles: headline, author, date, body, images, section, keywords. Filter by section. Real-time international news data.

👁 User avatar

Khrystyna Skotte

👁 Child Content Guardian avatar

Child Content Guardian

minionbond/child-content-guardian

Is your child watching... What ...?

👁 User avatar

Harshad Velapure

👁 Reuters News Scraper avatar

Reuters News Scraper

xtracto/reuters-scraper

Extract full Reuters wire articles. Bypasses DataDome bot protection - no residential proxies required. Supports `mode: latest` to get newest news. HTTP-only.

👁 User avatar

Farhan Febrian Nauval

👁 BBC News Articles Scraper | UK and World Headlines avatar

BBC News Articles Scraper | UK and World Headlines

parseforge/bbc-news-articles-scraper

Collect BBC News articles with headline, author, date, section, summary, and full body text. Filter by topic, region, or keyword. Useful for media monitoring, sentiment analysis, NLP training datasets, and competitive intelligence across global news.

👁 User avatar

ParseForge

👁 Google Latest News Scraper avatar

Google Latest News Scraper

mansurmqlfelvin/google-latest-news-scraper

Google Latest News Scraper

👁 User avatar

Jerry Chao

URL: https://apify.com/xtracto/guardian-scraper