VOOZH about

URL: https://apify.com/klondikeking/wikipedia-article-scraper

โ‡ฑ Wikipedia Article Scraper - Search & Extract Content ยท Apify


๐Ÿ‘ Wikipedia Article Scraper - Search & Extract Content avatar

Wikipedia Article Scraper - Search & Extract Content

Pricing

$1.00 / 1,000 article extracteds

Go to Apify Store

Wikipedia Article Scraper - Search & Extract Content

Search and extract Wikipedia article metadata, summaries, and content via the official MediaWiki API. No scraping overhead โ€” pure API integration with high reliability.

Pricing

$1.00 / 1,000 article extracteds

Rating

0.0

(0)

Developer

๐Ÿ‘ Pierrick McD0nald

Pierrick McD0nald

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a month ago

Last modified

Share

Wikipedia Article Scraper โ€” Search & Extract Content

Extract Wikipedia article metadata, summaries, and content via the official MediaWiki API. This Actor searches Wikipedia by keyword and returns structured data for every matching article โ€” no browser overhead, no scraping complexity, just clean API integration.

Use Cases

  • Content Research โ€” Gather article summaries and metadata for academic research, content marketing, or knowledge base building.
  • SEO & Topic Analysis โ€” Extract word counts, article sizes, and publication dates to analyze content depth and freshness across topics.
  • Data Enrichment โ€” Augment datasets with Wikipedia summaries, thumbnail images, and canonical URLs for entity linking and NLP pipelines.
  • Multilingual Content โ€” Search across 300+ Wikipedia language editions to build localized content collections.

Input

FieldTypeRequiredDescription
searchQueryStringYesSearch term to find Wikipedia articles (e.g., "machine learning", "quantum computing")
maxResultsNumberNoMaximum articles to extract, 1โ€“500 (default: 25)
includeExtractBooleanNoFetch article introduction/summary text (default: true)
includeImagesBooleanNoFetch thumbnail image URLs (default: false)
languageStringNoWikipedia language code: en, es, fr, de, ja, etc. (default: "en")
proxyConfigurationObjectNoProxy settings (optional โ€” Wikipedia API does not require proxy)

Output

The Actor outputs a dataset with the following fields:

{
"pageId":233488,
"title":"Machine learning",
"url":"https://en.wikipedia.org/wiki/Machine_learning",
"snippet":"Machine learning (ML) is a field of study in artificial intelligence...",
"extract":"Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of statistical algorithms...",
"wordCount":15287,
"size":141291,
"thumbnail":"https://upload.wikimedia.org/wikipedia/commons/thumb/...",
"timestamp":"2026-05-15T10:30:00Z",
"language":"en"
}

Pricing

Pay per event: $0.001 per article extracted.

No minimums, no subscriptions. You only pay for the results you receive. The Wikipedia MediaWiki API is free and public, so compute costs are minimal and margins stay high.

Limitations

  • Maximum 500 results per run (Wikipedia API limit)
  • Article extracts are limited to the introduction/summary section
  • Thumbnail images are only available when includeImages is enabled and the article has an image
  • Rate limits apply per Wikipedia language edition (handled automatically with retries)

FAQ

Q: Do I need a Wikipedia API key? A: No. This Actor uses the public MediaWiki API with no authentication required.

Q: Can I search in languages other than English? A: Yes. Set the language field to any valid Wikipedia language code (e.g., "es" for Spanish, "ja" for Japanese).

Q: What happens if my search returns thousands of results? A: The Actor respects the maxResults limit and paginates through the API automatically. You only pay for the number of articles actually extracted.

Changelog

  • v1.0.0 โ€” Initial release

You might also like

Wikipedia Scraper

automation-lab/wikipedia-scraper

Search and extract Wikipedia articles โ€” titles, summaries, full content, categories, and images. Uses the free MediaWiki API.

๐Ÿ‘ User avatar

Stas Persiianenko

20

Wikipedia Article Scraper

crawlerbros/wikipedia-scraper

Extract structured data from Wikipedia articles. Get summaries, categories, images, metadata, and descriptions using Wikipedia's official API. Supports 300+ languages.

Wikipedia Scraper

gio21/wikipedia-scraper

Search Wikipedia and return article summaries or full text via the public REST API. Supports 300+ languages. Useful for knowledge extraction, research, content generation, and entity enrichment.

Wikipedia Article Scraper

rupom888/wikipedia-article-scraper

Scrape Wikipedia articles using the official MediaWiki REST API. Search by keyword, look up specific titles, or scrape by URL. Extracts full article text, sections, infobox data, categories, references, images, and related articles. Supports 300+ languages.