VOOZH about

URL: https://apify.com/extremescrapes/news-article-to-markdown

โ‡ฑ News Article To Markdown ยท Apify


Pricing

from $50.00 / 1,000 result extracteds

Go to Apify Store

News Article To Markdown

Extract news articles as clean, ad-free Markdown with automatic author and publish date detection.

Pricing

from $50.00 / 1,000 result extracteds

Rating

0.0

(0)

Developer

๐Ÿ‘ Extreme Scrapes

Extreme Scrapes

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

6 days ago

Last modified

Categories

Share

Extract news articles as clean, ad-free Markdown with automatic author and publish date detection. Strips navigation, ads, related articles, comments, and social sharing widgets.

Features

  • Clean extraction โ€” removes nav, footer, ads, related articles, newsletters, comments, social widgets
  • Author detection โ€” automatically extracts author name from article header
  • Date detection โ€” automatically extracts publish date in various formats
  • Image captions โ€” generates alt text for images lacking captions
  • Batch processing โ€” extract multiple articles in a single run
  • Works with any news site โ€” BBC, CNN, Reuters, NYT, TechCrunch, etc.

How It Works

  1. Provide news article URLs as input.
  2. The Actor fetches each article, stripping all non-content elements.
  3. Author and publish date are extracted from the first 20 lines.
  4. Clean Markdown with metadata is stored in the Apify dataset.

Input

{
"startUrls":[
{"url":"https://www.bbc.com/news/technology-67988517"},
{"url":"https://techcrunch.com/2024/01/15/some-article"}
]
}

Output

{
"url":"https://www.bbc.com/news/technology-67988517",
"author":"Jane Smith",
"publishDate":"2024-01-15",
"markdown":"# Article Title\n\nArticle content..."
}

Use Cases

  • Media monitoring and news aggregation
  • Build news datasets for AI training
  • Track coverage of specific topics
  • Feed news into summarization pipelines

Keywords

news scraper, article extractor, news to markdown, media monitoring, news parser, article scraper

Pricing

$50 per 1,000 article extractions.

You might also like

Medium Article Extractor

extremescrapes/medium-article-extractor

Extract news articles as clean, ad-free Markdown with automatic author and publish date detection.

๐Ÿ‘ User avatar

Extreme Scrapes

2

Fast News Content Scraper

datapilot/fast-news-content-scraper

Fast News Content Scraper Actor collects news articles using Fast News RSS and . It extracts title, URL, publish date, author, description, and full article text. Supports multiple queries, anti-bot delays, and outputs structured JSON with source site and scrape timestamp.

Free Google News API โ€” Search News by Keyword + Country

s-r/google-news

Free Google News scraper โ€” get clean structured news results for any query, country, and language. Use it as a Google News API for brand monitoring, topic alerts, news clipping, and bulk article URL harvesting.

Google News Article Scraper

webscrap18/google-news-article-scraper

Scrape Google News, Extract full content with Title, Article Text, Images and Structured data.

News Website Crawler & Article Extractor

xtech/news-source-crawler

Scrape all articles from any news website. Extract full text, metadata, keywords, and summaries. Ideal for content analysis, research, and news aggregation.