URL: https://apify.com/codingfrontend/any-website-url-to-article-summarizer

Any Website URL to Article Summarizer

Pricing

from $4.99 / 1,000 results

Extract and summarize articles from any website URL. Returns title, author, publish date, word count, reading time, full text, and a concise AI-style summary using extractive summarization.

Rating

0.0

(0)

Developer

๐Ÿ‘ Coding Frontned

Coding Frontned

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 8 days ago

Extract and summarize article content from any website URL. Works on news sites, blogs, Wikipedia, Medium, documentation, and more.

Uses extractive summarization (selects the most important sentences from the article itself), so no external AI API key is required.

Features

  • Extract articles from any URL: news sites, blogs, Wikipedia, Medium, etc.
  • Automatic extractive summary generation (selects the most informative sentences)
  • Key points: the top 5 most important sentences from the article
  • Extracts title, author, publish date, description, and hero image
  • Word count and reading time estimation
  • No AI API key required: all summarization happens locally
  • Supports multiple URLs per run
  • Optional: include full cleaned article text in output

How Summarization Works

This actor uses extractive summarization:

  1. Article text is cleaned and split into sentences
  2. Each sentence is scored by word frequency (TF-style scoring)
  3. A position bias boosts the scores of early sentences, since introductions are typically more important
  4. The top-scoring sentences are selected and returned in their original reading order

Because it relies only on word frequencies and sentence position, this approach applies across languages and domains without requiring an LLM or external API.
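
The steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the actor's actual code: the sentence splitter, the mean-frequency score, the linear position bias, and the `position_weight` parameter are all hypothetical choices, and no stop-word filtering is applied. Only the sentence counts per length setting come from the Input table.

```python
import re
from collections import Counter

# Sentence counts per summaryLength value, taken from the Input table.
SENTENCES_BY_LENGTH = {"short": 3, "medium": 5, "long": 8}


def split_sentences(text: str) -> list[str]:
    """Naive splitter (assumption): break after ., ! or ? followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def summarize(text: str, summary_length: str = "medium",
              position_weight: float = 0.2) -> list[str]:
    """Return the top-scoring sentences in their original reading order."""
    sentences = split_sentences(text)
    if not sentences:
        return []
    tokenized = [re.findall(r"\w+", s.lower()) for s in sentences]
    freq = Counter(word for words in tokenized for word in words)

    scored = []
    for index, words in enumerate(tokenized):
        if not words:
            continue
        # TF-style score: mean document-wide frequency of the sentence's words.
        tf_score = sum(freq[w] for w in words) / len(words)
        # Position bias (assumed form): earlier sentences get a larger multiplier.
        bias = 1.0 + position_weight * (1.0 - index / len(sentences))
        scored.append((tf_score * bias, index))

    limit = SENTENCES_BY_LENGTH[summary_length]
    top = sorted(scored, reverse=True)[:limit]
    # Restore reading order before returning.
    return [sentences[i] for _, i in sorted(top, key=lambda pair: pair[1])]
```

Calling `summarize(article_text, "short")` returns at most three sentences, in the order they appear in the article. Applying the bias before selection is what lets introductory sentences win ties against later ones.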

Input

Field            Type     Default   Description
urls             array    required  List of article URLs to summarize
summaryLength    string   "medium"  short (3 sentences), medium (5), long (8)
includeFullText  boolean  false     Include full cleaned article text in output
maxItems         integer  10        Maximum number of articles to process

Example Input

{
  "urls": [
    "https://en.wikipedia.org/wiki/Artificial_intelligence",
    "https://techcrunch.com/2024/01/01/sample-article/"
  ],
  "summaryLength": "medium",
  "includeFullText": false,
  "maxItems": 10
}
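
An input like the one above can also be assembled programmatically. The helper below is a hypothetical sketch: `build_run_input` is not part of the actor or of any Apify library, and its checks (absolute http(s) URLs, a positive `maxItems`) are assumptions, since the actor's own validation rules are not documented here. Only the field names, defaults, and allowed `summaryLength` values come from the Input table.

```python
from urllib.parse import urlparse

# Allowed values for summaryLength, taken from the Input table.
ALLOWED_SUMMARY_LENGTHS = ("short", "medium", "long")


def build_run_input(urls, summary_length="medium",
                    include_full_text=False, max_items=10):
    """Validate arguments and return a dict shaped like the example input."""
    if not urls:
        raise ValueError("urls is required and must not be empty")
    for url in urls:
        parsed = urlparse(url)
        # Assumed rule: only absolute http(s) URLs are accepted.
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            raise ValueError(f"not an absolute http(s) URL: {url!r}")
    if summary_length not in ALLOWED_SUMMARY_LENGTHS:
        raise ValueError(
            f"summaryLength must be one of {ALLOWED_SUMMARY_LENGTHS}")
    if max_items < 1:
        raise ValueError("maxItems must be a positive integer")
    return {
        "urls": list(urls),
        "summaryLength": summary_length,
        "includeFullText": bool(include_full_text),
        "maxItems": int(max_items),
    }
```

Serializing the returned dictionary with `json.dumps` yields JSON in the same shape as the example input.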

Output

Each dataset record represents one summarized article:

Field        Type         Description
position     integer      Position in results
url          string       Article URL
domain       string       Website domain (e.g. "techcrunch.com")
title        string       Article title
author       string|null  Article author name
publishDate  string|null  Publish date (ISO format or raw string)
description  string|null  Meta description or excerpt
summary      string       Extractive summary of the article
keyPoints    array        Top 5 key sentences from the article
wordCount    integer      Total word count
readingTime  string       Estimated reading time (e.g. "5 min read")
image        string|null  Hero image URL (og:image)
siteName     string|null  Website name (og:site_name)
language     string|null  Document language code
fullText     string       Full cleaned article text (if includeFullText=true)
scrapedAt    string       ISO 8601 scrape timestamp

Example output

{
  "position": 1,
  "url": "https://en.wikipedia.org/wiki/Machine_learning",
  "domain": "en.wikipedia.org",
  "title": "Machine learning - Wikipedia",
  "author": null,
  "publishDate": null,
  "description": "Machine learning (ML) is a field of study...",
  "summary": "Machine learning is a subset of artificial intelligence...",
  "keyPoints": ["Machine learning models are often vulnerable to...", "..."],
  "wordCount": 9653,
  "readingTime": "48 min read",
  "image": null,
  "siteName": null,
  "language": "en",
  "scrapedAt": "2025-08-01T12:00:00.000Z"
}
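
Two of the derived fields, `readingTime` and `domain`, are simple to reproduce. The sketch below is illustrative only: the 200 words-per-minute rate is an assumption inferred from the example (9,653 words reported as "48 min read"), and the actor's actual rate, rounding rule, and domain handling (for instance, whether a "www." prefix is stripped) are not documented. The function names are hypothetical.

```python
from urllib.parse import urlparse


def reading_time(word_count: int, words_per_minute: int = 200) -> str:
    """Format an estimate like "5 min read"; 200 wpm is an assumed rate."""
    # Round to the nearest minute, with a floor of one minute.
    minutes = max(1, round(word_count / words_per_minute))
    return f"{minutes} min read"


def domain_of(url: str) -> str:
    """Return the host part of a URL, or an empty string if there is none."""
    return urlparse(url).hostname or ""
```

With these assumptions, `reading_time(9653)` gives "48 min read" and `domain_of("https://en.wikipedia.org/wiki/Machine_learning")` gives "en.wikipedia.org", both matching the example record.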

Dataset Views

  • Articles Overview: table with title, author, date, word count, reading time, URL, and summary
  • Summaries: focused view showing title, summary, key points, URL, and domain

Technical Notes

  • Uses a real Google Chrome browser (via Playwright) to handle JavaScript-rendered pages
  • Injects browser fingerprints for natural browser behavior
  • Article content is extracted using a multi-selector heuristic that prioritizes <article>, [itemprop="articleBody"], and common blog/CMS CSS classes
  • Wikipedia [edit] and footnote [1] markers are automatically removed
  • Reference sections (.reflist, .references) are removed from Wikipedia pages
  • For paywalled articles, only publicly visible content is extracted
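
The marker cleanup described above could look roughly like the following. This is a minimal sketch under assumptions: the regular expressions and the `strip_wiki_markers` name are hypothetical, and the actor may instead remove these markers at the DOM level before the text is extracted.

```python
import re

# "[edit]" section links and numeric footnote markers such as "[1]" or "[23]".
EDIT_MARKER = re.compile(r"\[\s*edit\s*\]", re.IGNORECASE)
FOOTNOTE_MARKER = re.compile(r"\[\d+\]")


def strip_wiki_markers(text: str) -> str:
    """Remove [edit] and [n] markers, then tidy leftover whitespace."""
    text = EDIT_MARKER.sub("", text)
    text = FOOTNOTE_MARKER.sub("", text)
    # Collapse runs of spaces or tabs left behind by removed markers.
    text = re.sub(r"[ \t]{2,}", " ", text)
    # Drop whitespace that now sits directly before punctuation.
    text = re.sub(r"\s+([.,;:!?])", r"\1", text)
    return text.strip()
```

Running this on text before sentence splitting keeps the markers out of both the word-frequency counts and the final summary.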

License

Apache-2.0
