URL: https://apify.com/jacksu/public-article-intelligence-agent

Public Article Intelligence & Citation Extractor · Apify


Pricing

from $5.00 / 1,000 useful article results

Extract clean article text, metadata, summaries, citations, diagnostics, and change signals from public article URLs.

Rating

0.0 (0 ratings)

Developer

jack su

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 9 days ago

Extract clean article text, metadata, summary bullets, source snippets, and change signals from public article URLs.

This Actor is designed for AI agents, RAG preparation, newsletter workflows, SEO review, competitive research, and content monitoring where a generic web scraper is too noisy or unpredictable.

What It Returns

  • Clean article text and preview
  • Title, description, author, dates, canonical URL, language, and keywords
  • Deterministic summary bullets
  • Matched focus terms
  • Content hash and new, changed, or unchanged status
  • Evidence snippets and evidence URLs
  • Confidence, completeness, missing fields, diagnostics, and readable errors
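
The listing does not say how the content hash or the new, changed, or unchanged status is computed. A minimal sketch, assuming a SHA-256 digest over whitespace-normalized article text and a previously stored hash per URL (the function names `content_hash` and `change_status` are hypothetical, not part of the Actor):

```python
import hashlib
from typing import Optional


def content_hash(text: str) -> str:
    """Hash whitespace-normalized text so trivial reflows do not count as changes."""
    normalized = " ".join(text.split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def change_status(current_text: str, previous_hash: Optional[str]) -> str:
    """Classify an article against the hash stored from an earlier run (assumed storage)."""
    if previous_hash is None:
        return "new"  # no earlier record for this URL
    if content_hash(current_text) == previous_hash:
        return "unchanged"
    return "changed"
```

Normalizing whitespace before hashing is one possible design choice; it keeps the status stable when only line breaks or spacing differ between fetches.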

Pricing Design

The intended pay-per-event setup is:

  • apify-actor-start: a tiny run-start fee
  • useful-article-result: charged only for useful public article records
  • apify-default-dataset-item: not used

Short pages, private-network URLs, sensitive token-like paths, failed fetches, duplicates, and unchanged comparison records should not trigger the useful-article-result charge.
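
The charging rule above can be sketched as a simple predicate. This is an illustration under stated assumptions, not the Actor's code: the `ArticleRecord` fields, the `MIN_WORDS` cutoff, and the `start_fee` default are hypothetical, and the $5.00 per 1,000 figure is the listing's "from" price, so real costs may differ.

```python
from dataclasses import dataclass
from typing import Iterable

USEFUL_EVENT = "useful-article-result"  # event name taken from the listing
MIN_WORDS = 150  # assumed cutoff for a "short page"; the real value is not published


@dataclass
class ArticleRecord:  # hypothetical record shape, for illustration only
    word_count: int
    fetch_ok: bool = True
    url_rejected: bool = False  # private-network or token-like URL
    is_duplicate: bool = False
    change_status: str = "new"  # "new", "changed", or "unchanged"


def is_billable(record: ArticleRecord) -> bool:
    """Return True only for records the pricing text describes as useful."""
    if not record.fetch_ok or record.url_rejected:
        return False
    if record.is_duplicate or record.change_status == "unchanged":
        return False
    return record.word_count >= MIN_WORDS


def estimate_cost(records: Iterable[ArticleRecord],
                  per_thousand: float = 5.00,
                  start_fee: float = 0.0) -> float:
    """Estimate run cost in USD; start_fee stands in for the run-start event."""
    billable = sum(1 for r in records if is_billable(r))
    return start_fee + billable * per_thousand / 1000
```

At the listed starting price, each billable record costs $0.005, so a run that yields 200 useful records would cost about $1.00 plus the run-start fee.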

Good Fits

  • Summarizing public blog posts or news articles for AI agents
  • Preparing public article records for RAG or spreadsheets
  • Monitoring whether important articles changed
  • Checking article metadata completeness
  • Building source-linked research briefs

Boundaries

This Actor does not log in, bypass paywalls, use cookies, crawl private feeds, or enrich data on private individuals. It accepts only public HTTP and HTTPS article URLs. Credentials, query parameters, fragments, private-network addresses, localhost, .local hostnames, account/invite/reset/unsubscribe paths, and token-like paths are rejected or safely redacted.
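
The URL rules above could be enforced with a pre-fetch check along these lines. This is a sketch using only the Python standard library; the function name `sanitize_article_url`, the choice to reject credentials while stripping query strings and fragments, and the hex-string heuristic for token-like paths are all assumptions, since the listing does not say which inputs are rejected and which are redacted.

```python
import ipaddress
import re
from typing import Optional
from urllib.parse import urlsplit, urlunsplit

BLOCKED_SEGMENTS = {"account", "invite", "reset", "unsubscribe"}
TOKEN_LIKE = re.compile(r"^[0-9a-fA-F]{32,}$")  # assumed heuristic: long hex strings


def sanitize_article_url(url: str) -> Optional[str]:
    """Return a cleaned public URL, or None if the URL should be rejected."""
    try:
        parts = urlsplit(url)
        port = parts.port  # raises ValueError on a malformed port
    except ValueError:
        return None
    if parts.scheme not in ("http", "https"):
        return None
    if parts.username or parts.password:
        return None  # embedded credentials
    host = (parts.hostname or "").lower()
    if not host or host == "localhost" or host.endswith(".local"):
        return None
    try:
        ip = ipaddress.ip_address(host)
    except ValueError:
        ip = None  # a DNS name, not an IP literal
    if ip is not None and (ip.is_private or ip.is_loopback or ip.is_link_local):
        return None
    for segment in parts.path.split("/"):
        if segment.lower() in BLOCKED_SEGMENTS or TOKEN_LIKE.match(segment):
            return None
    netloc = f"[{host}]" if ":" in host else host  # re-bracket IPv6 literals
    if port is not None:
        netloc = f"{netloc}:{port}"
    # Drop the query string and fragment instead of rejecting the URL.
    return urlunsplit((parts.scheme, netloc, parts.path, "", ""))
```

This check inspects only the URL text. A hostname that resolves to a private address through DNS would pass it, so a production check would also need to validate the resolved address before fetching.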
