VOOZH about

URL: https://apify.com/andok/rss-parser

โ‡ฑ RSS & Atom Feed to JSON Scraper ยท Apify


Pricing

from $1.00 / 1,000 item parseds

Go to Apify Store

RSS & Atom Feed to JSON Scraper

Monitor blogs, news sites, and podcasts. Convert any RSS or Atom feed into structured JSON data for instant content syndication.

Pricing

from $1.00 / 1,000 item parseds

Rating

0.0

(0)

Developer

๐Ÿ‘ Andok

Andok

Maintained by Community

Actor stats

0

Bookmarked

30

Total users

9

Monthly active users

3 months ago

Last modified

Share

RSS & Atom Feed Parser

Parse any RSS or Atom feed into structured JSON and plug it straight into your monitoring pipeline. Tracking competitor blogs, industry news, or podcast releases becomes a scheduled job that outputs clean, normalized data โ€” no XML wrangling required. Feed it dozens of URLs at once and get back every item with title, link, date, author, and content snippet.

Features

  • Multi-feed processing โ€” parse dozens of RSS and Atom feeds in a single run
  • Format-agnostic โ€” handles RSS 2.0, RSS 1.0, and Atom feeds automatically
  • Normalized output โ€” consistent JSON structure regardless of feed format differences
  • Content snippets โ€” extracts plain-text previews from HTML content fields
  • Category extraction โ€” preserves article categories and tags from feed metadata
  • Configurable limits โ€” control how many items to extract per feed
  • Timeout protection โ€” per-feed timeout prevents slow feeds from blocking the entire run

Input

FieldTypeRequiredDefaultDescription
urlsarrayYesโ€”List of RSS or Atom feed URLs to parse (e.g. https://news.ycombinator.com/rss)
maxItemsintegerNo100Maximum number of items to extract per feed (1-500)
timeoutSecondsintegerNo15Per-feed fetch timeout in seconds (1-120)

Input Example

{
"urls":[
"https://news.ycombinator.com/rss",
"https://feeds.arstechnica.com/arstechnica/index"
],
"maxItems":50,
"timeoutSeconds":15
}

Output

Each dataset item represents one feed item (article, post, or episode). Key fields:

  • feedUrl (string) โ€” the source feed URL
  • feedTitle (string) โ€” title of the parent feed
  • itemTitle (string) โ€” title of the individual item
  • itemLink (string) โ€” URL to the full content
  • pubDate (string) โ€” publication date
  • creator (string) โ€” author or creator name
  • contentSnippet (string) โ€” plain-text preview of the content
  • guid (string) โ€” unique identifier for the item
  • categories (array) โ€” list of tags or categories

Output Example

{
"feedUrl":"https://news.ycombinator.com/rss",
"feedTitle":"Hacker News",
"itemTitle":"Show HN: Open-source tool for managing database migrations",
"itemLink":"https://github.com/example/db-migrate",
"pubDate":"Mon, 20 Jan 2025 18:45:00 +0000",
"creator":null,
"contentSnippet":"A lightweight CLI for managing SQL migrations across Postgres, MySQL, and SQLite.",
"guid":"https://news.ycombinator.com/item?id=38901234",
"categories":[]
}

Pricing

EventCost
Item ParsedPay-per-event (see actor pricing page)

Use Cases

  • Content monitoring โ€” schedule daily runs to track blog posts, news articles, or podcast episodes
  • Competitive intelligence โ€” follow competitor RSS feeds for product announcements and blog updates
  • Data pipeline ingestion โ€” normalize feed data into JSON for downstream processing, databases, or dashboards
  • Newsletter automation โ€” aggregate content from multiple sources into a curated feed
  • AI agent context โ€” supply structured article data to LLM pipelines and RAG applications
  • Slack/webhook alerts โ€” combine with Apify integrations to push new items to Slack or email

Related Actors

ActorWhat it adds
Google News ScraperSearch Google News by keyword instead of parsing known feed URLs
Hacker News ScraperGet richer Hacker News data (scores, comments) via the Firebase API
Markdown ExtractorConvert the linked articles into clean Markdown for LLM ingestion

You might also like

RSS / Atom Feed Scraper

rupom888/rss-atom-feed-scraper

Scrape any RSS or Atom feed. Works with news sites, blogs, podcasts, YouTube channels (/feeds/videos.xml?channel_id=...), Reddit (/r/subreddit/.rss), and any standard feed URL. Extracts title, description, author, publish date, categories, and full content.

RSS Feed Scraper โ€” Atom, Podcast & Multi-Feed

devilscrapes/rss-feed-scraper

Parse and convert any RSS or Atom feed to a clean dataset โ€” title, link, author, published date, summary, full HTML content, tags, GUID โ€” export to JSON or CSV. A drop-in RSS feed parser for RSS 2.0, Atom 1.0, and the content:encoded / dc:creator extensions.

RSS Feed Scraper & RSS to JSON Converter

xtech/feed-extractor

Scrape and parse RSS, Atom, JSON Feed (and podcast RSS) URLs into clean, structured JSON. Outputs one dataset row per feed entry/item for easy export to CSV/JSON and automations.

RSS & Atom Feed Reader - Monitor Any Feed at Scale

santamaria-automations/rss-feed-reader

Extract articles from RSS, Atom, and JSON feeds including title, author, content, images, categories, enclosures, and publication dates. Export data, run via API, schedule and monitor runs, or integrate with other tools.

RSS Feed Reader - Bulk RSS & Atom Feed Parser

logiover/bulk-rss-feed-reader

Read and parse RSS, Atom and RDF feeds in bulk, or auto-discover feeds from any website. Extract thousands of articles with full metadata for news monitoring, content aggregation and AI/RAG pipelines. No API key, export to CSV or JSON.