VOOZH about

URL: https://apify.com/werednalux/content-intelligence-extractor

⇱ Reddit & News to Markdown for LLM Pipelines Β· Apify


Pricing

from $5.00 / 1,000 page converteds

Go to Apify Store

Content Intelligence Extractor

Extract clean Markdown from Reddit threads and news sites. Built for LLM pipelines, n8n workflows, and AI content analysis. Uses Mozilla Readability + Reddit JSON API for noise-free output.

Pricing

from $5.00 / 1,000 page converteds

Rating

0.0

(0)

Developer

πŸ‘ Andrew Luxem

Andrew Luxem

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

0

Monthly active users

4 months ago

Last modified

Share

Converts Reddit threads and film/entertainment news articles into clean, structured Markdown optimized for LLM pipelines, AI content analysis, and n8n automation workflows.

What it does

Give it a list of URLs β€” Reddit threads or articles from sites like Screen Rant, CBR, IGN, or any news site β€” and it returns clean Markdown with engagement signals, metadata, and source-specific fields ready to pipe directly into Claude, GPT, or any LLM.

Reddit threads are extracted via Reddit's JSON API (no browser needed) with post body, top comments sorted by upvotes, and engagement data.

Film/news sites are extracted using Mozilla Readability + Turndown β€” the same engine Firefox uses to strip ads, sidebars, author bios, and newsletter popups before converting to clean Markdown.

Use cases

  • Content gap analysis β€” feed competitor articles to an LLM to find unexplored angles
  • n8n content pipelines β€” schedule weekly runs, pipe output to Claude or GPT for article briefs
  • Reddit trend monitoring β€” extract high-upvote fan theories or discussions for content research
  • SEO research β€” extract and analyze top-ranking articles in bulk
  • RAG knowledge bases β€” clean Markdown is ideal for vector embeddings

Example input

{
"urls":[
"https://www.reddit.com/r/FanTheories/comments/abc123/my_theory/",
"https://screenrant.com/some-article/"
],
"maxRedditComments":10,
"includeEngagementData":true
}

Example output

{
"url":"https://www.reddit.com/r/FanTheories/comments/abc123/",
"sourceType":"reddit",
"title":"Theory: The ending means something else entirely",
"markdown":"# Theory: The ending...\n\nFull post body...\n\n## Top Comments\n\n...",
"metadata":{
"wordCount":847,
"estimatedReadTime":4,
"engagementSignal":3200
},
"redditSpecific":{
"subreddit":"FanTheories",
"upvotes":3200,
"commentCount":143,
"topComments":[{"body":"...","upvotes":412}]
}
}

n8n integration

Use the native Apify n8n node to trigger this actor on a schedule:

  1. Schedule Trigger β€” weekly or daily
  2. Apify: Run Actor β€” pass your URL list
  3. Apify: Get Dataset β€” fetch results
  4. Loop + LLM node β€” Claude/GPT analysis prompt
  5. Google Sheets / Notion β€” store content briefs

Pricing

Pay-per-page: $0.005 per URL processed. First 20 pages free.

Supported sources

  • Reddit (all subreddits via JSON API)
  • Screen Rant, CBR, IGN, Variety, Hollywood Reporter
  • Any article-based news or blog site
  • Custom CSS selectors to strip site-specific noise

You might also like

Reddit Scraper - Markdown for AI & n8n

clearpath/reddit-to-llm-api

Extract Reddit posts and comments as LLM-ready Markdown. No API key needed. Direct n8n/Make integrationβ€”connect output to AI nodes instantly. 20x faster than browser scrapers. Perfect for lead gen, product validation, and market research workflows.

Reddit Answers API "Ask Reddit" - AI Insights for n8n Pipelines

clearpath/reddit-answers-api

Extract AI-powered answers in 6 languages from Reddit discussions at scale. Structured JSON + markdown for n8n, Make, and LLM pipelines. Includes full post/comment context, quotes with citations, and subreddit metadata. 6 languages supported. No login required. Pay per successful answer.

n8n-mcp

nourishing_courier/web-data-for-ai

n8n-mcp

πŸ‘ User avatar

Ani BjΓΆrkstrΓΆm

4

n8n Documentation MCP Server

agentify/n8n-mcp-server

n8n MCP Server provides AI assistants with structured access to n8n node documentation, properties, and validation tools for building and verifying workflows efficiently.

n8n Workflows Scraper

dadhalfdev/n8n-workflows-scraper

This scraper extracts pre-built, free workflow templates directly from the n8n template library. Pick a category and sort order, and the scraper will navigate n8n's library to extract not only the metadata of each workflow but the full, raw JSON configuration. Get up to 150 workflows per run.

πŸ‘ User avatar

Marco Rodrigues

3

n8n Workflow Automation Templates Scraper

scraped/n8n-workflow-automation-templates-scraper

A tool that automatically scrapes and collects n8n workflow automation templates from the n8n for easy access and use.

n8n Community Creators Scraper

lexis-solutions/n8n-community-creators

Scrape top n8n community creators by engagement: capture profiles, reputation metrics, locations, bios, plus linked n8n creator pages with workflow counts and verification for expert discovery and lead generation.

πŸ‘ User avatar

Lexis Solutions

2

Reddit Answers API - Ask Reddit (Rental)

clearpath/reddit-answers-api-rental

Extract AI-powered answers in 6 languages from Reddit discussions at scale. Structured JSON + markdown for n8n, Make, and LLM pipelines. Includes full post/comment context, quotes with citations, and subreddit metadata. 6 languages supported. No login required. Pay per successful answer.

LLM Markdown Crawler

sleek_waveform/llm-markdown-crawler

Crawl any website and extract clean, boilerplate-free Markdown optimized for LLMs, RAG pipelines, and AI training datasets. Uses Mozilla Readability to strip navigation and ads, then converts to clean Markdown. No browser required β€” fast and cheap.

πŸ‘ User avatar

Daniel Dimitrov

5

n8n Creators Scraper

lexis-solutions/n8n-creators-scraper

Scrape n8n.io/creators β€” extract verified creator profiles, workflow counts, bios, links, categories & pricing. Perfect for automation agencies, recruiters & no-code researchers. Structured JSON output.

πŸ‘ User avatar

Lexis Solutions

2

Related articles

How to publish your Apify Actor as an n8n node
Read more