VOOZH about

URL: https://apify.com/opendata-labs/lemmy-scraper

โ‡ฑ Lemmy Scraper ยท Apify


Pricing

Pay per usage

Go to Apify Store

Scrape posts, comments, communities and search results from any Lemmy instance via the official API. Clean structured data (JSON/CSV), no login required.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

๐Ÿ‘ Joao Paulo

Joao Paulo

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a day ago

Last modified

Share

Scrape posts, comments, communities, and search results from any Lemmy instance โ€” fast, structured, and auth-free. Built on the official public Lemmy API v3 (ActivityPub / fediverse), so it works on lemmy.world or any other decentralized server.

What it does

Lemmy Scraper turns the public Lemmy REST API into clean, flat dataset rows you can export to JSON, CSV, or Excel. Point it at any instance, pick a mode, and collect data from the fediverse without logging in.

Four modes:

  • Community posts โ€” every post in a community, sorted by Hot / Active / New / Top.
  • Search โ€” keyword search across the instance.
  • Community info โ€” metadata for a single community (subscribers, post count, description).
  • Post comments โ€” all comments under a specific post.

Features

  • Works on any Lemmy instance โ€” just change the instance input.
  • No authentication, no cookies, no tokens โ€” pure public API reads.
  • Flattened output โ€” clean columns, not raw nested JSON blobs.
  • Automatic pagination with a maxItems cap.
  • Polite request pacing plus automatic retries on transient HTTP errors.
  • Pay-per-result pricing friendly (PPE item-scraped events).

Input

FieldTypeDescription
modeenumcommunity_posts, search, community_info, or post_comments.
instancestringLemmy host, e.g. lemmy.world (default).
communityNamestringCommunity to scrape โ€” name or name@instance.tld. Required for community modes.
querystringSearch keywords. Required for search mode.
postIdintegerPost ID. Required for post_comments mode.
sortenumHot, Active, New, TopDay, TopWeek (community posts).
maxItemsintegerMax rows to collect (default 1000).

Example input

{
"mode":"community_posts",
"instance":"lemmy.world",
"communityName":"technology",
"sort":"Hot",
"maxItems":500
}

Output example

Each post becomes one flat row:

{
"id":12345678,
"title":"Open-source project hits 1.0",
"body":"Release notes inside...",
"url":"https://example.com/release",
"creatorName":"dev_user",
"creatorActorId":"https://lemmy.world/u/dev_user",
"communityName":"technology",
"score":842,
"upvotes":870,
"downvotes":28,
"commentsCount":134,
"published":"2026-06-20T14:03:11.000Z",
"postUrl":"https://lemmy.world/post/12345678"
}

Comments and community-info modes produce their own flat schemas (content, creator, score, subscribers, etc.).

Use cases

  • OSINT & research โ€” monitor communities and discussions across the fediverse.
  • Journalism โ€” track emerging stories and public sentiment on decentralized platforms.
  • Brand monitoring โ€” find mentions of your product or company via search mode.
  • AI / ML training data โ€” collect open social text and threaded discussions at scale.

Why this actor

Lemmy exposes a stable, official public REST API backed by ActivityPub. This scraper talks to that API directly instead of fragile HTML parsing, so it keeps working through UI changes and runs against any Lemmy server in the fediverse. No login, no rate-limit gymnastics, no brittle selectors โ€” just structured decentralized social data.


Keywords: Lemmy scraper, Lemmy API, scrape Lemmy, ActivityPub, fediverse data, decentralized social, Lemmy posts, Lemmy comments, federated Reddit alternative.

You might also like

Lemmy Scraper - Federated Reddit Alternative

legend006/lemmy-scraper

Scrape posts and comments from any Lemmy instance (the open, federated Reddit alternative). Filter by community, search keyword, or pull instance-wide feeds. No login required. Built for AI training datasets, fediverse research, and community monitoring.

๐Ÿ‘พ Lemmy Scraper - Federated Reddit Posts & Comments

benthepythondev/lemmy-scraper

Scrape Lemmy (the federated Reddit alternative) from any instance via the public API โ€” no login needed. Get front-page or per-community posts, comments, keyword search, and community data. Clean JSON with scores, upvotes & comment counts.

Lemmy Scraper

dami_studio/lemmy-scraper

Scrapes public Lemmy posts from any instance (default lemmy.world) by front-page feed, community, or keyword search. Returns title, link, body, author, community, score, comments, votes, NSFW flag and thumbnail as JSON. Best for brand and product mon

2

5.0

(1)

Lemmy Scraper โ€” Posts, Comments & Community Data

devilscrapes/lemmy-community-scraper

Scrape posts and comments from any public Lemmy community on any Fediverse instance. Fingerprint rotation, retries, and proxy fallback handled for you. Typed dataset rows, ready for SQL, CSV, or JSON.

Lemmy Scraper: Posts, Comments, Communities & Users

perconey/lemmy-scraper

Scrape any Lemmy instance (lemmy.world, lemmy.ml, beehaw.org and other Lemmyverse nodes) via the official /api/v3/* REST API. Posts with upvote/downvote counts, comment trees, communities with subscriber counts, user profiles, full-text search. No auth, no proxies. Pay per result.

๐Ÿฆ‹ Bluesky Scraper - Posts, Profiles & Search

benthepythondev/bluesky-scraper

Scrape Bluesky (bsky.app) via the public AT Protocol API โ€” no login needed. Get a user's posts, profile data, followers/follows, or search posts by keyword/hashtag. Clean JSON with engagement counts, links & images.

๐Ÿ˜ Mastodon Scraper - Hashtags, Posts & Trends

benthepythondev/mastodon-scraper

Scrape Mastodon (any instance) via the public REST API โ€” no login needed. Get hashtag posts, a user's posts, the public/federated timeline, trending posts, or profile data. Clean JSON with engagement counts, media & hashtags.

Lemmy Community Posts Scraper

parseforge/lemmy-community-posts-scraper

Track social activity from Lemmy Community Posts with profile name, follower count, posts, replies and timestamps. Designed for community managers, brand watchers and trend researchers. Run on demand or on a recurring schedule and feed every row into your favourite analytics or workflow stack.

PeerTube Scraper: Videos, Channels, Accounts & Search

perconey/peertube-scraper

Scrape any PeerTube instance via the official /api/v1 REST API. Videos, channels, accounts, search - cross-instance federation routing. No browser, no proxies, no auth. Pay only per result item.