URL: https://apify.com/legend006/bluesky-firehose-scraper

Bluesky Scraper · Apify


Pricing

from $0.50 / 1,000 results

Scrape Bluesky (AT Protocol) posts by keyword, hashtag, author handle, or custom feed. Export likes, reposts, replies, hashtags, mentions, embeds, and full metadata as JSON or CSV. Built for AI training datasets, social analytics, brand monitoring, and trend tracking.

Rating

0.0

(0)

Developer

NIJ KANANI

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 1
  • Monthly active users: 0
  • Last modified: 2 months ago

Bluesky Scraper

Scrape posts from Bluesky (the AT Protocol social network) by keyword, hashtag, author handle, or any custom feed. No coding, no rate-limit headaches. Export to JSON, CSV, Excel, or push directly into your stack via API.

Built for AI dataset builders, social media analysts, brand-monitoring teams, and trend hunters who need clean, structured Bluesky data at scale.


What you can do

  • Search posts by keyword or hashtag (#ai, bitcoin, climate change)
  • Pull any user's full feed by handle (jay.bsky.team)
  • Subscribe to custom feeds by AT URI
  • Filter by date range and language
  • Optionally include replies
  • Get rich post metadata: text, likes, reposts, replies, embeds, hashtags, mentions, links

Use cases

Who | Why
AI / LLM teams | Build clean training datasets from a fast-growing social network
Social analytics | Track hashtags, brand mentions, sentiment over time
Competitor monitoring | Watch what competitors post, what gets engagement
Journalists & researchers | Archive public discourse around news events
Trend hunters | Find rising topics & influencers before they hit mainstream

Quick start

  1. Click Try for free
  2. Choose a mode: search, author, or feed
  3. Enter your search terms / handles / feed URIs
  4. (Optional but recommended) Add your Bluesky handle + an App Password for higher rate limits and full reliability
  5. Click Start

That's it. Your data appears in the Dataset tab in seconds.


Input

Field | Type | Description
mode | enum | search / author / feed
searchTerms | array | Keywords or hashtags (search mode)
authors | array | Bluesky handles (author mode)
feedUris | array | AT URIs (feed mode)
maxItems | int | Cap per target (default 1000)
since / until | ISO date | Date range filter
language | string | 2-letter code (e.g. en)
includeReplies | bool | Include replies in author mode
bskyHandle | string | Optional: your Bluesky handle
bskyAppPassword | secret | Optional: your App Password
Example input

{
  "mode": "search",
  "searchTerms": ["#ai", "llm"],
  "maxItems": 5000,
  "since": "2026-04-01",
  "language": "en",
  "bskyHandle": "yourname.bsky.social",
  "bskyAppPassword": "xxxx-xxxx-xxxx-xxxx"
}
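
The same input can be sent programmatically instead of through the web form. The following is a sketch using the third-party `apify-client` package (`pip install apify-client`). The Actor ID is an assumption read off the listing URL, and the environment variable names `BSKY_HANDLE` and `BSKY_APP_PASSWORD` are hypothetical, chosen only to keep credentials out of source code.

```python
import os

# Assumption: Actor ID taken from the listing URL (username/actor-name).
ACTOR_ID = "legend006/bluesky-firehose-scraper"

RUN_INPUT = {
    "mode": "search",
    "searchTerms": ["#ai", "llm"],
    "maxItems": 5000,
    "since": "2026-04-01",
    "language": "en",
}


def with_credentials(run_input: dict, env=os.environ) -> dict:
    """Return a copy of run_input with Bluesky credentials added when both are set."""
    merged = dict(run_input)
    handle = env.get("BSKY_HANDLE")              # hypothetical variable name
    password = env.get("BSKY_APP_PASSWORD")      # hypothetical variable name
    if handle and password:
        merged.update(bskyHandle=handle, bskyAppPassword=password)
    return merged


def run_scraper(token: str, run_input: dict):
    """Start the Actor, wait for it to finish, then yield each dataset item."""
    from apify_client import ApifyClient  # imported lazily; third-party dependency

    client = ApifyClient(token)
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    if run is None:
        raise RuntimeError("Actor run did not return a result")
    yield from client.dataset(run["defaultDatasetId"]).iterate_items()
```

A typical call would be `for post in run_scraper(os.environ["APIFY_TOKEN"], with_credentials(RUN_INPUT)): print(post["text"])`, where `APIFY_TOKEN` holds your Apify API token.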

Output (per post)

{
  "uri": "at://did:plc:.../app.bsky.feed.post/...",
  "cid": "bafyrei...",
  "author": {
    "did": "did:plc:...",
    "handle": "username.bsky.social",
    "displayName": "Display Name",
    "avatar": "https://..."
  },
  "text": "Full post text",
  "createdAt": "2026-04-15T12:34:56.000Z",
  "indexedAt": "2026-04-15T12:34:57.000Z",
  "langs": ["en"],
  "likeCount": 42,
  "repostCount": 7,
  "replyCount": 3,
  "quoteCount": 1,
  "embed": {/* images, video, quoted posts */},
  "tags": ["ai"],
  "mentions": ["did:plc:..."],
  "links": ["https://..."],
  "isReply": false,
  "replyParent": null,
  "replyRoot": null,
  "bskyUrl": "https://bsky.app/profile/username.bsky.social/post/abc"
}
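
Nested records like this are awkward in spreadsheets, so a common first step is flattening. The sketch below reduces each post to flat columns and writes CSV with the standard library. The column selection and the `engagement` column (a plain sum of the four counters) are illustrative choices, not part of the Actor's output.

```python
import csv
import io

COUNTERS = ("likeCount", "repostCount", "replyCount", "quoteCount")


def flatten_post(post: dict) -> dict:
    """Reduce one output record to flat, CSV-friendly columns."""
    author = post.get("author") or {}
    row = {
        "uri": post.get("uri", ""),
        "handle": author.get("handle", ""),
        "createdAt": post.get("createdAt", ""),
        "text": post.get("text", ""),
    }
    for name in COUNTERS:
        row[name] = post.get(name, 0)
    # Hypothetical derived column: total interactions on the post.
    row["engagement"] = sum(row[name] for name in COUNTERS)
    row["tags"] = ";".join(post.get("tags") or [])
    row["isReply"] = bool(post.get("isReply", False))
    row["bskyUrl"] = post.get("bskyUrl", "")
    return row


def to_csv(posts: list[dict]) -> str:
    """Render flattened posts as CSV text with a header row."""
    rows = [flatten_post(p) for p in posts]
    if not rows:
        return ""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(rows[0]))
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

Joining tags with a semicolon keeps them in one cell; a separate tags table would be the better fit if you need to query by hashtag.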

Why authentication?

Bluesky's public API aggressively rate-limits anonymous requests from datacenter IPs. Adding your own free Bluesky App Password (NOT your main password; App Passwords are revocable, single-purpose tokens) lifts those limits and gives reliable, full-speed scraping. Your credentials are never stored: they are passed only to Bluesky's official servers per run.
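
For context, AT Protocol clients normally exchange a handle and App Password for a session through the `com.atproto.server.createSession` endpoint. The sketch below only builds that request and does not send it. Whether this Actor authenticates exactly this way is not stated in the listing, the `bsky.social` host is an assumption that holds for accounts on the default service, and the password shape check is inferred from the example input above.

```python
import json
import re
import urllib.request

# Assumption: the account lives on the default bsky.social service.
CREATE_SESSION_URL = "https://bsky.social/xrpc/com.atproto.server.createSession"

# Assumption: four groups of four characters, as in the example input.
APP_PASSWORD_SHAPE = re.compile(r"^[a-z0-9]{4}(-[a-z0-9]{4}){3}$")


def looks_like_app_password(value: str) -> bool:
    """Cheap guard against pasting the main account password by mistake."""
    return bool(APP_PASSWORD_SHAPE.match(value))


def build_session_request(handle: str, app_password: str) -> urllib.request.Request:
    """Build (but do not send) the createSession request for a handle and App Password."""
    if not looks_like_app_password(app_password):
        raise ValueError("value does not look like an App Password")
    body = json.dumps({"identifier": handle, "password": app_password}).encode("utf-8")
    return urllib.request.Request(
        CREATE_SESSION_URL,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

Passing the request to `urllib.request.urlopen` would return JSON that includes an `accessJwt`, which clients then send as a Bearer token on later calls. Revoking the App Password in Bluesky's settings invalidates it for future sessions.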


FAQ

Is this legal? Bluesky's public API is open by design: the AT Protocol is built around portable, public data. This Actor uses official endpoints only.

Will my account get banned? No. App Passwords are intended for read access. Respect normal rate limits and you'll be fine.

Can I run this on a schedule? Yes. Use Apify's Schedule feature to run hourly or daily.

How fast is it? Authenticated runs typically pull ~3,000 posts/minute.
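
The quoted throughput and the listed price allow a rough upper-bound estimate for a run. The sketch below treats both figures as given: the price is the listing's "from" rate and the throughput is the "typical" authenticated figure, so real runs may cost more or take longer.

```python
PRICE_PER_1000_USD = 0.50   # "from" price on the listing; your plan may differ
POSTS_PER_MINUTE = 3000     # typical authenticated throughput quoted in the FAQ


def estimate_run(targets: int, max_items: int = 1000) -> dict:
    """Upper-bound estimate: maxItems is a cap per target, so results <= targets * max_items."""
    max_results = targets * max_items
    return {
        "max_results": max_results,
        "max_cost_usd": round(max_results / 1000 * PRICE_PER_1000_USD, 2),
        "approx_minutes": round(max_results / POSTS_PER_MINUTE, 1),
    }
```

For the example input (two search terms, `maxItems` of 5000), `estimate_run(2, 5000)` gives at most 10,000 results, about $5.00, and a little over three minutes.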


Got questions or feature requests? Open an issue or message us.
