VOOZH about

URL: https://apify.com/zhenyu_towne/lightburn-forum-scrapper

โ‡ฑ LightBurn Forum Scrapper ยท Apify


Pricing

from $0.05 / 1,000 results

Go to Apify Store

LightBurn Forum Scrapper

LightBurn Forum Crawler extracts LightBurn forum topics, posts, and replies into clean, flat CSV/JSON records for semantic analysis, with one row per post or comment including type, original IDs, author, cleaned text, URLs, timestamps, likes, source, and matched keyword when applicable.

Pricing

from $0.05 / 1,000 results

Rating

0.0

(0)

Developer

๐Ÿ‘ Zhenyu Towne

Zhenyu Towne

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a month ago

Last modified

Share

LightBurn Forum Semantic Crawler

Extract clean, semantic-analysis-ready posts and comments from the LightBurn Software forum.

This Actor crawls LightBurn forum topics through the public Discourse JSON API and exports one flat dataset row per original post or reply. The output is designed for NLP, LLM, embedding, semantic search, clustering, topic modeling, support trend analysis, and spreadsheet workflows.

What It Does

  • Crawls the latest LightBurn forum topics or searches by keyword.
  • Fetches topic posts and replies.
  • Converts forum HTML into clean plain text.
  • Exports one row per post or comment.
  • Preserves original Discourse post and topic IDs.
  • Includes author, URL, timestamp, likes, source mode, and matched keyword.
  • Removes image data from the main text field so exports are easier to analyze.

Use Cases

  • Build a semantic search index from LightBurn forum discussions.
  • Analyze common user issues and support patterns.
  • Cluster posts by topic or intent.
  • Prepare forum text for embeddings or LLM classification.
  • Export clean CSV or JSON data for spreadsheets and BI tools.

Input Options

FieldDescription
baseUrlForum base URL. Defaults to https://forum.lightburnsoftware.com.
keywordsOptional keyword or comma-separated keywords. If empty, the Actor crawls latest topics.
startDateOptional start date filter, for example 2026-01-01.
endDateOptional end date filter, for example 2026-01-31.
timeFieldDate field used for filtering: created_at, last_posted_at, or bumped_at.
maxTopicsMaximum number of topics to process.
maxPagesMaximum number of listing or search pages to scan.
includeRepliesSet to false to export only the original topic post.
maxPostsPerTopicMaximum number of posts/comments exported from each topic.
categoryIdsOptional list of Discourse category IDs to include.
requestDelayMillisDelay between forum API requests.

Output

Each dataset item is a single flat record.

FieldDescription
recordTypepost for the first post in a topic, comment for replies.
originalPostIdOriginal Discourse post ID.
originalTopicIdOriginal Discourse topic ID.
topicTitleForum topic title.
topicUrlURL of the forum topic.
postUrlDirect URL to the post or comment.
postNumberPost number within the topic.
replyToPostNumberReferenced post number when the comment is a reply.
authorUsernameForum username.
authorNameDisplay name when available.
originalTextCleaned plain text extracted from the post body.
createdAtPost creation timestamp.
updatedAtPost update timestamp.
likeCountNumber of likes on the post.
sourcelatest or search.
matchedKeywordKeyword that matched the topic in search mode.
crawledAtTimestamp when the row was exported.

Example Output

{
"recordType":"comment",
"originalPostId":605754,
"originalTopicId":190079,
"topicTitle":"Downloaded 2.1.01 and my laser will not come to full power",
"topicUrl":"https://forum.lightburnsoftware.com/t/example-topic/190079",
"postUrl":"https://forum.lightburnsoftware.com/t/example-topic/190079/2",
"postNumber":2,
"replyToPostNumber":null,
"authorUsername":"MikeyH",
"authorName":"Mike Hembrey",
"originalText":"Check your Units settings. The upgrade might have flipped the switch.",
"createdAt":"2026-05-25T22:02:31.813Z",
"updatedAt":"2026-05-25T22:02:31.813Z",
"likeCount":0,
"source":"latest",
"matchedKeyword":null,
"crawledAt":"2026-05-26T03:07:28.862Z"
}

Notes

This Actor is built for structured text extraction. It does not download images or include image URLs in the main semantic text field. The resulting dataset is intentionally flat so CSV and JSON exports remain easy to analyze.

You might also like

SlickDeals Forum Threads Scraper (Original Merchant Links)

scralab/slickdeals-forum-threads-scraper

Scrape deals from **any Slickdeals forum thread list page**. Simply provide a forum URL โ€” Hot Deals, filtered views, category pages, or any custom forum query โ€” and get structured deal data with **direct links to the original merchant**.

Discourse Forum Scraper

automation-lab/discourse-scraper

Extract topics, posts, and discussions from any public Discourse forum. Supports latest topics, category filtering, and keyword search. No login required.

๐Ÿ‘ User avatar

Stas Persiianenko

22

RedFlagDeals Forum Threads Scraper (Original Merchant Links)

scralab/redflagdeals-forum-threads-scraper

Scrape deals from any RedFlagDeals (Canada) forum thread list page. Provide any forum URL (e.g., Hot Deals, filtered views, category pages) and get structured deal data with direct merchant links.

Discourse Forum Topics Scraper

parseforge/discourse-forum-topics-scraper

Gather social activity from Discourse Forum Topics with profile name, follower count, posts, replies and timestamps. Loved by community managers, brand watchers and trend researchers. Run on demand or on a recurring schedule and feed every row into your favourite analytics or workflow stack.

Fragrantica Forum Comment Poster

scrapyspider/fragrantica-forum-comment-poster

Post comments on Fragrantica forum threads using your account. Automatically embeds perfume and brand links, bypasses anti-bot protection, and returns structured results.

Discord Forum To Website

code-node-tools/discord-forum-to-website

Transform your Discord forum channels into beautiful, SEO-optimized static websites with customizable templates and one-click deployment to Vercel, Netlify, or GitHub Pages.

1

HardwareZone.com forum Scraper (No Cookies Needed)

expected_knight/hardwarezone-com-forum-scraper-no-cookies-needed

This Scraper is designed to quickly and accurately gather posts from the Hardwarezone.com forum, capturing comprehensive details like post content, publication date, url etc...

OpenClaw Forum Scraper

jungle_synthesizer/openclaw-forum-scraper

Scrape trending posts, top AI agents, and community rankings from OpenClaw Forum (openclawforum.org). Extracts engagement metrics, karma scores, member counts, and activity levels from the Moltbook ecosystem.

๐Ÿ‘ User avatar

BowTiedRaccoon

3

Tweakers Forum Scraper

peppy_hydrocarbon/tweakers-forum-scraper

Scrapes Tweakers.net forum threads into structured JSON. Each post includes poster info, timestamp, page number, and content converted to Markdown. Configure with a thread ID from the URL and an optional date filter for incremental runs.