VOOZH about

URL: https://apify.com/jungle_synthesizer/clawhub-skill-content-scraper

โ‡ฑ ClawHub Skill Scraper - AI Skills & SKILL.md Content ยท Apify


Pricing

Pay per event

Go to Apify Store

ClawHub Skill Scraper

Scrape AI agent skills from the ClawHub marketplace. Extracts SKILL.md definitions, metadata, stats, and changelogs for 8,000+ OpenClaw skills.

Pricing

Pay per event

Rating

0.0

(0)

Developer

๐Ÿ‘ BowTiedRaccoon

BowTiedRaccoon

Maintained by Community

Actor stats

0

Bookmarked

14

Total users

1

Monthly active users

a month ago

Last modified

Share

Scrape AI agent skill data from the ClawHub marketplace. Extract skill definitions (SKILL.md), metadata, download stats, install counts, star ratings, changelogs, and author information for 8,000+ skills in the OpenClaw ecosystem.


ClawHub Skill Scraper Features

  • Extracts full SKILL.md content from the OpenClaw GitHub archive
  • Collects marketplace stats: downloads, installs, stars, comments, version count
  • Two modes: metadata-only (fast) or full content with SKILL.md files
  • Handles 8,000+ skills with automatic pagination and rate limiting
  • Merges data from the ClawHub listing API and GitHub raw content
  • Outputs structured JSON with 20+ fields per skill record

Who Uses ClawHub Skill Data?

  • AI tool directories โ€” Index and categorize AI agent skills for discovery platforms
  • Developer analytics โ€” Track skill adoption trends, download velocity, and popularity across the OpenClaw ecosystem
  • Competitive research โ€” Monitor new skill releases, author activity, and marketplace growth
  • Dataset builders โ€” Collect SKILL.md definitions to train or evaluate AI agents on real-world skill specifications
  • Market research โ€” Analyze the AI agent skill marketplace for investment or product positioning

How the ClawHub Scraper Works

  1. List skills โ€” Paginates through the ClawHub listing API to collect skill metadata and marketplace stats
  2. Resolve authors โ€” Fetches owner handles from individual skill detail endpoints
  3. Download SKILL.md โ€” Pulls full skill definitions and package metadata from the OpenClaw GitHub archive

Steps 2 and 3 only run when downloadContent is enabled. With it disabled, the scraper returns metadata only and runs significantly faster.


Input

{
"maxItems":50,
"downloadContent":true
}
FieldTypeDefaultDescription
maxItemsinteger50Maximum number of skills to scrape. Set to 0 for unlimited.
downloadContentbooleantrueFetch SKILL.md content from GitHub. When disabled, only ClawHub API metadata is collected (faster).
proxyConfigurationobjectnoneProxy settings. Not required โ€” the scraper works without proxies.

ClawHub Scraper Output Fields

{
"slug":"web-search",
"displayName":"Web Search",
"version":"1.2.0",
"summary":"Search the web and return structured results",
"author":"openclaw",
"tags":{"latest":"1.2.0","stable":"1.1.5"},
"downloads":12450,
"installsAllTime":8320,
"installsCurrent":1540,
"stars":89,
"comments":12,
"versions":8,
"changelog":"Added timeout configuration and retry logic",
"skillMd":"# Web Search\n\nSearch the web using...",
"supportingFiles":[],
"ownerId":"usr_abc123",
"createdAt":"2024-06-15T10:30:00Z",
"updatedAt":"2025-02-20T14:15:00Z",
"publishedAt":"2025-02-20T14:15:00Z"
}
FieldTypeDescription
slugstringUnique skill identifier (URL slug)
displayNamestringHuman-readable skill name
versionstringLatest published version
summarystringShort skill description
authorstringSkill author from skill.yaml metadata
tagsobjectTag-to-version mapping (e.g. latest, stable)
downloadsnumberTotal download count
installsAllTimenumberLifetime install count
installsCurrentnumberCurrent active installs
starsnumberStar/favorite count
commentsnumberComment count
versionsnumberTotal published version count
changelogstringLatest version changelog text
skillMdstringFull SKILL.md file content
supportingFilesarrayAdditional files in the skill package
ownerIdstringInternal owner identifier
createdAtstringSkill creation timestamp (ISO 8601)
updatedAtstringLast update timestamp (ISO 8601)
publishedAtstringLatest version publish timestamp (ISO 8601)

FAQ

How do I scrape all skills from ClawHub?

Set maxItems to 0 and downloadContent to true. The ClawHub Skill Scraper will paginate through the full marketplace and download SKILL.md content for every skill.

How much does the ClawHub Skill Scraper cost to run?

A full run with content download for all 8,000+ skills uses approximately 256 MB of memory and takes under an hour. Metadata-only mode is significantly faster and cheaper.

What is a SKILL.md file?

SKILL.md is the standard definition file for AI agent skills in the OpenClaw ecosystem. It describes what the skill does, its inputs, outputs, and configuration. The ClawHub Skill Scraper extracts these files directly from the OpenClaw GitHub archive.

Can I scrape ClawHub without downloading SKILL.md content?

Yes. Set downloadContent to false to collect only marketplace metadata (names, stats, tags) without fetching content from GitHub. This mode is much faster and skips the GitHub download phase entirely.


Need More Features?

Need custom fields, filters, or a scraper for a different AI marketplace? File an issue or get in touch.

Why Use the ClawHub Skill Scraper?

  • Complete โ€” Combines ClawHub marketplace data with full SKILL.md content from GitHub in a single run
  • Flexible โ€” Metadata-only mode for fast runs, full mode for complete skill definitions
  • Reliable โ€” Rate limiting, retry logic, and graceful error handling for 8,000+ skill records

You might also like

Agent Skills Scraper

parsebird/agent-skills-scraper

Extract deep metadata from skills.sh, the open agent skills directory. Scrape weekly installs, GitHub stars, security audits, agent adoption breakdown, SKILL.md content, and more from every skill listing.

Skill Curator Scraper

datapilot/skill-curator-scraper

MCP Skill Scraper collects AI skills from SkillsMP and GitHub. It extracts name, description, stars, license, and URLs, then calculates a quality score. Outputs structured JSON for discovering MCP tools, AI skills, and developer resources.

Agent Skills Generator

wheat_tourist/agent-skill-generator

Transforms any goal into production-ready AI agent skills. Generates validated, atomic skill definitions with execution graphs, failure recovery, and CrewAI/OpenAI exports. Supports optional web scraping, self-healing validation, semantic versioning, and cross-modal content workflows out of the box.

LinkedIn Learning Courses Scraper

crawlerbros/linkedin-learning-courses-scraper

Scrape LinkedIn Learning course catalog - search by keyword or skill category and return courses with title, instructor, duration, skill level, skills taught, and more. No login required.

Freelancer Scraper โ€” Jobs, Budgets, Skills, Bids

brilliant_gum/freelancer-jobs-scraper

Scrape Freelancer.com jobs, budgets, required skills, and bid statistics at scale. Filter by skill, budget, language, competition. USD-normalised pricing. Built-in analytics report: skill demand, budget distribution, top opportunities. Proxies included.

๐Ÿ‘ User avatar

Yuliia Kulakova

2

Hermes Skill Builder for Apify & APIs

solutionssmart/hermes-skill-builder-for-apify-apis

Generate Hermes-ready AI agent skill packages from Apify Actors, API docs, GitHub repositories, and OpenAPI specs.

๐Ÿ‘ User avatar

Solutions Smart

29

Upwork Rate & Market Intelligence โ€” Freelance Skill Rates

bovi/upwork-market-intelligence

Benchmark any freelance skill on Upwork: live demand (open jobs), hourly & fixed rate distribution (median, p25, p75), hourly-vs-fixed split, experience-tier mix, posting freshness and adjacent skills. One clean row per skill.

๐Ÿ‘ User avatar

Vitalii Bondarev

2

Related articles

Introducing Apify Agent Skills
Read more
How to deploy OpenClaw and give it Apify skills
Read more
OpenClaw web search: How to extract structured data at scale
Read more