cdnjs Libraries Catalog Scraper

Pricing

from $27.00 / 1,000 results

Scrape the cdnjs library catalog. Browse 4,000+ open-source front-end JavaScript and CSS libraries with versions, file lists, SRI hashes, and repository links. Modes: full catalogue, keyword search, single-library detail.

Rating

0.0

(0)

Developer

ParseForge

Maintained by Community

Last modified

25 days ago

☁️ cdnjs Libraries Scraper

🚀 Export the cdnjs front-end catalogue in seconds. Browse, search, or drill into any of 4,000+ open-source JavaScript and CSS libraries hosted on Cloudflare's free CDN, with versions, file lists, SRI hashes, and repository links in one run. No cdnjs account, no catalogue API token, no manual paging.

🕒 Last updated: 2026-05-15 · 📊 13 fields per record · ☁️ 4,000+ libraries · 🌐 200B+ requests/month · 🔁 Live per run

The cdnjs Libraries Scraper queries the official cdnjs catalogue and returns up to 13 fields per library. Every mode returns name, description, latest version, homepage, repository, license, author, and keywords; library mode adds versions, sample file paths, and SRI hashes. cdnjs is a free, open-source CDN operated by Cloudflare, serving over 200 billion requests per month for front-end developers worldwide.

The catalogue spans 4,000+ libraries including React, Vue, jQuery, Bootstrap, Tailwind, Three.js, D3, Lodash, and thousands more. This Actor streams matching records as CSV, Excel, JSON, or XML in under five minutes. Three run modes (catalogue, search, library) let you sweep the whole catalogue, filter by keyword, or pull a single library with full file listing and per-version SRI hashes.

🎯 Target Audience	💡 Primary Use Cases
Front-end developers, security engineers, build engineers, OSS researchers, technical writers, DevOps teams, marketing analysts	Asset inventories, SRI hash collection, version tracking, dependency discovery, license audits, library popularity research

📋 What the cdnjs Libraries Scraper does

Three run modes in a single Actor:

🌐 Catalogue mode. Sweep the entire 4,000+ library catalogue with the most useful metadata fields.
🔍 Search mode. Free-text keyword filter (e.g. react, chart, tailwind) across the catalogue.
📦 Library mode. Pull a single library with full version history, file list, and per-version SRI hashes.

Each catalogue record includes the library name, description, latest version, homepage, repository, license, author, and keywords. Library mode adds the complete versions array, a sample of hosted files for the latest version, and SRI hashes ready to drop into a <script integrity="..."> attribute.
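
As a rough illustration of that last point, the sketch below turns one library-mode record into `<script>` tags. It assumes the `sri` field arrives as an object keyed by file name (the schema also allows a plain string) and that the hashes belong to the latest version. The `script_tag` helper and the sample record are hypothetical, and the hash is a placeholder.

```python
from html import escape

# Public URL layout cdnjs uses for hosted files.
CDN_BASE = "https://cdnjs.cloudflare.com/ajax/libs"


def script_tag(name: str, version: str, filename: str, sri_hash: str) -> str:
    """Build a <script> tag that loads one cdnjs file with an integrity check."""
    src = f"{CDN_BASE}/{name}/{version}/{filename}"
    return (
        f'<script src="{escape(src)}" '
        f'integrity="{escape(sri_hash)}" '
        'crossorigin="anonymous" referrerpolicy="no-referrer"></script>'
    )


# Hypothetical record shaped like the schema examples; the hash is a placeholder.
record = {
    "name": "jquery",
    "latestVersion": "3.7.1",
    "sri": {"jquery.min.js": "sha512-..."},
}

for filename, sri_hash in record["sri"].items():
    print(script_tag(record["name"], record["latestVersion"], filename, sri_hash))
```

The `crossorigin="anonymous"` attribute matters here: browsers only run the integrity check on cross-origin scripts that are fetched with CORS.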

💡 Why it matters: every front-end stack pulls libraries from cdnjs or a similar CDN. Knowing exactly which versions, files, and hashes are available is the difference between a fast, reproducible build and a broken Subresource Integrity check in production.

🎬 Full Demo

🚧 Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded cdnjs dataset.

⚙️ Input

Input	Type	Default	Behavior
mode	string	"catalogue"	One of catalogue, search, or library.
query	string	""	Keyword used in search mode. Ignored in other modes.
libraryName	string	""	Exact library slug for library mode (e.g. jquery, bootstrap).
fields	string	"name,description,version,homepage,repository,license,author,keywords"	Comma-separated catalogue fields to request.
maxItems	integer	10	Records to return. Free plan caps at 10, paid plan at 1,000,000.

Example: search for chart-related libraries.

{
  "mode": "search",
  "query": "chart",
  "maxItems": 25,
  "fields": "name,description,version,homepage,repository,license,author,keywords"
}

Example: deep dive on jQuery with file list and SRI hashes.

{
  "mode": "library",
  "libraryName": "jquery"
}
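
If you assemble inputs in code, a small helper can catch mistakes before a run starts. This is only a sketch: `build_input` is a hypothetical name, and the rules that search mode needs a query and library mode needs a slug are assumptions read off the input table, not documented Actor behaviour.

```python
DEFAULT_FIELDS = "name,description,version,homepage,repository,license,author,keywords"
MODES = {"catalogue", "search", "library"}


def build_input(mode="catalogue", query="", library_name="", max_items=10,
                fields=DEFAULT_FIELDS):
    """Return a run-input dict, rejecting combinations that look unusable."""
    if mode not in MODES:
        raise ValueError(f"mode must be one of {sorted(MODES)}, got {mode!r}")
    if mode == "search" and not query:
        raise ValueError("search mode needs a non-empty query")
    if mode == "library" and not library_name:
        raise ValueError("library mode needs a libraryName slug")
    if max_items < 1:
        raise ValueError("maxItems must be a positive integer")

    run_input = {"mode": mode, "maxItems": max_items, "fields": fields}
    if mode == "search":
        run_input["query"] = query
    if mode == "library":
        # Slugs are passed through unchanged because they are case-sensitive.
        run_input["libraryName"] = library_name
    return run_input


print(build_input("search", query="chart", max_items=25))
```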

⚠️ Good to Know: library mode returns the full versions array (often 100+ entries for popular libraries) and a sample of hosted files for the latest version (up to 25 files). SRI hashes use SHA-512 and are ready to drop into a Subresource Integrity attribute. Library slugs are case-sensitive (jquery, not jQuery).

📊 Output

Each cdnjs library record contains up to 13 fields. Download the dataset as CSV, Excel, JSON, or XML.
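
The same exports can be fetched over HTTP once a run has finished. The sketch below builds a URL for the Apify dataset items endpoint. `dataset_export_url` is a hypothetical helper, the dataset ID is whatever your run reports, and private datasets will also need an API token, which is left out here.

```python
from urllib.parse import urlencode

# Format names the Apify dataset items endpoint uses for the four
# export types listed above.
EXPORT_FORMATS = {"CSV": "csv", "Excel": "xlsx", "JSON": "json", "XML": "xml"}


def dataset_export_url(dataset_id: str, export: str = "JSON") -> str:
    """Return the download URL for a dataset in the requested format."""
    query = urlencode({"format": EXPORT_FORMATS[export], "clean": "true"})
    return f"https://api.apify.com/v2/datasets/{dataset_id}/items?{query}"


# "abc123" is a placeholder dataset ID.
print(dataset_export_url("abc123", "Excel"))
```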

🧾 Schema

Field	Type	Example
📦 name	string	"jquery"
🔗 url	string	"https://cdnjs.com/libraries/jquery"
📝 description	string	"JavaScript library for DOM operations"
🏷️ latestVersion	string	"3.7.1"
🔗 homepage	string | null	"https://jquery.com/"
🔗 repository	object | null	{"type":"git","url":"git+https://github.com/jquery/jquery.git"}
📜 license	string | null	"MIT"
👤 author	string | null	"JS Foundation and other contributors"
🏷️ keywords	string[]	["jquery","javascript","library"]
📦 versions	string[]	["1.2.3","...","3.7.1"]
📦 fileSamples	string[]	["jquery.min.js","jquery.js","jquery.slim.min.js"]
🔐 sri	object | string	{"jquery.min.js":"sha512-..."}
🕒 scrapedAt	ISO 8601	"2026-05-15T00:00:00.000Z"
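
For typed pipelines, the schema can be mirrored as a `TypedDict`. This is a sketch based only on the table above: the class names and the `missing_fields` helper are hypothetical, and every key is marked optional because catalogue and search records presumably omit the library-mode fields.

```python
from typing import Optional, TypedDict, Union


class Repository(TypedDict):
    type: str
    url: str


class LibraryRecord(TypedDict, total=False):
    name: str
    url: str
    description: str
    latestVersion: str
    homepage: Optional[str]
    repository: Optional[Repository]
    license: Optional[str]
    author: Optional[str]
    keywords: list[str]
    versions: list[str]              # library mode
    fileSamples: list[str]           # library mode
    sri: Union[dict[str, str], str]  # library mode
    scrapedAt: str                   # ISO 8601 timestamp


def missing_fields(record: dict) -> list[str]:
    """Return schema fields absent from a record, in schema order."""
    return [key for key in LibraryRecord.__annotations__ if key not in record]


sample = {"name": "jquery", "url": "https://cdnjs.com/libraries/jquery",
          "latestVersion": "3.7.1"}
print(missing_fields(sample))
```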

✨ Why choose this Actor

	Capability
☁️	Catalogue-wide reach. 4,000+ libraries served by cdnjs, one of the world's largest open-source CDNs.
🎯	Three run modes. Catalogue sweep, keyword search, or single-library deep dive in one Actor.
🔐	SRI hashes included. Drop-in Subresource Integrity attributes for every file in library mode.
⚡	Fast. 10 libraries in under 5 seconds, full catalogue sweep in 1-2 minutes.
🔁	Always fresh. Every run hits the live catalogue, so versions and file lists reflect the current CDN state.
🚫	No authentication. Public catalogue access. No login, no token.
📥	Multi-format export. CSV, Excel, JSON, or XML, ready for spreadsheets, warehouses, or pipelines.

📊 Up-to-date CDN metadata is the foundation of every static-site build, security policy, and front-end performance audit on the modern web.

📈 How it compares to alternatives

Approach	Cost	Coverage	Refresh	Filters	Setup
⭐ cdnjs Libraries Scraper (this Actor)	$5 free credit, then pay-per-use	4,000+ libraries	Live per run	catalogue, search, library	⚡ 2 min
Hand-rolled catalogue client	Free + dev time	Full	Per request	Custom code	🐢 Days
Package registry exports	Free	Per registry	Daily	Registry-specific	🕒 Variable
Manual catalogue browsing	Free	One library at a time	Live	UI only	🕒 Tedious

Pick this Actor when you want a single dataset of CDN-hosted libraries with SRI hashes and version history, ready for analytics or build automation.

🚀 How to use

📝 Sign up. Create a free account with $5 credit (takes 2 minutes).
🌐 Open the Actor. Go to the cdnjs Libraries Scraper page on the Apify Store.
🎯 Set input. Pick a run mode, enter a search query or library slug, and set maxItems.
🚀 Run it. Click Start and let the Actor pull your matching libraries.
📥 Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.

⏱️ Total time from sign-up to downloaded dataset: 3-5 minutes. No coding required.

💼 Business use cases

🔐 Security & Compliance

Build SRI hash inventories for CSP and SRI policies
Track library version drift across deployed sites
License audits across thousands of front-end packages
Detect outdated libraries before disclosure windows

⚡ Performance & Build Engineering

Cache-warm strategies for CDN-hosted assets
Generate static manifests for build pipelines
Compare hosted file sizes across library versions
Pin specific versions for reproducible builds

📊 Developer Intelligence

Trending front-end library tracking
Topic clustering by keywords and tags
Adoption benchmarks for competing libraries
Maintenance signals from version cadence

📝 Technical Content & Education

Tutorial generators with canonical library URLs
Documentation site sidebars with current versions
Tooling inventories for boot-camp curricula
Newsletter feeds of newly published libraries

🔌 Automating cdnjs Libraries Scraper

Control the scraper programmatically for scheduled runs and pipeline integrations:

🟢 Node.js. Install the apify-client NPM package.
🐍 Python. Use the apify-client PyPI package.
📚 See the Apify API documentation for full details.

The Apify Schedules feature lets you trigger this Actor on any cron interval. Hourly, daily, or weekly refreshes keep build manifests and security dashboards in sync.
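
A minimal Python sketch of a programmatic run, assuming the `apify-client` package is installed and that its `call` method returns a dict containing `defaultDatasetId`, as it does in the client versions this sketch was written against (check the client documentation for yours). The Actor ID is taken from this page's URL. `make_client` and `run_scraper` are hypothetical helper names, and the token is a placeholder.

```python
ACTOR_ID = "parseforge/cdnjs-libraries-scraper"  # taken from the Store URL


def make_client(token: str):
    """Create an Apify client; requires `pip install apify-client`."""
    # Imported lazily so run_scraper can be exercised without the package.
    from apify_client import ApifyClient
    return ApifyClient(token)


def run_scraper(client, run_input: dict) -> list[dict]:
    """Start the Actor, wait for it to finish, and return the dataset items."""
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    if run is None:
        raise RuntimeError("Actor run did not return a result")
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


# Usage (placeholder token):
# client = make_client("<YOUR_APIFY_TOKEN>")
# items = run_scraper(client, {"mode": "search", "query": "chart", "maxItems": 25})
```

Passing the client in as an argument keeps `run_scraper` easy to test with a stub, which helps when the run is one step in a larger pipeline.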

🌟 Beyond business use cases

The same structured records support research, education, civic projects, and personal initiatives.

🎓 Research and academia

Front-end ecosystem studies with longitudinal version data
Library-adoption research for software-engineering papers
Coursework on CDN architecture and SRI
Reproducible studies with cited dataset pulls

🎨 Personal and creative

Indie web-dev dashboards and portfolio demos
Personal cheat-sheets for favourite libraries
Visualisations of the front-end landscape
Side projects exploring keyword-driven library discovery

🤝 Non-profit and civic

Curated library lists for community workshops
Inclusive tooling guides for under-resourced classrooms
Transparency on third-party assets in civic websites
Sponsorship targeting based on download impact

🧪 Experimentation

Train classifiers that recommend libraries from prompts
Validate developer-tool product ideas with adoption data
Prototype agent pipelines that scaffold projects from a theme
Test library-search UX with live catalogue data

❓ Frequently Asked Questions

🧩 How does it work?

Pick a run mode, enter a query or library slug, and click Start. The Actor pulls the matching records from the official catalogue and returns one structured record per library. No browser automation, no captchas, no setup.

📏 How accurate is the data?

Every field reflects what the official catalogue serves at run time. Version lists, files, and SRI hashes are pulled directly from the CDN manifest, so they match exactly what your <script> tag would load.

🔁 How often is the catalogue refreshed?

The catalogue is updated whenever new versions are published by upstream maintainers. Every run of this Actor pulls the latest state.

🔐 What are SRI hashes and why do I need them?

Subresource Integrity hashes let your <script> and <link> tags verify the file hasn't been tampered with. Library mode returns SHA-512 hashes for each file, ready to drop into the integrity attribute.
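
To see what such a hash is, the sketch below computes an SRI value the way the Subresource Integrity format defines it: the algorithm name, a hyphen, then the Base64-encoded digest of the file bytes. The function names are hypothetical, and the sample bytes stand in for a file you have downloaded yourself.

```python
import base64
import hashlib


def sri_sha512(data: bytes) -> str:
    """Return the SHA-512 SRI string for a file's raw bytes."""
    digest = hashlib.sha512(data).digest()
    return "sha512-" + base64.b64encode(digest).decode("ascii")


def matches_sri(data: bytes, expected: str) -> bool:
    """Check downloaded bytes against an SRI string from a dataset record."""
    return sri_sha512(data) == expected


file_bytes = b"console.log('hi');"  # placeholder for a downloaded file
expected = sri_sha512(file_bytes)
print(matches_sri(file_bytes, expected))
print(matches_sri(b"tampered", expected))
```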

📂 Can I get the file list for any library?

Yes. Run the Actor in library mode with the exact library slug (e.g. jquery) to get up to 25 sample file paths for the latest version, plus the SRI hash for each.

⏰ Can I schedule regular runs?

Yes. Use Apify Schedules to run this Actor on any cron interval (hourly, daily, weekly) and keep a downstream database in sync.

⚖️ Is this data legal to use?

cdnjs publishes library metadata under an open licence. Always honour the underlying library licences when redistributing the actual files (the Actor returns metadata only).

💼 Can I use this data commercially?

Yes. Catalogue metadata is publicly accessible for analytics and discovery use. Each individual library is governed by its own licence, which you should review before bundling.

💳 Do I need a paid Apify plan to use this Actor?

No. The free Apify plan is enough for testing and small runs (10 records per run). A paid plan lifts the limit and gives you scheduling, higher concurrency, and larger datasets.

🔁 What happens if a run fails or gets interrupted?

Apify automatically retries transient errors. If a run still fails, you can inspect the log in the Runs tab, fix the input, and re-run. Partial datasets from failed runs are preserved so you never lose progress.

🆘 What if I need help?

Our support team is here to help. Contact us through the Apify platform or use the Tally form linked below.

🔌 Integrate with any app

cdnjs Libraries Scraper connects to any cloud service via Apify integrations:

Make - Automate multi-step workflows
Zapier - Connect with 5,000+ apps
Slack - Get run notifications in your channels
Airbyte - Pipe catalogue data into your warehouse
GitHub - Trigger runs from commits and releases
Google Drive - Export datasets straight to Sheets

You can also use webhooks to trigger downstream actions when a run finishes. Push fresh library data into your build pipeline, or alert your team in Slack on new versions of watched libraries.
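
One way to drive a "new version" alert is to compare the dataset from the previous run with the current one. The sketch below assumes both are lists of records with `name` and `latestVersion` fields, as in the schema. `new_versions` is a hypothetical helper, and the sample version numbers are made up for illustration.

```python
def new_versions(previous: list[dict], current: list[dict],
                 watched: set[str]) -> dict[str, tuple[str, str]]:
    """Map each watched library whose latestVersion changed to (old, new)."""
    before = {rec["name"]: rec.get("latestVersion") for rec in previous}
    changes = {}
    for rec in current:
        name, latest = rec["name"], rec.get("latestVersion")
        # Libraries missing from the previous snapshot are skipped.
        if name in watched and name in before and before[name] != latest:
            changes[name] = (before[name], latest)
    return changes


# Illustrative snapshots, not real scrape results.
prev = [{"name": "jquery", "latestVersion": "3.7.0"}]
curr = [{"name": "jquery", "latestVersion": "3.7.1"}]
print(new_versions(prev, curr, {"jquery"}))
```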

🔗 Recommended Actors

📦 NuGet Packages Scraper - .NET package catalogue and download history
🦀 Crates.io Scraper - Rust crate metadata and downloads
📦 Hex.pm Elixir Packages Scraper - Elixir/Erlang package catalogue
🎮 Modrinth Mods Scraper - Minecraft mods, modpacks, shaders, resource packs
🐳 Docker Hub Scraper - Container image metadata and pulls

💡 Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.

🆘 Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.

⚠️ Disclaimer: this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by cdnjs, Cloudflare, or any of the catalogue's library maintainers. All trademarks mentioned are the property of their respective owners. Only publicly available catalogue metadata is collected.

URL: https://apify.com/parseforge/cdnjs-libraries-scraper