URL: https://apify.com/parseforge/cdnjs-libraries-scraper

cdnjs Libraries Catalog Scraper · Apify


Pricing

from $27.00 / 1,000 results

cdnjs Libraries Catalog Scraper

Scrape the cdnjs library catalog. Browse 4,000+ open-source front-end JavaScript and CSS libraries with versions, file lists, SRI hashes, and repository links. Modes: full catalogue, keyword search, single-library detail.

Rating: 0.0 (0)

Developer: ParseForge

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 25 days ago

cdnjs Libraries Scraper

Export the cdnjs front-end catalogue in seconds. Browse, search, or drill into any of 4,000+ open-source JavaScript and CSS libraries hosted on Cloudflare's free CDN, with versions, file lists, SRI hashes, and repository links in one run. No sign-up, no token plumbing, no manual paging.

Last updated: 2026-05-15 · 13 fields per record · 4,000+ libraries · 200B+ requests/month · Live per run

The cdnjs Libraries Scraper queries the official cdnjs catalogue and returns 13 fields per library, including name, description, latest version, homepage, repository, license, author, keywords, versions, sample file paths, and SRI hashes. cdnjs is a free, fast, and reliable open-source CDN operated by Cloudflare, serving over 200 billion requests per month for front-end developers worldwide.

The catalogue spans 4,000+ libraries including React, Vue, jQuery, Bootstrap, Tailwind, Three.js, D3, Lodash, and thousands more. This Actor streams matching records as CSV, Excel, JSON, or XML in under five minutes. Three run modes (catalogue, search, library) let you sweep the whole catalogue, filter by keyword, or pull a single library with full file listing and per-version SRI hashes.

Target audience: Front-end developers, security engineers, build engineers, OSS researchers, technical writers, DevOps teams, marketing analysts

Primary use cases: Asset inventories, SRI hash collection, version tracking, dependency discovery, license audits, library popularity research

What the cdnjs Libraries Scraper does

Three run modes in a single Actor:

  • Catalogue mode. Sweep the entire 4,000+ library catalogue with the most useful metadata fields.
  • Search mode. Free-text keyword filter (e.g. react, chart, tailwind) across the catalogue.
  • Library mode. Pull a single library with full version history, file list, and per-version SRI hashes.

Each catalogue record includes the library name, description, latest version, homepage, repository, license, author, and keywords. Library mode adds the complete versions array, a sample of hosted files for the latest version, and SRI hashes ready to drop into a <script integrity="..."> attribute.
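As a rough illustration of how a library-mode record could feed a <script integrity="..."> attribute, here is a minimal Python sketch. It assumes the sri field arrives in its object form (file path mapped to hash) and uses the public cdnjs URL pattern https://cdnjs.cloudflare.com/ajax/libs/{name}/{version}/{file}. The script_tag helper and the sample record are hypothetical, and the hash is a placeholder rather than a real digest.

```python
from html import escape

CDN_BASE = "https://cdnjs.cloudflare.com/ajax/libs"


def script_tag(name: str, version: str, file_path: str, sri: str) -> str:
    """Build a <script> tag that carries a Subresource Integrity attribute."""
    src = f"{CDN_BASE}/{name}/{version}/{file_path}"
    return (
        f'<script src="{escape(src, quote=True)}" '
        f'integrity="{escape(sri, quote=True)}" '
        'crossorigin="anonymous"></script>'
    )


# Hypothetical library-mode record; the hash below is a placeholder, not a real digest.
record = {
    "name": "jquery",
    "latestVersion": "3.7.1",
    "sri": {"jquery.min.js": "sha512-PLACEHOLDER"},
}

# One tag per file listed in the sri object.
tags = [
    script_tag(record["name"], record["latestVersion"], path, digest)
    for path, digest in record["sri"].items()
]
print(tags[0])
```

The crossorigin="anonymous" attribute is included because browsers only perform the integrity check on cross-origin scripts that are fetched with CORS.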

Why it matters: every front-end stack pulls libraries from cdnjs or a similar CDN. Knowing exactly which versions, files, and hashes are available is the difference between a fast, reproducible build and a broken Subresource Integrity check in production.


Full Demo

Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded cdnjs dataset.


Input

| Input | Type | Default | Behavior |
| --- | --- | --- | --- |
| mode | string | "catalogue" | One of catalogue, search, or library. |
| query | string | "" | Keyword used in search mode. Ignored in other modes. |
| libraryName | string | "" | Exact library slug for library mode (e.g. jquery, bootstrap). |
| fields | string | "name,description,version,homepage,repository,license,author,keywords" | Comma-separated catalogue fields to request. |
| maxItems | integer | 10 | Records to return. Free plan caps at 10, paid plan at 1,000,000. |

Example: search for chart-related libraries.

{
  "mode": "search",
  "query": "chart",
  "maxItems": 25,
  "fields": "name,description,version,homepage,repository,license,author,keywords"
}

Example: deep dive on jQuery with file list and SRI hashes.

{
  "mode": "library",
  "libraryName": "jquery"
}
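If the input is assembled in code rather than typed into the console, a small helper can catch mistakes before a run is started. The sketch below mirrors the input table and the two examples above. The build_input function and its validation rules are hypothetical, and the Actor may well accept combinations this helper rejects.

```python
VALID_MODES = {"catalogue", "search", "library"}
DEFAULT_FIELDS = "name,description,version,homepage,repository,license,author,keywords"


def build_input(mode: str = "catalogue", query: str = "", library_name: str = "",
                max_items: int = 10, fields: str = DEFAULT_FIELDS) -> dict:
    """Assemble a run input dict and reject obviously incomplete combinations."""
    if mode not in VALID_MODES:
        raise ValueError(f"mode must be one of {sorted(VALID_MODES)}")
    if mode == "library":
        if not library_name:
            raise ValueError("library mode needs an exact, case-sensitive slug")
        # Matches the jQuery example: only mode and libraryName are sent.
        return {"mode": mode, "libraryName": library_name}
    if max_items < 1:
        raise ValueError("maxItems must be at least 1")
    run_input = {"mode": mode, "maxItems": max_items, "fields": fields}
    if mode == "search":
        if not query.strip():
            raise ValueError("search mode needs a non-empty query")
        run_input["query"] = query
    return run_input


search_input = build_input("search", query="chart", max_items=25)
library_input = build_input("library", library_name="jquery")
```

The helper does not lowercase the slug on purpose: slugs are case-sensitive, so silently changing the caller's value could hide a typo.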

Good to Know: library mode returns the full versions array (often 100+ entries for popular libraries) and a sample of hosted files for the latest version (up to 25 files). SRI hashes use SHA-512 and are ready to drop into a Subresource Integrity attribute. Library slugs are case-sensitive (jquery, not jQuery).


Output

Each cdnjs library record contains up to 13 fields. Download the dataset as CSV, Excel, JSON, or XML.

Schema

| Field | Type | Example |
| --- | --- | --- |
| name | string | "jquery" |
| url | string | "https://cdnjs.com/libraries/jquery" |
| description | string | "JavaScript library for DOM operations" |
| latestVersion | string | "3.7.1" |
| homepage | string or null | "https://jquery.com/" |
| repository | object or null | {"type":"git","url":"git+https://github.com/jquery/jquery.git"} |
| license | string or null | "MIT" |
| author | string or null | "JS Foundation and other contributors" |
| keywords | string[] | ["jquery","javascript","library"] |
| versions | string[] | ["1.2.3","...","3.7.1"] |
| fileSamples | string[] | ["jquery.min.js","jquery.js","jquery.slim.min.js"] |
| sri | object or string | {"jquery.min.js":"sha512-..."} |
| scrapedAt | ISO 8601 | "2026-05-15T00:00:00.000Z" |
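Several schema fields are nullable, and the repository URL carries a git+ prefix, so downstream code usually needs a little normalisation. The Python sketch below shows one way to do that under the schema above. The repository_url and license_counts helpers are hypothetical, and the second sample record is invented for illustration.

```python
from collections import Counter
from typing import Optional


def repository_url(record: dict) -> Optional[str]:
    """Return a browsable repository URL, or None when the field is null."""
    repo = record.get("repository")
    if not isinstance(repo, dict):
        return None
    url = repo.get("url") or ""
    if url.startswith("git+"):
        url = url[len("git+"):]
    if url.endswith(".git"):
        url = url[: -len(".git")]
    return url or None


def license_counts(records: list) -> Counter:
    """Tally licences, grouping null values under 'unknown'."""
    return Counter(r.get("license") or "unknown" for r in records)


records = [
    {"name": "jquery", "license": "MIT",
     "repository": {"type": "git", "url": "git+https://github.com/jquery/jquery.git"}},
    # Hypothetical record showing the null case for both fields.
    {"name": "example-lib", "license": None, "repository": None},
]

print(repository_url(records[0]))
print(license_counts(records))
```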


Why choose this Actor

Capabilities:

  • Catalogue-wide reach. 4,000+ libraries served by cdnjs, one of the world's largest open-source CDNs.
  • Three run modes. Catalogue sweep, keyword search, or single-library deep dive in one Actor.
  • SRI hashes included. Drop-in Subresource Integrity attributes for every file in library mode.
  • Fast. 10 libraries in under 5 seconds, full catalogue sweep in 1-2 minutes.
  • Always fresh. Every run hits the live catalogue, so versions and file lists reflect the current CDN state.
  • No authentication. Public catalogue access. No login, no token.
  • Multi-format export. CSV, Excel, JSON, or XML, ready for spreadsheets, warehouses, or pipelines.

Up-to-date CDN metadata is the foundation of every static-site build, security policy, and front-end performance audit on the modern web.


How it compares to alternatives

| Approach | Cost | Coverage | Refresh | Filters | Setup |
| --- | --- | --- | --- | --- | --- |
| cdnjs Libraries Scraper (this Actor) | $5 free credit, then pay-per-use | 4,000+ libraries | Live per run | catalogue, search, library | 2 min |
| Hand-rolled catalogue client | Free + dev time | Full | Per request | Custom code | Days |
| Package registry exports | Free | Per registry | Daily | Registry-specific | Variable |
| Manual catalogue browsing | Free | One library at a time | Live | UI only | Tedious |

Pick this Actor when you want a single dataset of CDN-hosted libraries with SRI hashes and version history, ready for analytics or build automation.


How to use

  1. Sign up. Create a free account with $5 credit (takes 2 minutes).
  2. Open the Actor. Go to the cdnjs Libraries Scraper page on the Apify Store.
  3. Set input. Pick a run mode, enter a search query or library slug, and set maxItems.
  4. Run it. Click Start and let the Actor pull your matching libraries.
  5. Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.

Total time from signup to downloaded dataset: 3-5 minutes. No coding required.


Business use cases

Security & Compliance

  • Build SRI hash inventories for CSP and SRI policies
  • Track library version drift across deployed sites
  • License audits across thousands of front-end packages
  • Detect outdated libraries before disclosure windows

Performance & Build Engineering

  • Cache-warm strategies for CDN-hosted assets
  • Generate static manifests for build pipelines
  • Compare hosted file sizes across library versions
  • Pin specific versions for reproducible builds

Developer Intelligence

  • Trending front-end library tracking
  • Topic clustering by keywords and tags
  • Adoption benchmarks for competing libraries
  • Maintenance signals from version cadence

Technical Content & Education

  • Tutorial generators with canonical library URLs
  • Documentation site sidebars with current versions
  • Tooling inventories for boot-camp curricula
  • Newsletter feeds of newly published libraries

Automating cdnjs Libraries Scraper

Control the scraper programmatically for scheduled runs and pipeline integrations:

  • Node.js. Install the apify-client NPM package.
  • Python. Use the apify-client PyPI package.
  • See the Apify API documentation for full details.

The Apify Schedules feature lets you trigger this Actor on any cron interval. Hourly, daily, or weekly refreshes keep build manifests and security dashboards in sync.
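For a dependency-free starting point, the run can also be started through the Apify HTTP API. The sketch below uses only the Python standard library instead of the apify-client package and calls the run-sync-get-dataset-items endpoint, which starts a run and returns the dataset items in one response. The Actor ID is inferred from this page's URL (the API writes the slash as a tilde), YOUR_APIFY_TOKEN is a placeholder, and the helper names are hypothetical.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2"
# Assumed from the Store URL parseforge/cdnjs-libraries-scraper; "/" becomes "~" in API paths.
ACTOR_ID = "parseforge~cdnjs-libraries-scraper"


def build_run_request(token: str, run_input: dict) -> urllib.request.Request:
    """Prepare a POST that runs the Actor and asks for the dataset items as JSON."""
    query = urllib.parse.urlencode({"format": "json"})
    url = f"{API_BASE}/acts/{ACTOR_ID}/run-sync-get-dataset-items?{query}"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def run_actor(token: str, run_input: dict, timeout: float = 300.0) -> list:
    """Send the request and decode the returned records (performs a network call)."""
    with urllib.request.urlopen(build_run_request(token, run_input), timeout=timeout) as resp:
        return json.load(resp)


# Building the request does not touch the network; call run_actor() to execute it.
request = build_run_request(
    "YOUR_APIFY_TOKEN", {"mode": "search", "query": "chart", "maxItems": 25}
)
print(request.get_method(), request.full_url)
```

For long catalogue sweeps the synchronous endpoint may time out, so the client packages listed above or an asynchronous run followed by a dataset download are likely the better fit.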


Beyond business use cases

The same structured records support research, education, civic projects, and personal initiatives.

Research and academia

  • Front-end ecosystem studies with longitudinal version data
  • Library-adoption research for software-engineering papers
  • Coursework on CDN architecture and SRI
  • Reproducible studies with cited dataset pulls

Personal and creative

  • Indie web-dev dashboards and portfolio demos
  • Personal cheat-sheets for favourite libraries
  • Visualisations of the front-end landscape
  • Side projects exploring keyword-driven library discovery

Non-profit and civic

  • Curated library lists for community workshops
  • Inclusive tooling guides for under-resourced classrooms
  • Transparency on third-party assets in civic websites
  • Sponsorship targeting based on download impact

Experimentation

  • Train classifiers that recommend libraries from prompts
  • Validate developer-tool product ideas with adoption data
  • Prototype agent pipelines that scaffold projects from a theme
  • Test library-search UX with live catalogue data

Frequently Asked Questions

How does it work?

Pick a run mode, enter a query or library slug, and click Start. The Actor pulls the matching records from the official catalogue and returns one structured record per library. No browser automation, no captchas, no setup.

How accurate is the data?

Every field reflects what the official catalogue serves at run time. Version lists, files, and SRI hashes are pulled directly from the CDN manifest, so they match exactly what your <script> tag would load.

How often is the catalogue refreshed?

The catalogue is updated whenever new versions are published by upstream maintainers. Every run of this Actor pulls the latest state.

What are SRI hashes and why do I need them?

Subresource Integrity hashes let your <script> and <link> tags verify the file hasn't been tampered with. Library mode returns SHA-512 hashes for each file, ready to drop into the integrity attribute.
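To make the mechanism concrete: an SHA-512 SRI value is the string sha512- followed by the Base64 encoding of the file's raw SHA-512 digest. The Python sketch below computes that value for arbitrary bytes and compares it with an expected hash. The function names and the sample payload are illustrative only and are not part of the Actor.

```python
import base64
import hashlib


def sri_sha512(content: bytes) -> str:
    """Compute the SRI string: 'sha512-' + Base64 of the raw SHA-512 digest."""
    digest = hashlib.sha512(content).digest()
    return "sha512-" + base64.b64encode(digest).decode("ascii")


def matches_sri(content: bytes, expected: str) -> bool:
    """Check downloaded bytes against an integrity value such as one from the sri field."""
    return sri_sha512(content) == expected.strip()


# Illustrative payload; in practice this would be the bytes of the file served by the CDN.
payload = b"console.log('hello');"
value = sri_sha512(payload)

print(value[:7], len(value))
print(matches_sri(payload, value))
```

A check like this can run in a build step to confirm that a locally cached copy of a file still matches the hash recorded in the dataset.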

Can I get the file list for any library?

Yes. Run the Actor in library mode with the exact library slug (e.g. jquery) to get up to 25 sample file paths for the latest version, plus the SRI hash for each.

Can I schedule regular runs?

Yes. Use Apify Schedules to run this Actor on any cron interval (hourly, daily, weekly) and keep a downstream database in sync.

Is this data legal to use?

cdnjs publishes library metadata under an open licence. Always honour the underlying library licences when redistributing the actual files (the Actor returns metadata only).

Can I use this data commercially?

Yes. Catalogue metadata is publicly accessible for analytics and discovery use. Each individual library is governed by its own licence, which you should review before bundling.

Do I need a paid Apify plan to use this Actor?

No. The free Apify plan is enough for testing and small runs (10 records per run). A paid plan lifts the limit and gives you scheduling, higher concurrency, and larger datasets.

What happens if a run fails or gets interrupted?

Apify automatically retries transient errors. If a run still fails, you can inspect the log in the Runs tab, fix the input, and re-run. Partial datasets from failed runs are preserved so you never lose progress.

What if I need help?

Our support team is here to help. Contact us through the Apify platform or use the Tally form linked below.


Integrate with any app

cdnjs Libraries Scraper connects to any cloud service via Apify integrations:

  • Make - Automate multi-step workflows
  • Zapier - Connect with 5,000+ apps
  • Slack - Get run notifications in your channels
  • Airbyte - Pipe catalogue data into your warehouse
  • GitHub - Trigger runs from commits and releases
  • Google Drive - Export datasets straight to Sheets

You can also use webhooks to trigger downstream actions when a run finishes. Push fresh library data into your build pipeline, or alert your team in Slack on new versions of watched libraries.
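One way to turn a finished run into a "new version" alert is to compare the latest dataset with the previous one. The Python sketch below diffs two snapshots on the name and latestVersion fields from the output schema. The version_changes helper, the watched set, and both sample snapshots are hypothetical, and posting the result to Slack is left to whichever integration is in use.

```python
from typing import Optional


def version_changes(previous: list, current: list, watched: Optional[set] = None) -> list:
    """Return (name, old_version, new_version) for libraries whose latestVersion changed."""
    old_versions = {record["name"]: record.get("latestVersion") for record in previous}
    changes = []
    for record in current:
        name = record["name"]
        if watched is not None and name not in watched:
            continue  # Skip libraries nobody asked to be alerted about.
        new_version = record.get("latestVersion")
        if name in old_versions and old_versions[name] != new_version:
            changes.append((name, old_versions[name], new_version))
    return changes


# Illustrative snapshots from two consecutive runs.
yesterday = [
    {"name": "jquery", "latestVersion": "3.7.0"},
    {"name": "lodash.js", "latestVersion": "4.17.21"},
]
today = [
    {"name": "jquery", "latestVersion": "3.7.1"},
    {"name": "lodash.js", "latestVersion": "4.17.21"},
]

changes = version_changes(yesterday, today, watched={"jquery"})
print(changes)
```

Libraries that appear only in the newer snapshot are ignored here, so a separate check would be needed to report newly added catalogue entries.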


Recommended Actors

Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.


Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.


Disclaimer: this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by cdnjs, Cloudflare, or any of the catalogue's library maintainers. All trademarks mentioned are the property of their respective owners. Only publicly available catalogue metadata is collected.
