URL: https://apify.com/scrapesage/npm-scraper

npm Scraper - Packages, Downloads & Maintainer Leads

Pricing

from $3.00 / 1,000 package results

Scrape the npm registry by keyword or package name. Get version, description, keywords, license, GitHub repo, homepage, weekly/monthly downloads, quality/popularity scores, dependencies and maintainer contact emails as B2B developer leads. No key, no login. Export JSON, CSV, Excel.

Developer

๐Ÿ‘ Scrape Sage

Scrape Sage

Maintained by Community

Search the npm registry by keyword or package name and get a rich, structured record for every package: version, description, keywords, license, GitHub repo, homepage, weekly/monthly downloads, quality and popularity scores, and dependencies. It also returns the part other scrapers miss: maintainer contact emails and company domains as ready-to-use B2B developer leads.

No API key, no login, no browser: the actor reads JSON straight from the public npm registry APIs.

Why this npm scraper?

The npm registry is the largest software registry in the world, and every package carries publicly published maintainer metadata. This actor merges the search API, the registry detail API, and the downloads API into one clean record, and turns a keyword into a scored lead list of the developers and companies behind the packages.

| Data | Typical scrapers | This actor |
|---|---|---|
| Name, version, description, keywords | ✅ | ✅ |
| GitHub repository + homepage | partial | ✅ |
| Weekly & monthly downloads | ❌ | ✅ |
| Quality / popularity / maintenance scores | ❌ | ✅ |
| Maintainer emails | ❌ | ✅ |
| Company domains (from email addresses) | ❌ | ✅ |
| License, created & last-published dates | ❌ | ✅ opt-in |
| Dependency list + version count | ❌ | ✅ opt-in |
| Deprecation status | ❌ | ✅ opt-in |
| Lead score (0-100) | ❌ | ✅ |
| Monitor mode: only new / updated packages | ❌ | ✅ |

Use cases

  • Developer & maintainer lead generation: if you sell a dev tool, SDK, API, or DevRel service, search a niche (stripe, kubernetes, web scraping) and reach package maintainers at their published email, filtered to real company domains.
  • Developer-tool market research: map every package in a category with downloads, scores, and last-publish dates to size and track a market.
  • Supply-chain & dependency intelligence: pull packages, their dependency lists, maintainers, and maintenance health for security and risk analysis.
  • Open-source recruiting & partnerships: find the active maintainers behind popular packages by language, topic, or company.
  • Package monitoring: schedule recurring runs to watch a keyword for new and freshly published packages.

How to use

  1. Sign up for Apify. The free plan is enough to try this actor.
  2. Open the npm Scraper, enter search queries (or package names), and click Start.
  3. Watch packages stream into the dataset table.
  4. Export as JSON, CSV, Excel, XML, or RSS, or pull results programmatically via the Apify API.

Input

{
    "searchQueries": ["web scraping", "keywords:cli"],
    "packageNames": ["express", "@angular/core"],
    "maxResults": 100,
    "includeDownloads": true,
    "includeDetails": true,
    "includeDependencies": false,
    "onlyWithEmail": false,
    "minMonthlyDownloads": 1000,
    "monitorMode": false
}

  • searchQueries: keywords; supports npm qualifiers like keywords:cli, author:sindresorhus, scope:angular.
  • packageNames: exact package names to fetch directly (always fully detailed).
  • maxResults (default 100): total packages to scrape across all queries.
  • includeDownloads (default true): add last-week and last-month download counts.
  • includeDetails (default false): fetch full registry metadata: license, dates, version count, deprecation, named maintainers, dependency count.
  • includeDependencies (default false): when details are on, output the dependency name list too.
  • onlyWithEmail / onlyWithRepository: keep only contactable / source-linked packages.
  • minMonthlyDownloads (default 0): keep only packages above a popularity threshold.
  • monitorMode (default false): output only packages that are new or newly versioned since the last run.
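
The defaults listed above can be collected into a small client-side helper before starting a run. This is a minimal sketch, not part of the actor: `DOCUMENTED_DEFAULTS` and `buildInput` are hypothetical names, the requirement that at least one of searchQueries or packageNames is present is an assumption, and switching on includeDetails when includeDependencies is requested is only a convenience derived from the "when details are on" wording.

```javascript
// Defaults copied from the parameter list above. onlyWithEmail and
// onlyWithRepository are left out because their defaults are not documented.
const DOCUMENTED_DEFAULTS = {
    maxResults: 100,
    includeDownloads: true,
    includeDetails: false,
    includeDependencies: false,
    minMonthlyDownloads: 0,
    monitorMode: false,
};

// Hypothetical helper: merges caller overrides onto the documented defaults.
function buildInput(overrides = {}) {
    const input = { ...DOCUMENTED_DEFAULTS, ...overrides };

    // Assumption: a run needs at least one search query or package name.
    const hasQueries = Array.isArray(input.searchQueries) && input.searchQueries.length > 0;
    const hasNames = Array.isArray(input.packageNames) && input.packageNames.length > 0;
    if (!hasQueries && !hasNames) {
        throw new Error('Provide searchQueries or packageNames');
    }

    // The dependency list is described as available only when details are on,
    // so enable details whenever the dependency list is requested.
    if (input.includeDependencies && !input.includeDetails) {
        input.includeDetails = true;
    }
    return input;
}

const exampleInput = buildInput({
    searchQueries: ['keywords:cli'],
    includeDependencies: true,
});
console.log(exampleInput);
```

The returned object can be passed as the input of a run started through the Apify API or apify-client.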

Output

One record per package (type: "package"):

{
    "type": "package",
    "name": "got-scraping",
    "version": "4.2.1",
    "description": "HTTP client made for scraping based on got.",
    "keywords": ["scraping", "http", "got", "crawlee"],
    "npmUrl": "https://www.npmjs.com/package/got-scraping",
    "homepage": "https://github.com/apify/got-scraping#readme",
    "repositoryUrl": "https://github.com/apify/got-scraping",
    "githubRepo": "apify/got-scraping",
    "bugsUrl": "https://github.com/apify/got-scraping/issues",
    "author": {"name": "Apify", "email": "support@apify.com"},
    "publisher": {"username": "apify-release", "email": "apify-release@apify.com"},
    "maintainers": [{"username": "mtrunkat", "email": "marek@apify.com"}],
    "maintainerEmails": ["support@apify.com", "marek@apify.com"],
    "maintainerCount": 4,
    "companyDomains": ["apify.com"],
    "license": "ISC",
    "createdAt": "2021-03-18T12:00:00.000Z",
    "lastPublishedAt": "2026-05-30T09:14:00.000Z",
    "versionCount": 73,
    "isDeprecated": false,
    "dependenciesCount": 7,
    "weeklyDownloads": 412044,
    "monthlyDownloads": 1820551,
    "finalScore": 61,
    "qualityScore": 78,
    "popularityScore": 49,
    "maintenanceScore": 100,
    "emailFound": true,
    "leadScore": 88,
    "searchQuery": "web scraping",
    "scrapedAt": "2026-06-27T12:00:00.000Z"
}
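
Records of this shape are easy to post-process into a per-company lead list. The sketch below is only an illustration: `groupLeadsByDomain` and the score threshold of 70 are hypothetical choices, and the field names (emailFound, leadScore, companyDomains, maintainerEmails, name) are taken from the sample record above.

```javascript
// Hypothetical post-processing helper, not part of the actor.
// Groups package records by company domain, keeping only records that
// have a published email and meet a minimum lead score.
function groupLeadsByDomain(items, minLeadScore = 70) {
    const byDomain = new Map();
    for (const item of items) {
        if (!item.emailFound || (item.leadScore ?? 0) < minLeadScore) continue;
        for (const domain of item.companyDomains ?? []) {
            if (!byDomain.has(domain)) {
                byDomain.set(domain, { packages: [], emails: new Set() });
            }
            const entry = byDomain.get(domain);
            entry.packages.push(item.name);
            // Keep only the emails that belong to this domain.
            for (const email of item.maintainerEmails ?? []) {
                if (email.toLowerCase().endsWith(`@${domain}`)) entry.emails.add(email);
            }
        }
    }
    return byDomain;
}

// Two trimmed records: the first mirrors the sample above, the second is invented.
const sampleItems = [
    {
        name: 'got-scraping',
        emailFound: true,
        leadScore: 88,
        companyDomains: ['apify.com'],
        maintainerEmails: ['support@apify.com', 'marek@apify.com'],
    },
    {
        name: 'low-score-package',
        emailFound: true,
        leadScore: 10,
        companyDomains: ['example.com'],
        maintainerEmails: ['dev@example.com'],
    },
];

for (const [domain, entry] of groupLeadsByDomain(sampleItems)) {
    console.log(domain, entry.packages, [...entry.emails]);
}
```

Using a Map keyed by domain keeps one entry per company even when several packages share the same maintainers.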

Automate & schedule

Run this actor on autopilot and pull results into your own stack:

  • Apify API: start runs, fetch datasets, and manage schedules over REST.
  • apify-client for JavaScript and apify-client for Python: official SDKs.
  • Schedules: run it daily/weekly with monitorMode to watch a keyword for new and updated packages, collecting only what changed.
  • Webhooks: trigger downstream actions (CRM import, Slack alert, email sequence) the moment a run finishes.

import { ApifyClient } from 'apify-client';

// Replace with your own Apify API token.
const client = new ApifyClient({ token: 'MY_APIFY_TOKEN' });

// Start the actor and wait for the run to finish.
const run = await client.actor('scrapesage/npm-scraper').call({
    searchQueries: ['react components'],
    maxResults: 200,
    includeDownloads: true,
    onlyWithEmail: true,
});

// Read the scraped records from the run's default dataset.
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(`Got ${items.length} packages & maintainer leads`);

Integrate with any app

Connect the dataset to 5,000+ apps, no code required:

  • Make: multi-step automation scenarios.
  • Zapier: push new maintainer leads straight into your CRM.
  • Slack: get notified when a monitored keyword gains a new package.
  • Google Drive / Sheets: auto-export every run to a spreadsheet.
  • Airbyte: pipe results into your data warehouse.
  • GitHub: trigger runs from commits or releases.

Use with AI assistants (MCP)

The output is clean, LLM-ready JSON. Call this actor from Claude, ChatGPT, or any agent framework through the Apify MCP server. For example, ask your assistant to "find popular web-scraping npm packages with a maintainer email and over 100k monthly downloads" and let it run this scraper for you.

Tips

  • Best lead quality: turn on onlyWithEmail and set minMonthlyDownloads to focus on active, popular packages run by reachable maintainers, then filter the companyDomains column for B2B targets.
  • Search smart: npm qualifiers narrow results fast, for example keywords:react, author:<username>, scope:<org>, or not:deprecated.
  • Cheap vs deep: search results already include maintainer emails and scores, so you get leads without includeDetails. Add details only when you need license, dates, dependencies, or deprecation.
  • Recurring monitoring: pair Schedules with monitorMode to track new and updated packages in a niche without re-paying for unchanged rows.

FAQ

Do I need an npm account or token? No. The actor reads the public npm registry and downloads APIs, so no key, account, or login is required.

Where do the maintainer emails come from? They are the contact emails npm package authors publish in their own package metadata (maintainers, author, publisher). The actor surfaces and de-duplicates them, and extracts the non-personal company domains from them. Automated/no-reply addresses are filtered out.
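
How the actor implements this filtering is not documented. The sketch below shows one plausible way to derive company domains from a list of published emails. `extractCompanyDomains`, the free-mail domain list, and the automated-address pattern are all assumptions made for illustration; the only outside fact used is that GitHub's private commit addresses live under users.noreply.github.com.

```javascript
// Assumed lists for illustration only; the actor's real filters may differ.
const FREE_MAIL_DOMAINS = new Set([
    'gmail.com', 'outlook.com', 'hotmail.com', 'yahoo.com', 'proton.me',
]);
// Matches local parts such as "noreply", "no-reply", "donotreply", "bot", "ci".
const AUTOMATED_LOCAL_PART = /^(no-?reply|do-?not-?reply|bot|ci)([+._-]|$)/i;

// Hypothetical helper: returns de-duplicated, non-personal domains.
function extractCompanyDomains(emails) {
    const domains = new Set();
    for (const raw of emails) {
        const email = raw.trim().toLowerCase();
        const at = email.lastIndexOf('@');
        if (at <= 0 || at === email.length - 1) continue; // not a usable address

        const local = email.slice(0, at);
        const domain = email.slice(at + 1);

        if (AUTOMATED_LOCAL_PART.test(local)) continue;      // automated sender
        if (domain.startsWith('users.noreply.')) continue;   // GitHub private address
        if (FREE_MAIL_DOMAINS.has(domain)) continue;         // personal mailbox
        domains.add(domain);
    }
    return [...domains];
}

const exampleDomains = extractCompanyDomains([
    'support@apify.com',
    'Marek@Apify.com',
    'someone@gmail.com',
    'noreply@github.com',
    '1+octocat@users.noreply.github.com',
    'not-an-email',
]);
console.log(exampleDomains);
```

Lowercasing before comparison is what collapses differently cased addresses at the same company into a single domain entry.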

Can I export to Google Sheets, CSV, or Excel? Yes. Export with one click in the dataset view, or automatically on every run via the Google Drive integration.

How do I monitor a keyword for new packages? Turn on monitorMode and create a Schedule. Each run outputs only packages that are new or have a new version since the previous run.
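
The "new or has a new version" rule can be pictured as a comparison between the current results and a name-to-version snapshot from the previous run. This is a conceptual sketch only: `selectChanged`, `snapshotVersions`, and the added `change` field are hypothetical, and how the actor actually stores state between runs is not described here.

```javascript
// Hypothetical state: Map of package name -> version seen in the previous run.
function snapshotVersions(records) {
    return new Map(records.map((record) => [record.name, record.version]));
}

// Keeps only records that are new or whose version differs from the snapshot.
function selectChanged(records, previousVersions) {
    const changed = [];
    for (const record of records) {
        const seenVersion = previousVersions.get(record.name);
        if (seenVersion === undefined) {
            changed.push({ ...record, change: 'new' });
        } else if (seenVersion !== record.version) {
            changed.push({ ...record, change: 'updated' });
        }
    }
    return changed;
}

// Invented package versions, used only to exercise the comparison.
const previousRun = [
    { name: 'express', version: '1.0.0' },
    { name: 'got-scraping', version: '4.2.1' },
];
const currentRun = [
    { name: 'express', version: '1.1.0' },
    { name: 'got-scraping', version: '4.2.1' },
    { name: 'brand-new-package', version: '0.1.0' },
];

const changedRecords = selectChanged(currentRun, snapshotVersions(previousRun));
console.log(changedRecords.map((r) => `${r.name}: ${r.change}`));
```

After each run, the snapshot would be replaced with one built from the current results so the next comparison starts from the latest state.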

A field is null. Why? Search results don't include license, dates, dependencies, or deprecation; turn on includeDetails for those. Some packages simply don't link a repository or homepage, have no keywords, or expose no maintainer email. Fields are null only when npm doesn't publish them.

Is scraping npm legal? This actor collects publicly available registry data only. You are responsible for using the data in compliance with applicable laws (GDPR/CCPA for personal data) and npm's terms.

Need help?

Open an issue on the actor's Issues tab, or visit the Apify help center. Feature requests are welcome, and this actor is actively maintained.
