VOOZH about

URL: https://apify.com/parseforge/osv-vulnerabilities-scraper

โ‡ฑ OSV.dev Open Source Vulnerabilities Scraper ยท Apify


๐Ÿ‘ OSV Open Source Vulnerabilities Scraper avatar

OSV Open Source Vulnerabilities Scraper

Pricing

from $22.87 / 1,000 results

Go to Apify Store

OSV Open Source Vulnerabilities Scraper

Query the OSV.dev open-source vulnerabilities database. Search by package (PyPI/npm/Go/Maven/RubyGems/crates.io/NuGet/Packagist), commit hash, or fetch a specific vulnerability by ID. Returns affected ranges, CVE aliases, severity, and references.

Pricing

from $22.87 / 1,000 results

Rating

0.0

(0)

Developer

๐Ÿ‘ ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a month ago

Last modified

Share

๐Ÿ‘ ParseForge Banner

๐Ÿ“ฆ OSV Vulnerabilities Scraper

๐Ÿš€ Export open source vulnerability data in seconds. Pull advisories from the OSV.dev catalogue covering 30+ ecosystems including PyPI, npm, Go, Maven, RubyGems, crates.io, NuGet, Packagist, and major Linux distributions. No sign-up, no token, no manual pagination.

๐Ÿ•’ Last updated: 2026-05-15 ยท ๐Ÿ“Š 16 fields per record ยท ๐Ÿ“ฆ 200,000+ advisories ยท ๐ŸŒ 30+ ecosystems ยท ๐Ÿ”— Cross-database aliases

The OSV Vulnerabilities Scraper pulls open source vulnerability records from the OSV.dev community catalogue and returns 16 normalised fields per record, including affected package ranges, severity scores, cross-database aliases (GHSA, CVE, PYSEC, RUSTSEC, GO, OSV-xxxx), patched version events, references, and credits. The underlying catalogue is the de facto open source vulnerability database, aggregating data from GitHub, PyPA, RustSec, Go vulnerability database, OSS-Fuzz, and dozens of distro security teams.

The catalogue covers 30+ package ecosystems from language registries (PyPI, npm, Go, Maven, RubyGems, crates.io, NuGet, Packagist, Hex, Pub, Hackage) to operating system distributions (Debian, Ubuntu, Alpine, Rocky, AlmaLinux, SUSE, openSUSE, Wolfi, Chainguard) plus Bioconductor and CRAN for R. This Actor makes that data downloadable as CSV, Excel, JSON, or XML in minutes. Filters apply at the source, so you skip pagination, deduplication, and ecosystem-specific quirks entirely.

๐ŸŽฏ Target Audience๐Ÿ’ก Primary Use Cases
DevSecOps teams, SBOM tool builders, open source maintainers, package registry operators, supply-chain security vendors, container security teamsDependency scanning, SBOM enrichment, package risk reports, ecosystem trend analysis, cross-database aliasing, container vulnerability triage

๐Ÿ“‹ What the OSV Vulnerabilities Scraper does

Three workflows in a single Actor:

  • ๐Ÿ“ฆ Package query. Look up every advisory affecting a package, optionally pinned to a version (e.g. requests on PyPI, lodash on npm, log4j-core on Maven).
  • ๐Ÿ” Commit query. Search by Git commit SHA to surface vulnerabilities introduced in a specific revision.
  • ๐Ÿ†” Vulnerability ID lookup. Fetch a single record or a batch of records by their identifier (GHSA-xxxx, CVE-xxxx, PYSEC-xxxx, RUSTSEC-xxxx, GO-xxxx, OSV-xxxx).

Each record includes the OSV ID, summary and full details, all known cross-database aliases, affected package list with version ranges and PURLs, severity entries with CVSS vectors, references, and credits.

๐Ÿ’ก Why it matters: SBOMs are only useful when paired with a fresh vulnerability feed. Building your own ingestion means handling 30+ ecosystem schemas, alias deduplication, version-range parsing, and the OSV pagination model. This Actor skips all of that and gives you a clean, downloadable dataset.


๐ŸŽฌ Full Demo

๐Ÿšง Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded vulnerability dataset.


โš™๏ธ Input

InputTypeDefaultBehavior
modeenum"query"query searches by package or commit, byId fetches by vulnerability ID.
packageNamestring""Package to look up (e.g. requests, lodash).
ecosystemenum""One of 30 ecosystems. Empty for cross-ecosystem search.
packageVersionstring""Pin to a version. Without a package, returns vulnerabilities affecting that version across packages.
commitstring""Git commit SHA to search by.
vulnerabilityIdstring""Single vulnerability ID for mode=byId.
vulnerabilityIdsstring[][]Batch list of IDs (recommended max ~100 per run).
maxItemsinteger10Records to return. Free plan caps at 10, paid plan at 1,000,000.

Example: every advisory affecting npm lodash.

{
"mode":"query",
"packageName":"lodash",
"ecosystem":"npm",
"maxItems":100
}

Example: batch lookup of the Log4Shell aliases.

{
"mode":"byId",
"vulnerabilityIds":["GHSA-jfh8-c2jp-5v3q","CVE-2021-44228","GHSA-7rjr-3q55-vv33"]
}

โš ๏ธ Good to Know: OSV records use cross-database aliases, so the same vulnerability can appear under several IDs (e.g. GHSA-jfh8-c2jp-5v3q aliases CVE-2021-44228). When you batch-fetch related IDs, expect duplicate records pointing to the same root advisory.


๐Ÿ“Š Output

Each record contains 16 fields. Download the dataset as CSV, Excel, JSON, or XML.

๐Ÿงพ Schema

FieldTypeExample
๐Ÿ†” idstring"GHSA-jfh8-c2jp-5v3q"
๐Ÿ”— urlstring"https://osv.dev/vulnerability/GHSA-jfh8-c2jp-5v3q"
๐Ÿ“ summarystring | null"Remote code injection in Log4j"
๐Ÿ“„ detailsstring | null"Apache Log4j2 versions 2.0-beta9 through 2.15.0..."
๐Ÿ”— aliasesstring[]["CVE-2021-44228"]
๐Ÿ”— relatedstring[]["CVE-2021-45046"]
๐Ÿ•’ modifiedISO 8601 | null"2025-01-14T08:36:01Z"
๐Ÿ“… publishedISO 8601 | null"2021-12-10T00:00:35Z"
๐Ÿšซ withdrawnISO 8601 | nullnull
๐Ÿงฑ schema_versionstring | null"1.4.0"
๐Ÿ“ฆ affectedobject[][{ "package": { "ecosystem": "Maven", "name": "org.apache.logging.log4j:log4j-core" }, "ranges": [...] }]
๐Ÿชช purlsstring[]["pkg:maven/org.apache.logging.log4j/log4j-core"]
๐Ÿ“š referencesobject[][{ "type": "ADVISORY", "url": "https://nvd.nist.gov/vuln/detail/CVE-2021-44228" }]
๐ŸŽฏ severityobject[][{ "type": "CVSS_V3", "score": "CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:C/C:H/I:H/A:H" }]
๐ŸŽฏ maxSeverityScorenumber | nullnull
๐Ÿ™Œ creditsobject[] | null[{ "name": "Chen Zhaojun" }]
๐Ÿ•’ scrapedAtISO 8601"2026-05-15T00:00:00.000Z"

๐Ÿ“ฆ Sample record


โœจ Why choose this Actor

Capability
๐Ÿ“ฆ30+ ecosystems. PyPI, npm, Go, Maven, RubyGems, crates.io, NuGet, Packagist, Hex, Pub, Hackage, GitHub Actions, plus Linux distros and R registries.
๐Ÿ”—Cross-database aliases. GHSA, CVE, PYSEC, RUSTSEC, GO, OSV-xxxx all surfaced in one record.
๐ŸชชPURL identifiers. Each affected package is tagged with a Package URL ready to join with SBOM tools.
๐Ÿ“Structured version ranges. ECOSYSTEM, SEMVER, and GIT range types with introduced / fixed events.
๐ŸŽฏSeverity vectors. CVSS v2 / v3 / v4 strings preserved verbatim from the source.
๐Ÿ”Always fresh. Every run hits the live OSV catalogue, so the dataset reflects current entries.
๐ŸšซNo sign-up. Works with public open source security data. No login or token needed.

๐Ÿ“Š OSV is the de facto open source vulnerability database. Owning a clean local copy is a multiplier for SBOM tooling, dependency scanners, and supply-chain risk dashboards.


๐Ÿ“ˆ How it compares to alternatives

ApproachCostCoverageRefreshEcosystemsSetup
โญ OSV Vulnerabilities Scraper (this Actor)$5 free credit, then pay-per-use200,000+ recordsLive per run30+โšก 2 min
Commercial SCA platforms$20,000+/yearCurated subsetStreaming5-15โณ Days
Single-ecosystem feedsFreeSubsetVariable1๐Ÿ› ๏ธ Hours
Self-built ingestionEngineering timeFullCustomCustom๐Ÿข Weeks

Pick this Actor when you want broad ecosystem coverage with cross-database aliases and no parser maintenance.


๐Ÿš€ How to use

  1. ๐Ÿ“ Sign up. Create a free account with $5 credit (takes 2 minutes).
  2. ๐ŸŒ Open the Actor. Go to the OSV Vulnerabilities Scraper page on the Apify Store.
  3. ๐ŸŽฏ Set input. Pick a mode, enter a package + ecosystem, a commit SHA, or a vulnerability ID, then set maxItems.
  4. ๐Ÿš€ Run it. Click Start and let the Actor collect your data.
  5. ๐Ÿ“ฅ Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.

โฑ๏ธ Total time from signup to downloaded dataset: 3-5 minutes. No coding required.


๐Ÿ’ผ Business use cases

๐Ÿ› ๏ธ DevSecOps & SBOM

  • Daily SBOM enrichment with vulnerability metadata
  • CI gate that fails builds when dependencies have known fixes
  • PURL-based joins between asset inventory and OSV records
  • Alert when patched versions land for tracked dependencies

๐Ÿ“ฆ Package Registry Operators

  • Surface advisories on package detail pages
  • Notify maintainers when their package gets a new advisory
  • Build ecosystem-wide vulnerability trend dashboards
  • Cross-link aliases across CVE / GHSA / OSV identifiers

๐Ÿณ Container & Image Security

  • Scan distro layers (Debian, Ubuntu, Alpine, Wolfi) against OSV
  • Build image-risk reports filtered by base OS
  • Track Wolfi and Chainguard advisories for hardened images
  • Continuous monitoring of running container fleets

๐Ÿ” Supply-Chain Risk Analysis

  • Vendor due-diligence reports based on dependency graphs
  • Open-source risk scoring for procurement teams
  • M&A technical due diligence on target codebases
  • Insurance underwriting models for cyber risk

๐Ÿ”Œ Automating OSV Vulnerabilities Scraper

Control the scraper programmatically for scheduled runs and pipeline integrations:

  • ๐ŸŸข Node.js. Install the apify-client NPM package.
  • ๐Ÿ Python. Use the apify-client PyPI package.
  • ๐Ÿ“š See the Apify documentation for full details.

The Apify Schedules feature lets you trigger this Actor on any cron interval. Hourly, daily, or weekly refreshes keep your downstream SBOM tooling and dependency scanners in sync automatically.


๐ŸŒŸ Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

๐ŸŽ“ Research and academia

  • Open source ecosystem vulnerability studies
  • Cross-ecosystem co-occurrence and aliasing research
  • Coursework on supply-chain security and SBOM tooling
  • Reproducible studies with cited, versioned dataset pulls

๐ŸŽจ Personal and creative

  • Hobbyist dependency dashboards for your own projects
  • Newsletter research on the latest open source advisories
  • Portfolio projects that show off security data engineering
  • Personal alerting bots for libraries you maintain

๐Ÿค Non-profit and civic

  • Open-source maintainers monitoring their package risk
  • Civic-tech projects mapping vulnerability exposure
  • Educational outreach for open-source security literacy
  • Community advocacy around responsible disclosure

๐Ÿงช Experimentation

  • Train ML models on advisory text and severity
  • Prototype agent pipelines that summarise advisories
  • Test SBOM diff tooling against historical waves
  • Build dashboards on top of live ecosystem feeds

๐Ÿค– Ask an AI assistant about this scraper

Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:


โ“ Frequently Asked Questions

๐Ÿงฉ How does it work?

Configure your filters in the input form, click Start, and the Actor pulls matching records from the official OSV catalogue, normalises the schema, and emits one clean record per advisory.

๐Ÿ“ How accurate is the data?

Records are mirror-copies of the OSV catalogue at run time. Affected ranges, severity entries, references, and credits are taken verbatim from the source.

๐Ÿ” How often is the dataset refreshed?

OSV updates continuously as upstream feeds (GitHub, PyPA, RustSec, Go vulnerability database, distro security teams) publish new entries. Every run reflects the catalogue as of run time.

๐ŸŒ Which ecosystems are supported?

PyPI, npm, Go, Maven, RubyGems, crates.io, NuGet, Packagist, Hex, Pub, Hackage, GitHub Actions, Linux kernel, plus Debian, Ubuntu, Alpine, Rocky Linux, AlmaLinux, SUSE, openSUSE, Android, ConanCenter, Bitnami, Photon OS, Mageia, Wolfi, Chainguard, Bioconductor and CRAN. Leave the field empty for cross-ecosystem search.

๐Ÿ”— Why are the same vulnerabilities listed under multiple IDs?

OSV uses cross-database aliasing. The same root advisory may surface as a GHSA on GitHub, a CVE in NVD, a PYSEC in PyPA, and an OSV-xxxx in the OSV namespace. The aliases field links them together.

๐Ÿชช What is a PURL?

PURL (Package URL) is the standard for naming a package across ecosystems, e.g. pkg:npm/lodash or pkg:maven/org.apache.logging.log4j/log4j-core. PURLs are the canonical join key against most SBOM formats.

๐ŸŽฏ Why is maxSeverityScore sometimes null?

The Actor parses numeric severity scores when the source supplies them as raw numbers. CVSS vector strings (e.g. CVSS:3.1/AV:N/...) are preserved in the severity array but their base score is not re-derived. Use the vector string with a CVSS calculator if you need the exact number.

โฐ Can I schedule regular runs?

Yes. Use Apify Schedules to run this Actor on any cron interval. A common pattern is a daily schedule that pulls every advisory across the npm and PyPI ecosystems and pushes them into a SBOM tool.

โš–๏ธ Is this data legal to use?

OSV is published under permissive open licensing. You should review the upstream source license for your specific application but raw vulnerability metadata is generally public.

๐Ÿ’ณ Do I need a paid Apify plan to use this Actor?

No. The free Apify plan is enough for testing and small runs (10 records per run). A paid plan lifts the limit and gives you scheduling, higher concurrency, and larger datasets.

๐Ÿ†˜ What if I need help?

Our support team is here to help. Contact us through the Apify platform or use the Tally form linked below.


๐Ÿ”Œ Integrate with any app

OSV Vulnerabilities Scraper connects to any cloud service via Apify integrations:

  • Make - Automate multi-step workflows
  • Zapier - Connect with 5,000+ apps
  • Slack - Get advisory alerts in your security channels
  • Airbyte - Pipe OSV data into your warehouse
  • GitHub - Trigger runs from commits and releases
  • Google Drive - Export datasets straight to Sheets

You can also use webhooks to trigger downstream actions when a run finishes. Push fresh advisory data into your SBOM tooling, or alert your team in Slack when a new advisory hits a tracked package.


๐Ÿ”— Recommended Actors

๐Ÿ’ก Pro Tip: browse the complete ParseForge collection for more security and reference-data scrapers.


๐Ÿ†˜ Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.


โš ๏ธ Disclaimer: this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by OSV.dev, Google, or any of the upstream feed maintainers. All trademarks mentioned are the property of their respective owners. Only publicly available open source vulnerability data is collected.

You might also like

OSV.dev Vulnerabilities Scraper

crawlerbros/osv-vulnerabilities-scraper

Scrape OSV.dev, Google's open vulnerability database covering NPM, PyPI, Go, Maven, NuGet, Cargo, RubyGems, GitHub Actions, OS distros, and more. Look up vulnerabilities by package, fetch a specific OSV/GHSA/CVE record, or batch-query an entire dependency tree.

NPM Vulnerability Checker

automation-lab/npm-vulnerability-checker

Check npm packages for known vulnerabilities via OSV.dev API. Input package names with optional versions โ€” get CVE IDs, severity (CRITICAL/HIGH/MEDIUM/LOW), affected ranges, and fix versions. No proxy needed.

๐Ÿ‘ User avatar

Stas Persiianenko

5

CVEs Search API Scraper

dev00/cves-search-apify

Programmatically query the CVE vulnerability databases for historical vulnerabilities, packages vulnerabilities, and detailed CVE record files.

dev00

2

PyPI Vulnerability Scraper

taroyamada/pypi-package-intelligence

Extract Python package metadata from PyPI and enrich it with OSV database alerts. Monitor dependencies for new version releases and critical CVE identifiers.

NPM & PyPI CVE Monitor

conceivable_extension/npm-cve-monitor

Checks npm and PyPI packages against the OSV vulnerability database and npm registry. Detects CVEs, suspicious maintainer patterns, and supply chain risks. Paste your package.json or requirements.txt. ยฃ0.001 per package checked.

2

GitHub Security Advisories Scraper

parseforge/github-security-advisories-ghsa-scraper

Scrape GitHub Security Advisories (GHSA) with full CVE mapping, CVSS severity scores, affected packages, version ranges, references, and patches. Filter by ecosystem (npm, PyPI, RubyGems, etc.) and severity. Perfect for SOC teams, vulnerability researchers, and DevSecOps pipelines.