VOOZH about

URL: https://apify.com/parseforge/ncbi-dbsnp-variants-scraper

โ‡ฑ NCBI dbSNP Variants Scraper ยท Apify


Pricing

from $19.00 / 1,000 results

Go to Apify Store

NCBI dbSNP Variants Scraper

Discover medical and biomedical records from Ncbi Dbsnp Variants with names, identifiers, classifications, descriptions, status and source links. Ideal for healthcare research, pharma teams and clinical analytics. Run on demand or on a recurring schedule and feed every row into your favourite ana.

Pricing

from $19.00 / 1,000 results

Rating

0.0

(0)

Developer

๐Ÿ‘ ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a month ago

Last modified

Categories

Share

๐Ÿ‘ ParseForge Banner

๐Ÿงฌ NCBI dbSNP Variants Scraper

๐Ÿš€ Pull human SNP variants from NCBI dbSNP in seconds. rsIDs, chromosome and position, alleles, functional class, gene context, clinical significance and global minor allele frequencies from the official NIH database.

๐Ÿ•’ Last updated: 2026-05-27 ยท ๐Ÿ“Š 22 fields per record ยท 1B+ rsIDs ยท Global population frequencies

NCBI dbSNP is the world's authoritative public catalogue of single-nucleotide variants. This scraper wraps the official E-utilities esearch + esummary flow and returns a clean, structured table for any gene, condition or rsID query.

Every record carries the rsID, SPDI string, chromosome position, allele, functional class (intron/upstream/exon), gene symbols and Entrez IDs, validation status, clinical significance and global MAFs from 1000Genomes, gnomAD, TOPMED, TOMMO, ALFA and more.

๐ŸŽฏ Target Audience๐Ÿ’ก Primary Use Cases
Geneticists and bioinformaticiansPull variant tables for a gene of interest
Clinical researchersBuild pathogenic-variant lists for a condition
Pharma and biotechAnnotate genotyping panels
Academic teamsRun reproducible analyses without flat-file pulls
Data engineersPipe dbSNP into your variant warehouse

๐Ÿ“‹ What the NCBI dbSNP Variants Scraper does

  • Calls the official E-utilities esearch to resolve a gene/condition/rsID query
  • Calls esummary to fetch full variant metadata
  • Returns rsID, SPDI, chromosome, position, alleles, gene context, clinical significance, global MAFs
  • Stream-delivers to multiple table outputs

๐Ÿ’ก Why it matters: every clinical and pharmacogenomic analysis starts with annotating variants. dbSNP is the canonical source - and this actor makes it queryable from a spreadsheet workflow.

๐ŸŽฌ Full Demo (๐Ÿšง Coming soon)

โš™๏ธ Input

FieldTypeDescription
querystringGene symbol, rsID or condition keyword
maxItemsintegerCap on records returned (free plan: 10)
{"query":"BRCA1","maxItems":25}
{"query":"rs328","maxItems":1}

โš ๏ธ Good to Know: NCBI E-utilities is rate-limited to 3 requests/sec without an API key. The actor batches IDs into a single esummary call to stay well under the limit.

๐Ÿ“Š Output

FieldDescription
๐Ÿ†” rsIddbSNP rs identifier
๐Ÿท snpClassSNV / insertion / deletion / etc.
๐Ÿงฌ chromosome / position / accession / spdiGenomic location
๐Ÿ”ก allelesAllele code
๐Ÿ“‹ functionalClassIntron / upstream / exon / etc.
๐Ÿงช geneSymbols / geneIdsGene context
โš•๏ธ clinicalSignificanceBenign / pathogenic / etc.
โœ… validatedValidation status
๐Ÿ“Š globalMaf / globalMafsGlobal minor allele frequencies
๐Ÿท handle / taxonomyIdSubmitter and species
๐Ÿ“… createDate / updateDate / origBuild / updBuildProvenance
๐Ÿ“ hgvsHGVS notation
๐Ÿ”— sourceUrldbSNP page
๐Ÿ•’ scrapedAtISO timestamp

โœจ Why choose this Actor

  • ๐Ÿ†“ Public NIH/NCBI data, no auth required
  • ๐Ÿ“ก Direct hit on the official E-utilities API
  • ๐Ÿงฌ Returns global MAFs from 25+ populations
  • ๐Ÿงฐ Clean field names - no feed parsing
  • ๐Ÿ“ฆ Pull as multiple table outputs

๐Ÿ“ˆ How it compares to alternatives

ApproachCostCoverageSetup time
Manual VCF pulls from NCBI FTPFreeBulk onlyHours
Direct E-utilities callsFreeFullCode required
ParseForge dbSNP ScraperPay-per-resultFull + structuredMinutes

๐Ÿš€ How to use

  1. Create a free Apify account (includes $5 credit).
  2. Open the NCBI dbSNP Variants Scraper.
  3. Set query (gene symbol, rsID or condition).
  4. Click Start and use multiple table outputs.
  5. Schedule or trigger from your bioinformatics pipeline.

๐Ÿ’ผ Business use cases

Pharmacogenomics - annotate a drug-response panel with current dbSNP records.

Clinical decision support - pull pathogenic variants for a condition.

Genotyping QC - verify variant annotations match the live dbSNP record.

Variant database curation - keep your internal warehouse in sync with NCBI updates.

๐Ÿ”Œ Automating NCBI dbSNP Variants Scraper

Hook into Make, Zapier, n8n, Airbyte, Pipedream, Slack, GitHub Actions or any HTTP webhook.

๐ŸŒŸ Beyond business use cases

  • Research: explore the global frequency of a candidate variant.
  • Personal: annotate your own genotyping report from 23andMe / Ancestry.
  • Non-profit: support rare-disease variant research.
  • Experimentation: train ML models on annotated variant tables.

๐Ÿค– Ask an AI assistant about this scraper

Ask ChatGPT, Claude, Perplexity or Copilot: "How do I pull every pathogenic BRCA1 variant from NCBI dbSNP using the ParseForge Apify actor?"

โ“ Frequently Asked Questions

Do I need an NCBI API key? No, but providing one increases your rate limit to 10 req/sec. The actor uses unauthenticated mode by default.

Is dbSNP human only? Currently human-focused. Other species are available via the same E-utilities pattern but with different taxonomyId.

Can I query by rsID directly? Yes - set query to rs328 or 328.

Are clinical-significance annotations from ClinVar? dbSNP propagates ClinVar annotations into the summary. Always verify in ClinVar for clinical use.

What's SPDI? The Sequence-Position-Deletion-Insertion notation: NCBI's modern standard for variant representation.

How fresh is dbSNP? dbSNP releases new builds periodically. The actor returns whatever the live API serves.

Can I get VCF output? This actor produces a tabular summary. Combine with NCBI's VCF use for raw genotyping.

Is the actor rate-limited? The actor stays under 3 req/sec to comply with E-utilities limits.

Are alternate assemblies supported? The actor returns the canonical GRCh38 position. Other assemblies require manual liftover.

Can I batch thousands of rsIDs? Yes - set maxItems accordingly. The actor batches into a single esummary call.

๐Ÿ”Œ Integrate with any app

Apify, Make, Zapier, n8n, Pipedream, Slack, Airbyte, GitHub, Google Drive, Power Automate, AWS Lambda, REST webhook.

๐Ÿ”— Recommended Actors

ActorWhat it does
OpenAlex Institutions ScraperGlobal research institutions
EU Clinical Trials Register ScraperClinical trial records
NHTSA Vehicle Complaints ScraperUS vehicle complaint data

๐Ÿ’ก Pro Tip: browse the complete ParseForge collection for more government and research data scrapers.

๐Ÿ†˜ Need Help? Open our contact form

โš ๏ธ Disclaimer: independent tool, not affiliated with NCBI or NIH. Only publicly available open data is collected.

You might also like

NCBI ClinVar Variants Scraper - Genetic Variation Data

parseforge/ncbi-clinvar-variants-scraper

Sweep medical and biomedical records from Ncbi Clinvar Variants with names, identifiers, classifications, descriptions, status and source links. Ideal for healthcare research, pharma teams and clinical analytics. Run on demand or on a recurring schedule and feed every row into your favourite anal.

UCSC Genome Browser Tracks Scraper

parseforge/ucsc-genome-browser-tracks-scraper

Track medical and biomedical records from Ucsc Genome Browser Tracks with names, identifiers, classifications, descriptions, status and source links. Designed for healthcare research, pharma teams and clinical analytics. Run on demand or on a recurring schedule and feed every row into your favour.

Ulta Beauty Products Scraper

parseforge/ulta-scraper

Track structured records from Ulta with names, identifiers, dates, descriptions, status flags and source links. Designed for research, intelligence and operational dashboards. Run on demand or on a recurring schedule and feed every row into your favourite analytics or workflow stack.

Dryad Research Datasets Scraper

parseforge/datadryad-datasets-scraper

Gather structured records from Datadryad Datasets with names, identifiers, dates, descriptions, status flags and source links. Loved by research, intelligence and operational dashboards. Run on demand or on a recurring schedule and feed every row into your favourite analytics or workflow stack.

MercadoLibre Mexico Offers Scraper

parseforge/mercadolibre-ofertas-scraper

Track structured records from Mercadolibre Ofertas with names, identifiers, dates, descriptions, status flags and source links. Designed for research, intelligence and operational dashboards. Run on demand or on a recurring schedule and feed every row into your favourite analytics or workflow stack.

OpenAlex Topics Scraper

parseforge/openalex-topics-scraper

Scale your structured records from Openalex Topics with names, identifiers, dates, descriptions, status flags and source links. Trusted by research, intelligence and operational dashboards. Run on demand or on a recurring schedule and feed every row into your favourite analytics or workflow stack.

Tradera Sweden Auctions Scraper

parseforge/tradera-scraper

Scale your structured records from Tradera with names, identifiers, dates, descriptions, status flags and source links. Trusted by research, intelligence and operational dashboards. Run on demand or on a recurring schedule and feed every row into your favourite analytics or workflow stack.

๐Ÿ—‚๏ธ G2 Software Categories Scraper

parseforge/g2-software-categories-scraper

Tap into structured records from G2 Software Categories with names, identifiers, dates, descriptions, status flags and source links. Loved by research, intelligence and operational dashboards. Run on demand or on a recurring schedule and feed every row into your favourite analytics or workflow st.

OpenAlex Institutions Scraper

parseforge/openalex-institutions-scraper

Gather structured records from Openalex Institutions with names, identifiers, dates, descriptions, status flags and source links. Loved by research, intelligence and operational dashboards. Run on demand or on a recurring schedule and feed every row into your favourite analytics or workflow stack.

Crossref Journals Scraper - Academic Publication Metadata

parseforge/crossref-journals-scraper

Unlock structured records from Crossref Journals with names, identifiers, dates, descriptions, status flags and source links. Designed for research, intelligence and operational dashboards. Run on demand or on a recurring schedule and feed every row into your favourite analytics or workflow stack.