URL: https://apify.com/parseforge/open-targets-platform-scraper

Open Targets Platform Scraper · Apify


Pricing

from $3.75 / 1,000 result items

Open Targets Platform Scraper

Export drug, target, and disease intelligence from the Open Targets Platform. Pull 63,000+ targets, 20,000+ diseases, and 14,000+ drugs with association scores, tractability, mechanisms of action, clinical phase, and cross-references for drug discovery research.

Rating: 0.0 (0)

Developer: ParseForge (Maintained by Community)

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: a month ago

🧬 Open Targets Platform Scraper

🚀 Export drug, target, and disease intelligence in seconds. Pull 63,000+ targets, 20,000+ diseases, and 14,000+ drugs from the Open Targets Platform with evidence scores, tractability, mechanisms of action, and clinical phase. No registration, no manual extraction.

🕒 Last updated: 2026-05-20 · 📊 18 fields per record · 🧬 63k+ targets · 🦠 20k+ diseases · 💊 14k+ drugs

The Open Targets Platform Scraper exports drug discovery intelligence from the Open Targets Platform and returns up to 18 fields per record, including identifiers, synonyms, descriptions, tractability assessments, target-disease associations, mechanisms of action, and clinical-trial indications. The underlying dataset is the reference open knowledge graph for therapeutic target identification, jointly built by EMBL-EBI, GSK, Sanofi, Bristol Myers Squibb, and other contributors.

Coverage spans 63,000+ targets (Ensembl IDs), 20,000+ diseases (EFO/MONDO), and 14,000+ drugs (ChEMBL) with association scoring, druggability tiers, ubiquitination signals, pathway memberships, and disease-trait phenotype maps.

| 🎯 Target Audience | 💡 Primary Use Cases |
| --- | --- |
| Pharma R&D teams, bioinformatics groups, biotech analysts, academic researchers, drug-repurposing startups, clinical-trial intelligence | Target prioritization, disease-target evidence reports, drug-repurposing scans, tractability snapshots, mechanism-of-action lookups, indication exploration |

📋 What the Open Targets Platform Scraper does

Three entity-specific workflows in a single run:

  • 🧬 Targets. Pull gene-level records with approved symbol, biotype, synonyms, protein IDs, tractability across small-molecule / antibody / PROTAC / oligo modalities, pathway memberships, and the top associated diseases with evidence scores.
  • 🦠 Diseases. Pull EFO/MONDO records with descriptions, synonyms, therapeutic areas, cross-references, ancestors/descendants, and the top associated targets.
  • 💊 Drugs. Pull ChEMBL drug records with type, maximum clinical phase, approval status, withdrawals, trade names, mechanisms of action, and indication lists.

Each record includes a direct link back to the entity's page on platform.opentargets.org for spot-checking.
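As a rough illustration of that link-back, the sketch below builds a platform page URL from an entity type and ID. It assumes the public web app uses a `/<entity type>/<id>` path layout; the helper name `entity_url` is hypothetical, and the Actor's actual `sourceUrl` value may be formatted differently.

```python
# Assumption: the Open Targets web app serves entity pages at /<entity type>/<id>.
BASE_URL = "https://platform.opentargets.org"
ENTITY_TYPES = ("target", "disease", "drug")


def entity_url(entity_type: str, entity_id: str) -> str:
    """Build a platform page link for one entity (illustrative helper)."""
    if entity_type not in ENTITY_TYPES:
        raise ValueError(f"unknown entity type: {entity_type!r}")
    return f"{BASE_URL}/{entity_type}/{entity_id}"


print(entity_url("target", "ENSG00000157764"))
# prints https://platform.opentargets.org/target/ENSG00000157764
```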

💡 Why it matters: drug discovery teams spend significant time stitching together target-disease evidence from disparate genetic, expression, and clinical sources. Open Targets is the canonical pre-stitched graph. This Actor flattens the relevant subset into a CSV/JSON snapshot you can drop into a notebook, dashboard, or LLM workflow.


🎬 Full Demo

🚧 Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded dataset.


⚙️ Input

| Input | Type | Default | Behavior |
| --- | --- | --- | --- |
| maxItems | integer | 10 | Records to return. Free plan caps at 10, paid plan at 1,000,000. |
| entityType | string | "target" | One of target, disease, or drug. |
| searchQuery | string | "" | Free-text search. Examples: BRAF, melanoma, pembrolizumab. |
| targetIds | string[] | [] | Ensembl gene IDs (used when entityType is target). |
| diseaseIds | string[] | [] | EFO or MONDO IDs (used when entityType is disease). |
| drugIds | string[] | [] | ChEMBL IDs (used when entityType is drug). |

Example: top 25 search hits for "BRAF" as targets.

{
  "maxItems": 25,
  "entityType": "target",
  "searchQuery": "BRAF"
}

Example: enrich a list of known drugs.

{
  "maxItems": 5,
  "entityType": "drug",
  "drugIds": ["CHEMBL1201585", "CHEMBL1201218", "CHEMBL2108524"]
}

⚠️ Good to Know: when a list of IDs is provided, the scraper skips the search step and pulls those entities directly. When the list is empty, the search query is used. Both modes coexist so you can build a targeted ID-batch run or an exploratory search run from the same input form.
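The ID-versus-search rule can be mirrored on the client side before a run is started. The following is a minimal sketch, assuming only the six input fields listed in the table; `build_run_input` is a hypothetical helper and not part of the Actor.

```python
from typing import Optional, Sequence

# Input key that holds the ID list for each entity type (taken from the input table).
ID_FIELD = {"target": "targetIds", "disease": "diseaseIds", "drug": "drugIds"}


def build_run_input(entity_type: str, ids: Optional[Sequence[str]] = None,
                    search_query: str = "", max_items: int = 10) -> dict:
    """Assemble an input object; hypothetical helper, not part of the Actor."""
    if entity_type not in ID_FIELD:
        raise ValueError(f"entityType must be one of {sorted(ID_FIELD)}")
    if max_items < 1:
        raise ValueError("maxItems must be positive")
    run_input = {"maxItems": max_items, "entityType": entity_type}
    if ids:
        # A non-empty ID list makes the Actor skip the search step.
        run_input[ID_FIELD[entity_type]] = list(ids)
    elif search_query:
        # With no IDs, the free-text search query is used instead.
        run_input["searchQuery"] = search_query
    else:
        raise ValueError("provide either ids or search_query")
    return run_input
```

Calling `build_run_input("target", search_query="BRAF", max_items=25)` reproduces the first example input above.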


📊 Output

Each record contains up to 18 fields depending on entity type. Download the dataset as CSV, Excel, JSON, or XML.

🧾 Schema (target)

| Field | Type | Example |
| --- | --- | --- |
| 🧬 entityType | string | "target" |
| 🆔 id | string | "ENSG00000157764" |
| 🔤 approvedSymbol | string | "BRAF" |
| 📛 approvedName | string | "B-Raf proto-oncogene, serine/threonine kinase" |
| 🧪 biotype | string | "protein_coding" |
| 📜 description | string \| null | Protein function summary |
| 🔁 synonyms | string[] \| null | Symbol synonyms |
| 🧫 proteinIds | object[] \| null | UniProt and Ensembl protein IDs |
| 🔗 crossReferences | object[] \| null | DB x-refs |
| 🎯 tractability | object[] \| null | Modality x druggability matrix |
| 📍 subcellularLocations | object[] \| null | Cell compartment evidence |
| 🛣️ pathways | object[] \| null | Reactome pathway memberships |
| 🧬 geneticConstraint | object[] \| null | gnomAD constraint scores |
| 🧪 targetClass | object[] \| null | ChEMBL target-class taxonomy |
| 🦠 associatedDiseases | object \| null | Top disease evidence rows with scores |
| 🔗 sourceUrl | string | Platform page |
| 🕒 scrapedAt | ISO 8601 | Collection timestamp |
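Because many of these fields are nullable, downstream code should not assume a list is present. The sketch below condenses target records into a flat CSV using only field names from the schema; the helper names and the choice of summary columns are illustrative assumptions, and the inner shape of the nested objects is deliberately left untouched.

```python
import csv
import io

# Nullable list-typed fields to count per record (names from the target schema).
LIST_FIELDS = ("synonyms", "pathways", "tractability")


def summarize_target(record: dict) -> dict:
    """Reduce one target record to scalar columns (hypothetical helper)."""
    row = {
        "id": record["id"],
        "approvedSymbol": record.get("approvedSymbol"),
        "biotype": record.get("biotype"),
    }
    for field in LIST_FIELDS:
        # Nullable fields may be missing or None; count both as zero entries.
        row[f"{field}Count"] = len(record.get(field) or [])
    return row


def to_csv(records: list) -> str:
    """Render summarized records as CSV text; empty input gives an empty string."""
    rows = [summarize_target(r) for r in records]
    if not rows:
        return ""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(rows[0]), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```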

📦 Sample records


✨ Why choose this Actor

| | Capability |
| --- | --- |
| 🧬 | Full biomedical graph. 63k+ targets, 20k+ diseases, 14k+ drugs in a single Actor with consistent fields. |
| 🎯 | Multi-modality tractability. Small molecule, antibody, PROTAC, oligonucleotide, and other modality classes per target. |
| 📊 | Association scores. Top-N disease-target evidence with scores baked into the disease and target records. |
| 💊 | Clinical-phase data. Drug records include maximum clinical phase, approval status, withdrawal flags, and indications. |
| ⚡ | Fast. 25 entities in under 30 seconds, hundreds in a few minutes. |
| 🔁 | Always fresh. Every run pulls the latest Open Targets release through the official graph. |
| 🚫 | No authentication. Open scientific data. No login or token required. |

📊 Open Targets is the consensus reference graph for target identification, used by major pharma R&D teams worldwide.


📈 How it compares to alternatives

| Approach | Cost | Coverage | Refresh | Filters | Setup |
| --- | --- | --- | --- | --- | --- |
| ⭐ Open Targets Platform Scraper (this Actor) | $5 free credit, then pay-per-use | 63k targets · 20k diseases · 14k drugs | Live per run | search, ID list, entity type | ⚡ 2 min |
| Manual portal navigation | Free | Per-entity | Manual | None | 🐢 Tedious |
| Direct custom queries | Engineering time | Full | Live | Custom | 🛠️ Days |
| Commercial biomedical databases | $$$$/year | Variable | Variable | Many | ⏳ Long contract |

Pick this Actor when you want a clean, filtered slice of Open Targets without writing custom queries, handling pagination, or maintaining schema knowledge.
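For a sense of what the "direct custom queries" route involves, here is a minimal sketch of a single-target lookup against the public Open Targets GraphQL API using only the standard library. The endpoint URL and the selected fields are assumptions to verify against the Open Targets API documentation, and nothing here describes how the Actor itself is implemented.

```python
import json
import urllib.request

# Assumption: public Open Targets GraphQL endpoint; verify against their docs.
GRAPHQL_URL = "https://api.platform.opentargets.org/api/v4/graphql"

TARGET_QUERY = """
query target($ensemblId: String!) {
  target(ensemblId: $ensemblId) {
    id
    approvedSymbol
    approvedName
    biotype
  }
}
"""


def build_target_request(ensembl_id: str) -> urllib.request.Request:
    """Prepare (but do not send) a POST request for one target."""
    payload = {"query": TARGET_QUERY, "variables": {"ensemblId": ensembl_id}}
    return urllib.request.Request(
        GRAPHQL_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def fetch_target(ensembl_id: str) -> dict:
    """Send the query (network access required) and return the target object."""
    request = build_target_request(ensembl_id)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)["data"]["target"]
```

Pagination, association rows, and schema changes between releases are the parts this sketch leaves out, which is where most of the engineering time in the table goes.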


🚀 How to use

  1. 📝 Sign up. Create a free account with $5 credit (takes 2 minutes).
  2. 🌐 Open the Actor. Go to the Open Targets Platform Scraper page on the Apify Store.
  3. 🎯 Set input. Pick an entity type, then either supply a list of IDs or a free-text search.
  4. 🚀 Run it. Click Start and let the Actor collect your data.
  5. 📥 Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.

⏱️ Total time from signup to downloaded dataset: 3-5 minutes. No coding required.


💼 Business use cases

💊 Pharma R&D and target ID

  • Score and prioritize target candidates by association evidence
  • Build a tractability matrix across SM, antibody, PROTAC modalities
  • Compare disease overlap across a panel of candidates
  • Power weekly target-deck refreshes for portfolio reviews

🧪 Drug repurposing

  • Enumerate indications and trial phases per drug
  • Find shared mechanisms across approved drugs
  • Map drugs to alternate diseases through target overlap
  • Seed retrospective evidence packages for repositioning

📊 Biotech analysts and BD

  • Pull due-diligence snapshots on acquisition targets' assets
  • Generate target/disease landscape reports for clients
  • Track competitive drug pipelines by indication
  • Build investor decks with sourced evidence scores

🧬 Academic bioinformatics

  • Reproducible target/disease enrichment analyses
  • Course datasets for translational informatics modules
  • Hypothesis-generation scans across published evidence
  • Curated cohorts for bench-side validation studies

🔌 Automating Open Targets Platform Scraper

Control the scraper programmatically for scheduled runs and pipeline integrations:

  • 🟢 Node.js. Install the apify-client NPM package.
  • 🐍 Python. Use the apify-client PyPI package.
  • 📚 See the Apify documentation for full details.

The Apify Schedules feature lets you trigger this Actor on any cron interval. Monthly refreshes keep downstream analytics aligned with new Open Targets releases.
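A programmatic run with the Python client might look like the sketch below. It assumes the Actor ID matches the Store URL path and that `ActorClient.call` returns the run as a dictionary, as in apify-client 1.x; `make_client` and `fetch_records` are hypothetical helper names. The client is passed in as a parameter so the logic can be exercised without network access.

```python
ACTOR_ID = "parseforge/open-targets-platform-scraper"  # assumed from the Store URL


def make_client(token: str):
    """Create an API client; needs `pip install apify-client`."""
    from apify_client import ApifyClient  # imported lazily on purpose
    return ApifyClient(token)


def fetch_records(client, run_input: dict) -> list:
    """Start a run, wait for it to finish, and return the dataset items."""
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


# Example (requires a real API token and network access):
#   client = make_client("<YOUR_APIFY_TOKEN>")
#   records = fetch_records(client, {"entityType": "target",
#                                    "searchQuery": "BRAF", "maxItems": 25})
```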


🌟 Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

🎓 Research and academia

  • Translational informatics coursework with reproducible pulls
  • Systematic reviews of target-disease evidence
  • Reproducible dataset snapshots for cited bioinformatics papers
  • Open-data exercises around drug discovery and rare diseases

🎨 Personal and creative

  • Bioscience side projects and visualizations
  • Personal target-disease atlas dashboards
  • Educational explainers and biotech newsletters
  • Hobbyist exploration of human genetics and disease

🤝 Non-profit and civic

  • Rare-disease patient organizations cataloging therapies
  • Open-science publishing with cited evidence rows
  • Drug-access transparency around indications and approvals
  • Hackathons for disease research and biotech tooling

🧪 Experimentation

  • Train biomedical NER and entity linking models
  • Prompt engineering for LLM-driven literature triage
  • Agent workflows that look up tractability and drug pipelines
  • Validate retrieval-augmented biomedical search


❓ Frequently Asked Questions

🧩 How does it work?

Pick an entity type and either drop in a list of identifiers or run a free-text search. The Actor queries the Open Targets graph and returns one structured record per entity, including association rows and tractability matrices.

📏 How accurate is the data?

The records mirror the latest Open Targets release. Evidence scores and clinical phases are computed by the Open Targets pipeline from curated sources (ChEMBL, openFDA, ClinVar, Reactome, and many more). For clinical decisions, validate against primary sources.

🔁 How often is the dataset refreshed?

Open Targets ships full data releases on a regular cadence. Every run of this Actor pulls live from the graph, so your dataset reflects the latest published release at run time.

🧬 Which identifier systems are supported?

Ensembl gene IDs for targets, EFO and MONDO for diseases, and ChEMBL for drugs. Cross-references in the output map to UniProt, dbSNP, RefSeq, DrugBank, and other systems.
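A quick format check can catch mistyped identifiers before they are sent to a run. The patterns below are assumptions about the usual shape of each ID system (for example underscore-separated EFO and MONDO IDs) and may be stricter than what Open Targets accepts; `invalid_ids` is a hypothetical helper.

```python
import re

# Assumed ID shapes; Open Targets may accept further ontology prefixes.
ID_PATTERNS = {
    "target": re.compile(r"ENSG\d{11}"),
    "disease": re.compile(r"(EFO|MONDO)_\d+"),
    "drug": re.compile(r"CHEMBL\d+"),
}


def invalid_ids(entity_type: str, ids: list) -> list:
    """Return the IDs that do not match the assumed pattern for this entity type."""
    pattern = ID_PATTERNS[entity_type]
    return [i for i in ids if not pattern.fullmatch(i)]
```

For instance, passing a gene symbol such as `BRAF` in a target ID list would be flagged, since it belongs in `searchQuery` rather than `targetIds`.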

⏰ Can I schedule regular runs?

Yes. Use Apify Schedules to run this Actor on any cron interval (daily, weekly, monthly) and keep a downstream database in sync with new Open Targets releases.

⚖️ Is this data legal to use?

Open Targets data is published under CC0, which does not itself require attribution, although citing Open Targets is good practice. Underlying source data may carry separate licenses; review the Open Targets documentation for your specific use case.

💼 Can I use this data commercially?

Yes. Open Targets data is free for academic and commercial use. The CC0 license places it in the public domain.

💳 Do I need a paid Apify plan to use this Actor?

No. The free Apify plan is enough for testing and small runs (10 records per run). A paid plan lifts the limit and gives you access to scheduling and higher concurrency.

🔁 What happens if a run fails or gets interrupted?

Apify automatically retries transient errors. Single-entity failures are skipped with a warning and the run continues. Partial datasets are preserved.
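Since skipped entities only surface as warnings, an ID-batch run is worth reconciling against its request list afterwards. A minimal sketch, assuming each output record carries the `id` field from the schema; `missing_ids` is a hypothetical helper.

```python
def missing_ids(requested: list, records: list) -> list:
    """Return requested IDs with no matching record, in request order."""
    returned = {record.get("id") for record in records}
    return [entity_id for entity_id in requested if entity_id not in returned]
```

The resulting list can be fed back in as the ID list for a follow-up run.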

🧪 What if I need full evidence trails?

This Actor returns the top-N association rows per target or disease. For deeper evidence and provenance, query individual entity pages on platform.opentargets.org or request a companion scraper through the contact form.

🆘 What if I need help?

Our support team is here to help. Contact us through the Apify platform or use the Tally form linked below.


🔌 Integrate with any app

Open Targets Platform Scraper connects to any cloud service via Apify integrations:

  • Make - Automate multi-step workflows
  • Zapier - Connect with 5,000+ apps
  • Slack - Get run notifications in your channels
  • Airbyte - Pipe biomedical data into your warehouse
  • GitHub - Trigger runs from commits and releases
  • Google Drive - Export datasets straight to Sheets

You can also use webhooks to trigger downstream actions when a run finishes. Push refreshed biomedical evidence into your product backend or notebook environment.
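On the receiving side, a webhook handler mainly needs to find the finished run's dataset. The sketch below assumes Apify's default webhook payload (an `eventType` string plus a `resource` object describing the run) and the public dataset items endpoint; check both against the Apify webhook documentation before relying on them. `dataset_url_from_webhook` is a hypothetical helper.

```python
import json
from typing import Optional

API_BASE = "https://api.apify.com/v2"


def dataset_url_from_webhook(body: bytes) -> Optional[str]:
    """Return the dataset items URL for a succeeded run, else None."""
    payload = json.loads(body)
    # Assumption: the default payload template with eventType and resource keys.
    if payload.get("eventType") != "ACTOR.RUN.SUCCEEDED":
        return None
    dataset_id = (payload.get("resource") or {}).get("defaultDatasetId")
    if not dataset_id:
        return None
    return f"{API_BASE}/datasets/{dataset_id}/items?format=json"
```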


🔗 Recommended Actors

💡 Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.


🆘 Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.

