VOOZH about

URL: https://apify.com/parseforge/hpo-phenotypes-scraper

โ‡ฑ HPO Human Phenotype Ontology Scraper ยท Apify


๐Ÿ‘ HPO Human Phenotype Ontology Scraper avatar

HPO Human Phenotype Ontology Scraper

Pricing

from $7.50 / 1,000 results

Go to Apify Store

HPO Human Phenotype Ontology Scraper

Search the Human Phenotype Ontology by keyword or HP ID and pull back terms with hpoId, name, definition, synonyms, parents, children, plus optional associated diseases and genes. Useful for rare disease research, clinical curation, and genomic variant annotation pipelines.

Pricing

from $7.50 / 1,000 results

Rating

0.0

(0)

Developer

๐Ÿ‘ ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

25 days ago

Last modified

Share

๐Ÿ‘ ParseForge Banner

๐Ÿง  HPO Phenotypes Scraper

๐Ÿš€ Export Human Phenotype Ontology terms in seconds. HPO ID, name, definition, synonyms, parents, children, associated diseases, and genes.

๐Ÿ•’ Last updated: 2026-06-05 ยท ๐Ÿ“Š 10 fields per record ยท Jackson Lab HPO API ยท 17,000+ phenotype terms ยท Real-time

The HPO Phenotypes Scraper turns the public Jackson Laboratory HPO API into a flat dataset of phenotype terms. Search by keyword or HPO ID, and optionally enrich each term with its associated diseases and causal genes.

The Human Phenotype Ontology is the standard vocabulary of phenotypic abnormalities used in clinical genetics worldwide.

๐ŸŽฏ Target Audience๐Ÿ’ก Primary Use Cases
๐Ÿงฌ Clinical geneticistsMap patient phenotypes to candidate genes
๐Ÿ”ฌ Rare disease researchersBuild phenotype profiles for cohorts
๐Ÿฉบ Diagnostic teamsEnrich HPO term picklists
๐Ÿค– BioinformaticiansMirror HPO into local pipelines
๐Ÿ“Š Pharma R&DTrack phenotypes linked to drug targets
๐Ÿ‘ฉโ€๐Ÿ’ป DevelopersSkip the OBO file parsing

๐Ÿ“‹ What the HPO Phenotypes Scraper does

  • Searches the HPO term catalog by keyword or resolves a single HPO ID.
  • For each term, returns name, definition, synonyms, parent and child terms.
  • Optionally enriches with associated OMIM / Orphanet diseases and causal genes.
  • Flattens nested arrays to delimited strings for spreadsheet imports.
  • Returns clean error rows on empty searches.
  • Exports to your preferred dataset format.

๐Ÿ’ก Why it matters: Most HPO consumers download the entire OBO file, parse it themselves, and then build a search index. This actor lets you query HPO directly with one click.

๐ŸŽฌ Full Demo

๐Ÿšง Coming soon.

โš™๏ธ Input

FieldTypeRequiredDescription
searchTermstringNoKeyword or HPO ID like HP:0001250.
maxItemsintegerNoFree 10, paid up to 1,000,000.
includeAssociationsbooleanNoFetch associated diseases and genes per term.

Example 1, search for 'seizure'

{"searchTerm":"seizure","maxItems":20,"includeAssociations":true}

Example 2, lookup a single HPO ID

{"searchTerm":"HP:0001250"}

โš ๏ธ Good to Know: Disease and gene enrichment adds one extra request per term. Disable it for fast catalog dumps.

๐Ÿ“Š Output

FieldTypeDescription
๐Ÿ†” hpoIdstringHPO term identifier.
๐Ÿง  namestringTerm name.
๐Ÿ“– definitionstringFormal definition.
๐Ÿ”„ synonymsstringExact and related synonyms.
โฌ†๏ธ parentsstringParent term IDs.
โฌ‡๏ธ childrenstringChild term IDs.
๐Ÿฉบ associatedDiseasesstringOMIM / Orphanet disease names.
๐Ÿงฌ genesstringAssociated gene symbols.
๐Ÿ•’ scrapedAtstringFetch timestamp.
โŒ errorstringError message if any.
{
"hpoId":"HP:0001250",
"name":"Seizure",
"definition":"A seizure is an intermittent abnormality of nervous system physiology...",
"synonyms":"Seizures; Epileptic seizure",
"parents":"HP:0012638",
"children":"HP:0002197; HP:0002353",
"associatedDiseases":"Dravet syndrome; Tuberous sclerosis",
"genes":"SCN1A, TSC1, TSC2",
"scrapedAt":"2026-06-05T12:00:00.000Z",
"error":null
}

โœจ Why choose this Actor

| ๐Ÿ†“ | Public Jackson Lab API, no key required. | | ๐Ÿงน | Flat rows, no OBO parsing. | | ๐ŸŽฏ | Optional disease and gene enrichment per term. | | ๐Ÿ”Œ | One actor, the full HPO. | | ๐Ÿ’พ | Push to dataset, instant export. |

๐Ÿ“ˆ How it compares to alternatives

ApproachSetupSearchEnrichment
Download OBO and parseHoursBuild your own indexManual joins
HPO API + custom client30 minYesManual
This Actor5 secBuilt-inBuilt-in

๐Ÿš€ How to use

  1. Click Try for free.
  2. Type a phenotype keyword or HPO ID.
  3. Toggle disease and gene enrichment.
  4. Click Start.

๐Ÿ’ผ Business use cases

๐Ÿงฌ Phenotype-driven diagnosis. Pull HPO terms with the right disease links for clinical decision support.

๐Ÿ“Š Cohort building. Build phenotype profiles for rare disease cohorts.

๐Ÿค– Curation tools. Mirror HPO into your internal annotation UI.

๐Ÿ“ˆ Drug-target research. Map phenotypes to target genes.

๐Ÿ”Œ Automating HPO Phenotypes Scraper

  • Make / Zapier trigger and push to Sheets or a database.
  • Cron scheduler via Apify.
  • Webhooks on run completion.
  • Pipe to BigQuery / Snowflake / Postgres via integrations.

๐ŸŒŸ Beyond business use cases

๐ŸŽ“ Teaching. Walk medical students through phenotype hierarchies.

๐Ÿงช Personal research. Build a phenotype tracker for a rare condition.

๐Ÿค Open science. Public phenotype-disease maps.

๐Ÿงฐ Prototyping. Add HPO autocomplete to a new tool in minutes.

๐Ÿค– Ask an AI assistant about this scraper

Paste this README into your assistant and describe your phenotype workflow.

โ“ Frequently Asked Questions

โ“ Do I need an API key? No.

โ“ Can I look up a single ID? Yes, paste it in searchTerm.

โ“ How big is HPO? 17,000+ terms.

โ“ What's the rate limit? Be reasonable, the API is public.

โ“ Are arrays flattened? Yes, semicolon-joined.

โ“ Can I skip the enrichment? Yes, toggle off.

โ“ Can I schedule runs? Yes.

โ“ Is this scraping? API only.

โ“ Will the schema change? Core fields stable.

โ“ Download format? Any format Apify supports.

๐Ÿ”Œ Integrate with any app

Apify ships native integrations with Make, Zapier, Slack, Discord, Google Drive, Google Sheets, Gmail, Airbyte, Keboola, Telegram, GitHub, and any REST or webhook endpoint.

๐Ÿ”— Recommended Actors

ActorWhat it does
ParseForge ClinVar Variants ScraperVariant interpretations.
ParseForge dbSNP Variants ScraperNCBI dbSNP variants.
ParseForge Disease Ontology ScraperDisease Ontology terms.
ParseForge NIH Reporter Grants ScraperNIH funded grants.

๐Ÿ’ก Pro Tip: browse the complete ParseForge collection for 900+ production-grade scrapers across business intelligence, real estate, e-commerce, sports, finance, and public records.


Disclaimer: This actor scrapes only publicly available data. ParseForge is not affiliated with Jackson Laboratory or HPO. Users are responsible for complying with the target site's terms of service and applicable law. Create a free account w/ $5 credit.

You might also like

HPO Human Phenotype Ontology Scraper

parseforge/hpo-phenotype-jax-scraper

Search the Human Phenotype Ontology from Jackson Laboratory for clinical phenotype terms. Returns HPO IDs, term names, definitions, synonyms, and parent classes. Useful for rare disease diagnostics, phenotype to gene mapping, and building structured clinical descriptors.

Mondo Disease Ontology Scraper

parseforge/mondo-disease-scraper

Pull harmonized disease terms from the Mondo Disease Ontology. Query a keyword or stream the full hierarchy and collect Mondo IDs, labels, definitions, synonyms, and cross references to OMIM, Orphanet, DOID, and MeSH. Useful for rare disease research, ontology mapping, and EHR coding.

EBI Ontology Lookup Service Scraper

parseforge/ebi-ols-ontology-scraper

Browse the EBI Ontology Lookup Service catalogue of biomedical ontologies. Returns ontology id, title, description, term count, property count, version, homepage, and namespace. Useful for bioinformatics tooling, ontology alignment, and semantic research across life science domains.

Orphanet Rare Diseases Scraper

parseforge/orphanet-rare-diseases-scraper

Pull rare disease records from Orphanet and Orphadata by ORPHAcode. Returns the preferred name, synonyms, disorder group, clinical definition, plus mappings to ICD-10, ICD-11, OMIM, UMLS, and MeSH. Useful for terminology mapping, health record enrichment, and rare disease research.

EBI OLS Ontologies List Scraper

parseforge/ebi-ols-ontologies-list-scraper

Browse every ontology indexed by the EMBL EBI Ontology Lookup Service. Returns ontology identifiers, titles, descriptions, term counts, version, and source URLs. Useful for picking the right vocabulary for biomedical annotation, FAIR data harmonization, and semantic search index building.

NCBI ClinVar Variant Scraper

parseforge/clinvar-esummary-scraper

Query NCBI ClinVar for human genetic variants tied to disease. Search by gene symbol, variant ID, or clinical significance and pull variation IDs, gene info, molecular consequence, clinical assertions, and review status. Useful for clinical genomics, variant curation, and research.

Rare Disease Lookup โ€” OMIM + Orphanet + MedGen API

azureblue/rare-disease-scraper

Search rare diseases via NCBI MedGen. Returns disease name, definition, synonyms, semantic type and cross-references to OMIM, Orphanet and SNOMED CT.

Ensembl Genomics Scraper (Genes, Variants, Sequences)

parseforge/ensembl-genomics-scraper

Query the Ensembl genome reference for 200+ species. Look up genes by symbol or stable ID, list features in a genomic region, fetch DNA sequence, or resolve human variants (rsIDs). Returns biotype, coordinates, transcript IDs, descriptions, and assembly metadata.

NLM MeSH SPARQL Scraper

parseforge/mesh-sparql-scraper

Pull terms from the NLM Medical Subject Headings SPARQL endpoint by release year and optional label substring. Returns descriptor URIs, preferred labels, tree numbers, scope notes, and concept relationships. Useful for biomedical indexing, literature search tooling, and ontology alignment.