VOOZH about

URL: https://apify.com/parseforge/reactome-pathways-scraper

โ‡ฑ Reactome Pathways Scraper ยท Apify


Pricing

from $2.00 / 1,000 results

Go to Apify Store

Reactome Pathways Scraper

Pull curated biology pathways, reactions, proteins, and complexes from the Reactome knowledgebase. List every top level pathway for a species, search by gene or process keyword, or resolve a stable ID. Returns names, species, schema class, disease flags, and direct browser links.

Pricing

from $2.00 / 1,000 results

Rating

0.0

(0)

Developer

๐Ÿ‘ ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a month ago

Last modified

Share

๐Ÿ‘ ParseForge Banner

๐Ÿงฌ Reactome Pathways Scraper

๐Ÿš€ Export curated biology pathways in seconds. Pull the full set of top-level human pathways (29 entries), search 10+ result types per keyword, or look up any entity by stable ID with its summary and sub-event count.

๐Ÿ•’ Last updated: 2026-06-04 ยท ๐Ÿ“Š Up to 13 fields per record ยท 10 species supported ยท Reactome ContentService (open, keyless)

Reactome is a free, open-source, peer-reviewed knowledgebase of biological pathways. This Actor turns its public ContentService into clean, structured records you can drop straight into a notebook, a spreadsheet, or a downstream pipeline. Point it at a species to list every top-level pathway, search by a gene or process keyword, or resolve a single stable identifier such as R-HSA-109581.

Coverage spans curated pathways, reactions, proteins, complexes, and sets. Each record carries the Reactome stable identifier, display name, species, schema class, disease flag, diagram availability, and a direct Pathway Browser link, so you always know exactly which entity you are looking at and where to view it.

๐ŸŽฏ Target Audience๐Ÿ’ก Primary Use Cases
Bioinformaticians and computational biologistsBuilding pathway reference tables for enrichment analysis
Data scientists in life sciences and pharmaMapping genes and proteins to curated pathways
Researchers and graduate studentsPulling pathway summaries and sub-event counts for review
Knowledge engineers and database buildersSeeding internal catalogs with stable Reactome identifiers

๐Ÿ“‹ What the Reactome Pathways Scraper does

  • Lists every top-level pathway for a chosen species (Homo sapiens returns 29 entries).
  • Searches Reactome by keyword and returns matching pathways, reactions, proteins, complexes, and sets.
  • Looks up a single entity by stable identifier and adds a plain-text summary plus a sub-event count.
  • Strips embedded HTML from names and summaries so values are ready to use.
  • Builds a direct Pathway Browser link for every record.

๐ŸŽฌ Full Demo (๐Ÿšง Coming soon)

โš™๏ธ Input

Provide one of three modes. A stable identifier takes priority, then a search term, otherwise the Actor lists top-level pathways for the selected species.

FieldTypeDescription
searchTermstringKeyword to search across Reactome, for example apoptosis, glucose, or TP53.
stIdstringReactome stable identifier to look up a single entity, for example R-HSA-109581.
speciesselectSpecies for the top-level listing. Defaults to Homo sapiens (9606). Ten species supported.
maxItemsintegerFree users limited to 10 items. Paid users up to 1,000,000.

Example 1, list top-level human pathways:

{
"species":"9606",
"maxItems":10
}

Example 2, search by keyword:

{
"searchTerm":"apoptosis",
"maxItems":25
}

โš ๏ธ Good to Know: Reactome search names and summaries arrive wrapped in HTML highlight tags. This Actor removes those tags for you, so the values you receive are clean text. Stable identifiers are species-specific, for example R-HSA for human and R-MMU for mouse.

๐Ÿ“Š Output

Each record is one pathway, reaction, or entity. Conditional fields appear only when the source provides them: name on listing and lookup records, summary when a description exists, subEventCount in lookup mode, and searchType in search mode.

FieldDescription
๐Ÿ†” stIdReactome stable identifier
๐Ÿ”ข dbIdInternal Reactome database identifier
๐Ÿ“Œ displayNameHuman readable name
๐Ÿท nameAll known names, joined
๐Ÿงฌ speciesNameSpecies the entity belongs to
๐Ÿ—‚ schemaClassReactome class, such as Pathway or TopLevelPathway
๐Ÿฉบ isInDiseaseWhether the entity is associated with disease
๐Ÿ–ผ hasDiagramWhether a pathway diagram exists
โœจ hasEHLDWhether an enhanced high level diagram exists
๐Ÿ”— urlDirect Pathway Browser link
๐Ÿ“ summaryPlain text description, when available
๐Ÿ” subEventCountNumber of contained sub-events, lookup mode
๐Ÿ”Ž searchTypeResult type label, search mode
๐Ÿ•’ scrapedAtTimestamp of collection
โŒ errorError message, null on success

Three real records from a live run:

{
"stId":"R-HSA-9612973",
"dbId":9612973,
"displayName":"Autophagy",
"name":"Autophagy",
"speciesName":"Homo sapiens",
"schemaClass":"TopLevelPathway",
"isInDisease":false,
"hasDiagram":true,
"hasEHLD":true,
"url":"https://reactome.org/PathwayBrowser/#/R-HSA-9612973",
"scrapedAt":"2026-06-04T19:42:28.000Z",
"error":null
}
{
"stId":"R-HSA-1640170",
"dbId":1640170,
"displayName":"Cell Cycle",
"name":"Cell Cycle",
"speciesName":"Homo sapiens",
"schemaClass":"TopLevelPathway",
"isInDisease":false,
"hasDiagram":true,
"hasEHLD":true,
"url":"https://reactome.org/PathwayBrowser/#/R-HSA-1640170",
"scrapedAt":"2026-06-04T19:42:28.000Z",
"error":null
}
{
"stId":"R-HSA-1500931",
"dbId":1500931,
"displayName":"Cell-Cell communication",
"name":"Cell-Cell communication",
"speciesName":"Homo sapiens",
"schemaClass":"TopLevelPathway",
"isInDisease":false,
"hasDiagram":true,
"hasEHLD":true,
"url":"https://reactome.org/PathwayBrowser/#/R-HSA-1500931",
"scrapedAt":"2026-06-04T19:42:28.000Z",
"error":null
}

โœจ Why choose this Actor

  • Three modes in one tool, listing, search, and single lookup.
  • Keyless and stable, built on the official Reactome ContentService.
  • Clean values, HTML highlight tags removed before you receive them.
  • Conditional fields only when real, so you never ship always-null columns.
  • Direct Pathway Browser links on every record for instant context.

๐Ÿ“ˆ How it compares to alternatives

ApproachSetupClean outputMulti-modeMaintenance
This ActorNone, run and goYesListing, search, lookupManaged
Manual API callsWrite and host codeYou parse HTML yourselfYou build each callYours
Copy from the websiteSlow and manualNoNoHigh effort

๐Ÿš€ How to use

  1. Create a free Apify account using this sign-up link.
  2. Open the Reactome Pathways Scraper in the Apify Console.
  3. Choose a mode, set a species, type a search term, or paste a stable identifier.
  4. Set how many records you want with the items limit.
  5. Run the Actor and collect your structured pathway records.

๐Ÿ’ผ Business use cases

Pharma and biotech research

NeedHow this helps
Target discovery contextMap proteins and genes to curated pathways
Disease association reviewFilter records by the disease flag

Bioinformatics pipelines

NeedHow this helps
Reference tablesSeed enrichment analysis with stable identifiers
ReproducibilityPin records to versioned Reactome entries

Academic and teaching

NeedHow this helps
Course materialsPull pathway names and summaries for slides
Literature reviewsResolve identifiers to readable descriptions

Knowledge base and catalog building

NeedHow this helps
Internal catalogsPopulate tables with names, species, and links
Cross referencingUse stable identifiers as join keys

๐Ÿ”Œ Automating Reactome Pathways Scraper

Connect the Actor to the tools your team already uses. Trigger runs and route results through Make, Zapier, Slack, Airbyte, GitHub Actions, or Google Drive using the Apify API and integrations. Schedule recurring runs to keep a local pathway catalog fresh, or post new lookups to a Slack channel for your research group.

๐ŸŒŸ Beyond business use cases

  • Research: assemble pathway datasets for a thesis or a paper.
  • Personal: explore the biology behind a gene or a condition you care about.
  • Non-profit: build open educational resources around cell biology.
  • Experimentation: prototype a pathway browser or a graph of biological events.

๐Ÿค– Ask an AI assistant

Pair the structured output with ChatGPT, Claude, Perplexity, or Microsoft Copilot to summarize pathway descriptions, group results by schema class, or draft an enrichment report from your collected records.

โ“ Frequently Asked Questions

Is this Actor affiliated with Reactome? No. It is an independent tool that reads the public Reactome ContentService. Only publicly available data is collected.

Do I need an API key? No. The Reactome ContentService is open and keyless.

Which species are supported? Ten, including Homo sapiens, Mus musculus, Rattus norvegicus, Danio rerio, Drosophila melanogaster, Caenorhabditis elegans, Saccharomyces cerevisiae, Arabidopsis thaliana, Bos taurus, and Gallus gallus.

How many top-level pathways does human return? Twenty-nine at the time of writing. Mouse returns twenty-eight.

What is a stable identifier? A persistent Reactome ID such as R-HSA-109581. The R-HSA prefix marks human entities, R-MMU marks mouse.

What gets returned in search mode? Matching pathways, reactions, proteins, complexes, and sets, each with a result type label.

Why are some fields missing on some records? Fields appear only when the source provides them. Search entries, for example, do not carry diagram flags, so those fields are omitted rather than shipped as null.

Does it return pathway descriptions? Yes, when available. Lookup mode and search results include a plain text summary with HTML tags removed.

How many records can I collect? Free users get up to 10 items per run. Paid users can collect up to 1,000,000.

Can I see the pathway diagram? Each record includes a direct Pathway Browser link where you can open the diagram.

How fresh is the data? Records reflect the live Reactome ContentService at the moment of the run.

Can I automate recurring runs? Yes. Use Apify schedules and integrations to run the Actor on a cadence and route results to your tools.

๐Ÿ”Œ Integrate with any app

Use the Apify API, webhooks, and the integrations catalog to push records into your database, data warehouse, or workflow automation tool. Every run produces a structured dataset you can fetch programmatically.

๐Ÿ”— Recommended Actors

๐Ÿ’ก Pro Tip: browse the complete ParseForge collection.

๐Ÿ†˜ Need Help? Open our contact form

โš ๏ธ Disclaimer: independent tool, not affiliated with Reactome. Only publicly available data collected.

You might also like

KEGG Pathways Scraper

parseforge/kegg-pathways-scraper

Export biological pathway data from KEGG, the reference encyclopedia of genes, proteins, and metabolic networks. List, fetch, or search entries from pathway, module, KO, genome, compound, glycan, reaction, enzyme, drug, and disease databases.

iNaturalist Species & Observations Scraper

crawlerbros/inaturalist-species-observations

Scrape iNaturalist - the world's largest nature observation platform with 200M+ wildlife sightings. Search observations by species name, browse taxa, filter by location and date, find common species in a region. Completely free, no API key required.

GBIF Scraper

crawlergang/gbif-scraper

Scrape GBIF (Global Biodiversity Information Facility), 2.7B+ biodiversity occurrence records and 10M+ species. Search species taxonomy, browse occurrence records, explore contributing datasets, or fetch species details with vernacular names.

2

5.0

GBIF Scraper

crawlerbros/gbif-scraper

Scrape GBIF (Global Biodiversity Information Facility), 2.7B+ biodiversity occurrence records and 10M+ species. Search species taxonomy, browse occurrence records, explore contributing datasets, or fetch species details with vernacular names.

Realtor Agents Scraper With Email

realtorscraper/realtor-agents-scraper-with-email

Access agent email contacts through three distinct identification pathways. Using Profile ID, Fulfillment ID, or NRDS ID, you can instantly retrieve verified email addresses and professional contact information.

๐Ÿ‘ User avatar

Realtor Scraper

112

GBIF Biodiversity Data Scraper

parseforge/gbif-biodiversity-scraper

Export species records and occurrence data from GBIF, the Global Biodiversity Information Facility. 2.5B+ occurrences across every species on Earth. Search by scientific name or fetch occurrences by location, taxon, or collector. Pull taxonomy, coordinates, and dataset metadata.

Ensembl Gene Lookup Scraper

parseforge/ensembl-gene-lookup-scraper

Resolve human gene symbols against the Ensembl REST API to fetch stable gene identifiers, chromosome location, strand, biotype, and description. Useful for variant annotation, RNA seq pipelines, and gene set enrichment workflows that need clean Ensembl mappings from a list of HGNC symbols.