URL: https://apify.com/jungle_synthesizer/ema-medicines-authorisation-scraper

EMA Medicines Scraper - European Drug Authorisation Register

Pricing: Pay per event

Extract EU drug authorisation data from the European Medicines Agency (EMA) register. Covers human and veterinary medicines, with fields for active substance, ATC code, marketing authorisation holder (MAH), authorisation status, orphan/biosimilar/generic flags, and product URLs. Filter by category, status, therapeutic area, or ATC code.

Rating: 0.0 (0)

Developer: BowTiedRaccoon (maintained by Community)

Actor stats: 0 bookmarked, 3 total users, 1 monthly active user, last modified 3 days ago

Extract the complete European Medicines Agency (EMA) centralised medicines authorisation register. Covers all human and veterinary medicines that have received, or applied for, a centralised EU marketing authorisation.

Data source: the XLSX bulk export published at ema.europa.eu, which the EMA updates every night.


What data does it extract?

Each record corresponds to one medicine and includes:

| Field | Description |
| --- | --- |
| medicine_name | Brand name |
| category | Human or Veterinary |
| ema_product_number | EMA product number (e.g. EMEA/H/C/004781) |
| authorisation_status | Authorised, Withdrawn, Refused, or Suspended |
| inn | International Non-proprietary Name / common name |
| active_substance | Active substance(s) |
| therapeutic_area | Therapeutic area (MeSH terms) |
| atc_code | ATC code (human) or ATCvet code (veterinary) |
| pharmacotherapeutic_group | Pharmacotherapeutic group |
| marketing_authorisation_holder | MAH company name |
| first_authorised_date | First EU marketing authorisation date (DD/MM/YYYY) |
| orphan_designation | Orphan medicine designation flag |
| biosimilar | Biosimilar flag |
| generic_or_hybrid | Generic or hybrid application flag |
| conditional_marketing_authorisation | Conditional approval flag |
| additional_monitoring | Additional monitoring (black triangle) flag |
| accelerated_assessment | Accelerated assessment flag |
| exceptional_circumstances | Exceptional circumstances flag |
| product_url | EMA product page URL |
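
For downstream code it can help to pin the record shape down as a type. The sketch below mirrors the fields listed above in a Python `TypedDict`. The boolean type for the flag fields is inferred from the sample output record, and `MedicineRecord` and `missing_fields` are illustrative names, not part of the actor.

```python
from typing import TypedDict


class MedicineRecord(TypedDict):
    """One dataset item, following the field table (types are assumptions)."""
    medicine_name: str
    category: str
    ema_product_number: str
    authorisation_status: str
    inn: str
    active_substance: str
    therapeutic_area: str
    atc_code: str
    pharmacotherapeutic_group: str
    marketing_authorisation_holder: str
    first_authorised_date: str  # DD/MM/YYYY, kept as text
    orphan_designation: bool
    biosimilar: bool
    generic_or_hybrid: bool
    conditional_marketing_authorisation: bool
    additional_monitoring: bool
    accelerated_assessment: bool
    exceptional_circumstances: bool
    product_url: str


def missing_fields(record: dict) -> list:
    """Return the expected field names that are absent from a record."""
    return [name for name in MedicineRecord.__annotations__ if name not in record]
```

Calling `missing_fields(item)` on each dataset item is a cheap way to notice if the export layout changes between runs.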

Input options

| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| medicineCategory | String | human | Filter: human, veterinary, or leave blank for all |
| authorisationStatus | String | Authorised | Filter: Authorised, Withdrawn, Refused, Suspended, or blank for all |
| therapeuticArea | String | (blank) | Filter by therapeutic area substring, case-insensitive (e.g. Diabetes) |
| atcCode | String | (blank) | Filter by ATC code prefix (e.g. L01 for antineoplastics) |
| authorisationDateFrom | String | (blank) | Include only medicines authorised on or after this date (YYYY-MM-DD or DD/MM/YYYY) |
| authorisationDateTo | String | (blank) | Include only medicines authorised on or before this date (YYYY-MM-DD or DD/MM/YYYY) |
| maxItems | Integer | 15 | Maximum number of records to return (0 = all) |
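
The filter semantics described above can be sketched in a few lines of Python. This is not the actor's own code: the function names are hypothetical, category and status are compared case-insensitively as an assumption (the input uses `human` while records show `Human`), records without a date are assumed to be excluded once a date bound is set, and missing option keys are treated as blank apart from `maxItems`.

```python
from datetime import date, datetime
from typing import Iterable, Optional


def parse_date(value: str) -> Optional[date]:
    """Accept YYYY-MM-DD or DD/MM/YYYY; return None for blank input."""
    value = (value or "").strip()
    if not value:
        return None
    for fmt in ("%Y-%m-%d", "%d/%m/%Y"):
        try:
            return datetime.strptime(value, fmt).date()
        except ValueError:
            continue
    raise ValueError(f"Unrecognised date: {value!r}")


def matches(record: dict, options: dict) -> bool:
    """Check one record against every non-blank filter option."""
    category = options.get("medicineCategory", "")
    if category and record["category"].lower() != category.lower():
        return False
    status = options.get("authorisationStatus", "")
    if status and record["authorisation_status"].lower() != status.lower():
        return False
    area = options.get("therapeuticArea", "")
    if area and area.lower() not in record["therapeutic_area"].lower():
        return False  # substring match, case-insensitive
    atc = options.get("atcCode", "")
    if atc and not record["atc_code"].upper().startswith(atc.upper()):
        return False  # prefix match
    date_from = parse_date(options.get("authorisationDateFrom", ""))
    date_to = parse_date(options.get("authorisationDateTo", ""))
    if date_from or date_to:
        authorised = parse_date(record.get("first_authorised_date", ""))
        if authorised is None:
            return False  # assumption: undated records fail a date filter
        if date_from and authorised < date_from:
            return False
        if date_to and authorised > date_to:
            return False
    return True


def apply_filters(records: Iterable[dict], options: dict) -> list:
    """Return matching records, stopping at maxItems (0 means no limit)."""
    max_items = options.get("maxItems", 15)
    selected = []
    for record in records:
        if matches(record, options):
            selected.append(record)
            if max_items and len(selected) >= max_items:
                break
    return selected
```

Parsing both sides into `date` objects matters because DD/MM/YYYY strings do not compare chronologically as text.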

How it works

The actor downloads EMA's nightly XLSX bulk export (approximately 885 KB, ~2,700 records) using a single HTTP request. No browser automation, no pagination, no proxy required. The XLSX is parsed in-memory using Node.js built-in modules, then filtered and saved to the Apify dataset.
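
To illustrate why no spreadsheet library is needed: an XLSX file is a ZIP archive of XML parts, so standard-library modules are enough to read it. The actor itself does this in Node.js; the sketch below swaps in Python's `zipfile` and `xml.etree` to show the same idea. The sheet path `xl/worksheets/sheet1.xml` and the helper names are assumptions, and the real export may place its header row further down the sheet.

```python
import io
import re
import zipfile
import xml.etree.ElementTree as ET

MAIN_NS = "http://schemas.openxmlformats.org/spreadsheetml/2006/main"
NS = {"m": MAIN_NS}
TEXT_TAG = "{%s}t" % MAIN_NS  # fully qualified <t> element name


def column_index(cell_ref: str) -> int:
    """Turn a cell reference such as 'C7' into a zero-based column index."""
    letters = re.match(r"[A-Z]+", cell_ref).group(0)
    index = 0
    for ch in letters:
        index = index * 26 + (ord(ch) - ord("A") + 1)
    return index - 1


def read_shared_strings(archive: zipfile.ZipFile) -> list:
    """Load the shared-string table, joining multi-run strings."""
    if "xl/sharedStrings.xml" not in archive.namelist():
        return []
    root = ET.fromstring(archive.read("xl/sharedStrings.xml"))
    return ["".join(t.text or "" for t in si.iter(TEXT_TAG))
            for si in root.findall("m:si", NS)]


def read_rows(xlsx_bytes: bytes,
              sheet_path: str = "xl/worksheets/sheet1.xml") -> list:
    """Parse one worksheet from in-memory XLSX bytes into lists of strings."""
    with zipfile.ZipFile(io.BytesIO(xlsx_bytes)) as archive:
        strings = read_shared_strings(archive)
        sheet = ET.fromstring(archive.read(sheet_path))
    rows = []
    for row in sheet.iterfind("m:sheetData/m:row", NS):
        values = []
        for cell in row.findall("m:c", NS):
            ref = cell.get("r")
            col = column_index(ref) if ref else len(values)
            # Empty cells are omitted from the XML, so pad up to this column.
            values.extend([""] * (col + 1 - len(values)))
            kind = cell.get("t")
            if kind == "inlineStr":
                text = "".join(t.text or "" for t in cell.iter(TEXT_TAG))
            else:
                v = cell.find("m:v", NS)
                text = (v.text or "") if v is not None else ""
                if kind == "s" and text:
                    text = strings[int(text)]  # index into shared strings
            values[col] = text
        rows.append(values)
    return rows
```

Dates and numbers come back as their raw stored text here; converting them is left out to keep the sketch short.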

Performance: Typically completes in under 10 seconds.

Memory: 256 MB is sufficient. The actor is configured for 512 MB to be safe.


Example run

Input:

{
  "medicineCategory": "human",
  "authorisationStatus": "Authorised",
  "atcCode": "L01",
  "maxItems": 5
}

Sample output record:

{
  "medicine_name": "Keytruda",
  "category": "Human",
  "ema_product_number": "EMEA/H/C/003820",
  "authorisation_status": "Authorised",
  "inn": "pembrolizumab",
  "active_substance": "pembrolizumab",
  "therapeutic_area": "Melanoma; Carcinoma, Non-Small-Cell Lung; ...",
  "atc_code": "L01FF02",
  "pharmacotherapeutic_group": "Antineoplastic agents, monoclonal antibodies",
  "marketing_authorisation_holder": "Merck Sharp & Dohme B.V.",
  "first_authorised_date": "17/07/2015",
  "orphan_designation": false,
  "biosimilar": false,
  "generic_or_hybrid": false,
  "conditional_marketing_authorisation": false,
  "additional_monitoring": true,
  "accelerated_assessment": false,
  "exceptional_circumstances": false,
  "product_url": "https://www.ema.europa.eu/en/medicines/human/EPAR/keytruda"
}
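
A run like the one above could also be started from code. The following is a minimal sketch using the `apify-client` Python package. It assumes the actor ID matches the path of this page's URL and that an API token is available in an `APIFY_TOKEN` environment variable; `build_run_input` and `fetch_records` are hypothetical helper names.

```python
import os

# Assumption: actor ID taken from the store URL of this page.
ACTOR_ID = "jungle_synthesizer/ema-medicines-authorisation-scraper"


def build_run_input(atc_code: str = "", max_items: int = 15) -> dict:
    """Build an input object using the parameter names from the input table."""
    return {
        "medicineCategory": "human",
        "authorisationStatus": "Authorised",
        "atcCode": atc_code,
        "maxItems": max_items,
    }


def fetch_records(token: str, run_input: dict) -> list:
    """Start the actor, wait for it to finish, and return its dataset items."""
    # Imported lazily so this module loads even without apify-client installed.
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


if __name__ == "__main__":
    token = os.environ.get("APIFY_TOKEN")
    if token:  # skip the network call when no token is configured
        for item in fetch_records(token, build_run_input("L01", 5)):
            print(item["medicine_name"], item["atc_code"])
```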

Use cases

  • Pharma intelligence: Monitor which medicines have EU authorisation and track MAH portfolios
  • Biotech business development: Identify orphan, biosimilar, or conditionally approved medicines
  • Regulatory consulting: Track EU status of medicines by active substance or therapeutic area
  • Academic research: Build datasets of authorised medicines by ATC code or indication
  • Generics manufacturers: Identify authorised generic/hybrid medicines
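
As a rough illustration of the first two use cases, the records can be grouped by marketing authorisation holder and tallied by flag. The function names and the choice of flags below are illustrative only, and the sketch assumes the flag fields are booleans as in the sample output record.

```python
from collections import Counter, defaultdict

# Flags of interest for business development screening (an arbitrary subset).
DEFAULT_FLAGS = (
    "orphan_designation",
    "biosimilar",
    "conditional_marketing_authorisation",
)


def portfolios(records) -> dict:
    """Map each MAH name to the list of its medicine names."""
    by_holder = defaultdict(list)
    for record in records:
        holder = record["marketing_authorisation_holder"]
        by_holder[holder].append(record["medicine_name"])
    return dict(by_holder)


def flag_counts(records, flags=DEFAULT_FLAGS) -> Counter:
    """Count how many records have each flag set to a truthy value."""
    return Counter(
        flag for record in records for flag in flags if record.get(flag)
    )
```

Both helpers accept any iterable of dataset items, so they can be applied directly to a list of records loaded from the actor's dataset.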

Notes

  • Data is updated nightly by the EMA. Each actor run downloads the latest version.
  • The dataset covers approximately 2,700 medicines in the centralised authorisation procedure. Nationally authorised medicines are not included.
  • Withdrawn medicines remain in the dataset with status Withdrawn.
