FDA Orange Book Scraper

Pricing

Pay per event

FDA Orange Book Scraper

Search public FDA Orange Book / Drugs@FDA records by brand, generic, ingredient, sponsor, or application number for pharma research.

Pricing

Pay per event

Rating

0.0

(0)

Developer

👁 Stas Persiianenko

Stas Persiianenko

Maintained by Community

Actor stats

Bookmarked

Total users

Monthly active users

2 days ago

Last modified

What does FDA Orange Book Scraper do?

FDA Orange Book Scraper queries the public openFDA Drugs@FDA API and saves normalized application-level records to an Apify dataset.

It turns FDA application JSON into export-ready rows with application numbers, sponsor names, product summaries, active ingredients, dosage forms, routes, strengths, marketing statuses, submissions, and openFDA identifiers.

The actor is API-first, so it does not need a browser, login, cookies, or a private FDA account.

Who is it for?

🧪 Regulatory affairs teams checking approved drug applications.
💊 Generic-drug portfolio analysts comparing brand and ingredient coverage.
⚖️ Pharma IP and market-access teams building patent-cliff research datasets.
📊 Competitive-intelligence teams monitoring sponsors and application families.
🔬 Healthcare data teams joining FDA application records with internal databases.

Why use it?

It provides a simple Apify interface around openFDA Drugs@FDA search.
It supports buyer-friendly inputs instead of requiring users to remember API field names.
It saves one normalized dataset row per application record.
It includes nested products and submissions for downstream auditing.
It can include the raw openFDA record when your compliance workflow needs source evidence.

Data source

The actor uses:

https://api.fda.gov/drug/drugsfda.json

This is a public FDA/openFDA endpoint.

No FDA API token is required for normal use.

What data can you extract?

Field	Description
`applicationNumber`	NDA, ANDA, or BLA application number from Drugs@FDA.
`sponsorName`	Application sponsor / applicant.
`brandNames`	Brand names found in openFDA and product data.
`genericNames`	Generic names from openFDA.
`activeIngredients`	Active ingredient names from product records.
`dosageForms`	Dosage forms across products.
`routes`	Administration routes.
`strengths`	Product strengths.
`marketingStatuses`	Product marketing statuses where provided.
`products`	Nested product summaries.
`submissions`	Nested submission summaries.
`openfda`	Original openFDA identifiers and classification fields.
`patentDataAvailable`	Whether patent records were available from the source.
`exclusivityDataAvailable`	Whether exclusivity records were available from the source.

Search modes

You can search by:

Brand name.
Generic name.
Active ingredient.
Sponsor / applicant.
Exact application number.
Raw openFDA query syntax.

Input example

{
"queries":[
"aspirin",
{"term":"ibuprofen","field":"ingredient"},
{"term":"PFIZER","field":"sponsor"}
],
"applicationNumbers":["NDA020639"],
"searchField":"brand",
"maxItems":100,
"includeRawRecord":false
}

Output example

{
"searchTerm":"aspirin",
"searchField":"brand",
"applicationNumber":"NDA020639",
"sponsorName":"BAYER HEALTHCARE LLC",
"brandNames":["ASPIRIN"],
"activeIngredients":["ASPIRIN"],
"dosageForms":["TABLET"],
"routes":["ORAL"],
"products":[],
"submissions":[],
"patentDataAvailable":false,
"exclusivityDataAvailable":false
}

How much does it cost to scrape FDA Orange Book data?

This actor uses pay-per-event pricing.

A small start fee is charged once per run.
A per-record fee is charged for each FDA application record saved.
Your final cost depends on the number of matching FDA application records and your Apify plan tier.

For most targeted application-number or brand-name lookups, runs are small and inexpensive.

How to run it

Open the actor on Apify.
Add one or more search terms.
Choose the default search field.
Optionally add exact application numbers.
Set maxItems to cap the export size.
Start the run.
Download the dataset as JSON, CSV, Excel, or via API.

Tips for best results

Use exact application numbers when you know them.
Use ingredient for portfolio research by active ingredient.
Use sponsor for applicant-level monitoring.
Use raw only when you already know openFDA query syntax.
Keep maxItems low for quick smoke tests.
Enable includeRawRecord for compliance audits or custom transformations.

Patent and exclusivity fields

The dataset includes patent and exclusivity compatibility fields.

In this version, the reliable public API source is openFDA Drugs@FDA. If patent or exclusivity data is not present in that source, the actor sets:

patentDataAvailable: false
patents: []
exclusivityDataAvailable: false
exclusivities: []

This makes downstream schemas stable while avoiding unreliable scraping of blocked FDA download pages.

Integrations

You can connect the dataset to:

Google Sheets for regulatory watchlists.
Snowflake or BigQuery for pharma analytics.
CRM enrichment pipelines for sponsor intelligence.
Internal dashboards that monitor generic-entry opportunities.
Apify webhooks for scheduled portfolio updates.

API usage with Node.js

import{ ApifyClient }from'apify-client';
const client =newApifyClient({token: process.env.APIFY_TOKEN});
const run =await client.actor('automation-lab/fda-orange-book-scraper').call({
queries:['aspirin'],
searchField:'brand',
maxItems:100
});
console.log(run.defaultDatasetId);

API usage with Python

from apify_client import ApifyClient
client = ApifyClient('YOUR_APIFY_TOKEN')
run = client.actor('automation-lab/fda-orange-book-scraper').call(run_input={
'queries':['aspirin'],
'searchField':'brand',
'maxItems':100,
})
print(run['defaultDatasetId'])

API usage with cURL

curl-X POST 'https://api.apify.com/v2/acts/automation-lab~fda-orange-book-scraper/runs?token=YOUR_APIFY_TOKEN'\
-H'Content-Type: application/json'\
-d'{"queries":["aspirin"],"searchField":"brand","maxItems":100}'

MCP integration

Use Apify MCP to call this scraper from Claude Desktop, Claude Code, or other MCP clients.

MCP URL:

https://mcp.apify.com/?tools=automation-lab/fda-orange-book-scraper

Claude Code setup:

$claude mcp add apify-fda-orange-book "https://mcp.apify.com/?tools=automation-lab/fda-orange-book-scraper"

Claude Desktop JSON config:

{
"mcpServers":{
"apify-fda-orange-book":{
"url":"https://mcp.apify.com/?tools=automation-lab/fda-orange-book-scraper"
}
}
}

Example prompts:

"Export FDA Orange Book records for ibuprofen and summarize the sponsors."
"Find Drugs@FDA applications for sponsor PFIZER and group by active ingredient."
"Run an application-number lookup for NDA020639 and return the product strengths."

Scheduling

For monitoring workflows, schedule the actor daily, weekly, or monthly.

Common schedules include:

Weekly sponsor monitoring.
Monthly ingredient portfolio exports.
Quarterly regulatory database refreshes.

Data quality notes

The actor reports the data returned by openFDA Drugs@FDA.

It does not provide medical advice.

Always verify regulatory decisions against official FDA systems and primary records.

Legality and responsible use

This actor uses public FDA/openFDA data.

You are responsible for how you use exported data, including compliance with your organization’s regulatory, medical, and legal review processes.

FAQ and troubleshooting

Why did my search return no rows?

Try a different search mode. For example, use ingredient for active ingredients and application_number for NDA/ANDA/BLA identifiers.

Why are patent arrays empty?

The MVP uses the reliable openFDA Drugs@FDA API. Patent/exclusivity download pages may be unavailable or blocked from automated environments, so the actor marks those fields unavailable when the source does not provide them.

How do I get the original FDA JSON?

Set includeRawRecord to true.

Related scrapers

Other Automation Lab actors that can support healthcare and regulatory workflows:

Changelog

Initial version:

Public openFDA Drugs@FDA search.
Brand, generic, ingredient, sponsor, application-number, and raw query modes.
Application, product, submission, and openFDA identifier fields.

Support

If you need a missing field, include an example application number and describe the workflow you are trying to automate.

Final note

FDA Orange Book Scraper is designed for practical, repeatable exports, not one-off manual lookups.

Use it whenever your team needs FDA drug application data in a dataset, scheduled job, or API pipeline.

👁 FDA Orange Book Scraper avatar

FDA Orange Book Scraper

parseforge/fda-orange-book-scraper

Scrape the FDA Orange Book of approved drug products with therapeutic equivalence evaluations. Get NDA, ingredient, dosage form, strength, route, applicant, marketing status, TE code, RLD/RS flags, exclusivity and patent data. Perfect for pharma research, payers, and generics intelligence.

👁 User avatar

ParseForge

👁 FDA Orange Book Scraper avatar

FDA Orange Book Scraper

labrat011/fda-orange-book-scraper

Extract FDA Orange Book data — drug patent expirations, exclusivity periods, generic equivalents, and therapeutic equivalence ratings. No API key required.

👁 User avatar

mick_

👁 FDA Warning Letters Scraper avatar

FDA Warning Letters Scraper

parseforge/fda-warning-letters-scraper

Scrape FDA Warning Letters with recipient company, issue date, issuing office, product type (drug, device, food, tobacco), subject, violation type, response letters and direct PDF URLs. Ideal for pharma compliance, MedTech regulatory teams, and life-sciences journalism.

👁 User avatar

ParseForge

👁 US Drug & Medical Reference Pro avatar

US Drug & Medical Reference Pro

parseforge/us-drug-med-reference-pro-scraper

Look up any US medication across FDA Orange Book, openFDA labels, FDA warning letters, MedlinePlus, Medscape and eMedicine in one search. Get brand names, ingredients, dosage forms, side effects, contraindications and warning letters.

👁 User avatar

ParseForge

Patent Expiry Monitor - FDA Drug Listing Expiry Tracker

intelscrape/patent-expiry-monitor

Fetches NDA-approved drug products from the FDA openFDA NDC API and returns their listing expiration dates, applicant, active ingredients, and drug metadata for pharmaceutical market intelligence.

👁 User avatar

IntelScrape

👁 🟠 FDA Orange Book — Drug Patent & Exclusivity Tracker avatar

🟠 FDA Orange Book — Drug Patent & Exclusivity Tracker

nexgendata/fda-orange-book-drug-patents

Search the FDA Orange Book: every approved drug product joined with its listed patents and exclusivity, plus a computed estimated generic-entry date. Built for generic manufacturers, pharma IP attorneys, and biotech investors tracking patent cliffs.

👁 User avatar

NexGenData

👁 Health Canada Drug Database Scraper | DPD Records avatar

Health Canada Drug Database Scraper | DPD Records

parseforge/health-canada-drug-database-scraper

Pull Health Canada Drug Product Database entries: DIN, brand name, company, active ingredients, route, dosage form, schedule, and status. Filter by status or company. Useful for pharma research, healthcare apps, and regulatory monitoring across Canada.

👁 User avatar

ParseForge

FDA Orange Book Patent & Exclusivity Tracker — Quarterly Diff

changewire/fda-orange-book-extraction

Purpose-built FDA Orange Book diff stream — NDA/ANDA/BLA patent listings, exclusivity grants, LoE + TE-code shifts as JSONL. FDA-vocabulary-aware vs general-purpose scrapers like Browse AI; per-record metered vs Cortellis $35-50k/yr seat tax for pharma BI + IP + generics. Public runs.

👁 User avatar

ChangeWire

👁 FDA Orange Book Patent Expirations Monitor avatar

FDA Orange Book Patent Expirations Monitor

oobr/fda-orange-book-patent-expirations

Monthly Orange Book extracts with patent expiration dates, applicant, active ingredient — for generics timing & IP diligence.

👁 User avatar

OOBR Team

👁 🟣 FDA Purple Book — Biologics & Biosimilars Tracker avatar

🟣 FDA Purple Book — Biologics & Biosimilars Tracker

nexgendata/fda-purple-book-biologics-biosimilars

Search the FDA Purple Book: licensed biologics, biosimilars, and interchangeables with reference-product linkage and exclusivity-expiry dates that signal when biosimilar competition can begin. For biosimilar developers, pharma investors, and IP/regulatory teams.

👁 User avatar

NexGenData

URL: https://apify.com/automation-lab/fda-orange-book-scraper

⇱ FDA Orange Book Scraper — Drug Application Data · Apify

FDA Orange Book Scraper

What does FDA Orange Book Scraper do?

Who is it for?

Why use it?

Data source

What data can you extract?

Search modes

Input example

Output example

How much does it cost to scrape FDA Orange Book data?

How to run it

Tips for best results

Patent and exclusivity fields

Integrations

API usage with Node.js

API usage with Python

API usage with cURL

MCP integration

Scheduling

Data quality notes

Legality and responsible use

FAQ and troubleshooting

Why did my search return no rows?

Why are patent arrays empty?

How do I get the original FDA JSON?

Related scrapers

Changelog

Support

Final note

You might also like

FDA Orange Book Scraper

FDA Orange Book Scraper

FDA Warning Letters Scraper

US Drug & Medical Reference Pro

Patent Expiry Monitor - FDA Drug Listing Expiry Tracker

🟠 FDA Orange Book — Drug Patent & Exclusivity Tracker

Health Canada Drug Database Scraper | DPD Records

FDA Orange Book Patent & Exclusivity Tracker — Quarterly Diff

FDA Orange Book Patent Expirations Monitor

🟣 FDA Purple Book — Biologics & Biosimilars Tracker