Pricing
Pay per event
FDA Orange Book Scraper
Search public FDA Orange Book / Drugs@FDA records by brand, generic, ingredient, sponsor, or application number for pharma research.
Pricing
Pay per event
Rating
0.0
(0)
Developer
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share
Export public FDA Orange Book and Drugs@FDA application records by brand name, generic name, active ingredient, sponsor, or application number.
Use this scraper when you need repeatable, structured FDA drug approval data for regulatory research, portfolio monitoring, generic-drug analysis, or internal pharma intelligence workflows.
What does FDA Orange Book Scraper do?
FDA Orange Book Scraper queries the public openFDA Drugs@FDA API and saves normalized application-level records to an Apify dataset.
It turns FDA application JSON into export-ready rows with application numbers, sponsor names, product summaries, active ingredients, dosage forms, routes, strengths, marketing statuses, submissions, and openFDA identifiers.
The actor is API-first, so it does not need a browser, login, cookies, or a private FDA account.
Who is it for?
- π§ͺ Regulatory affairs teams checking approved drug applications.
- π Generic-drug portfolio analysts comparing brand and ingredient coverage.
- βοΈ Pharma IP and market-access teams building patent-cliff research datasets.
- π Competitive-intelligence teams monitoring sponsors and application families.
- π¬ Healthcare data teams joining FDA application records with internal databases.
Why use it?
- It provides a simple Apify interface around openFDA Drugs@FDA search.
- It supports buyer-friendly inputs instead of requiring users to remember API field names.
- It saves one normalized dataset row per application record.
- It includes nested products and submissions for downstream auditing.
- It can include the raw openFDA record when your compliance workflow needs source evidence.
Data source
The actor uses:
https://api.fda.gov/drug/drugsfda.json
This is a public FDA/openFDA endpoint.
No FDA API token is required for normal use.
What data can you extract?
| Field | Description |
|---|---|
applicationNumber | NDA, ANDA, or BLA application number from Drugs@FDA. |
sponsorName | Application sponsor / applicant. |
brandNames | Brand names found in openFDA and product data. |
genericNames | Generic names from openFDA. |
activeIngredients | Active ingredient names from product records. |
dosageForms | Dosage forms across products. |
routes | Administration routes. |
strengths | Product strengths. |
marketingStatuses | Product marketing statuses where provided. |
products | Nested product summaries. |
submissions | Nested submission summaries. |
openfda | Original openFDA identifiers and classification fields. |
patentDataAvailable | Whether patent records were available from the source. |
exclusivityDataAvailable | Whether exclusivity records were available from the source. |
Search modes
You can search by:
- Brand name.
- Generic name.
- Active ingredient.
- Sponsor / applicant.
- Exact application number.
- Raw openFDA query syntax.
Input example
{"queries":["aspirin",{"term":"ibuprofen","field":"ingredient"},{"term":"PFIZER","field":"sponsor"}],"applicationNumbers":["NDA020639"],"searchField":"brand","maxItems":100,"includeRawRecord":false}
Output example
{"searchTerm":"aspirin","searchField":"brand","applicationNumber":"NDA020639","sponsorName":"BAYER HEALTHCARE LLC","brandNames":["ASPIRIN"],"activeIngredients":["ASPIRIN"],"dosageForms":["TABLET"],"routes":["ORAL"],"products":[],"submissions":[],"patentDataAvailable":false,"exclusivityDataAvailable":false}
How much does it cost to scrape FDA Orange Book data?
This actor uses pay-per-event pricing.
- A small start fee is charged once per run.
- A per-record fee is charged for each FDA application record saved.
- Your final cost depends on the number of matching FDA application records and your Apify plan tier.
For most targeted application-number or brand-name lookups, runs are small and inexpensive.
How to run it
- Open the actor on Apify.
- Add one or more search terms.
- Choose the default search field.
- Optionally add exact application numbers.
- Set
maxItemsto cap the export size. - Start the run.
- Download the dataset as JSON, CSV, Excel, or via API.
Tips for best results
- Use exact application numbers when you know them.
- Use
ingredientfor portfolio research by active ingredient. - Use
sponsorfor applicant-level monitoring. - Use
rawonly when you already know openFDA query syntax. - Keep
maxItemslow for quick smoke tests. - Enable
includeRawRecordfor compliance audits or custom transformations.
Patent and exclusivity fields
The dataset includes patent and exclusivity compatibility fields.
In this version, the reliable public API source is openFDA Drugs@FDA. If patent or exclusivity data is not present in that source, the actor sets:
patentDataAvailable: falsepatents: []exclusivityDataAvailable: falseexclusivities: []
This makes downstream schemas stable while avoiding unreliable scraping of blocked FDA download pages.
Integrations
You can connect the dataset to:
- Google Sheets for regulatory watchlists.
- Snowflake or BigQuery for pharma analytics.
- CRM enrichment pipelines for sponsor intelligence.
- Internal dashboards that monitor generic-entry opportunities.
- Apify webhooks for scheduled portfolio updates.
API usage with Node.js
import{ ApifyClient }from'apify-client';const client =newApifyClient({token: process.env.APIFY_TOKEN});const run =await client.actor('automation-lab/fda-orange-book-scraper').call({queries:['aspirin'],searchField:'brand',maxItems:100});console.log(run.defaultDatasetId);
API usage with Python
from apify_client import ApifyClientclient = ApifyClient('YOUR_APIFY_TOKEN')run = client.actor('automation-lab/fda-orange-book-scraper').call(run_input={'queries':['aspirin'],'searchField':'brand','maxItems':100,})print(run['defaultDatasetId'])
API usage with cURL
curl-X POST 'https://api.apify.com/v2/acts/automation-lab~fda-orange-book-scraper/runs?token=YOUR_APIFY_TOKEN'\-H'Content-Type: application/json'\-d'{"queries":["aspirin"],"searchField":"brand","maxItems":100}'
MCP integration
Use Apify MCP to call this scraper from Claude Desktop, Claude Code, or other MCP clients.
MCP URL:
https://mcp.apify.com/?tools=automation-lab/fda-orange-book-scraper
Claude Code setup:
$claude mcp add apify-fda-orange-book "https://mcp.apify.com/?tools=automation-lab/fda-orange-book-scraper"
Claude Desktop JSON config:
{"mcpServers":{"apify-fda-orange-book":{"url":"https://mcp.apify.com/?tools=automation-lab/fda-orange-book-scraper"}}}
Example prompts:
- "Export FDA Orange Book records for ibuprofen and summarize the sponsors."
- "Find Drugs@FDA applications for sponsor PFIZER and group by active ingredient."
- "Run an application-number lookup for NDA020639 and return the product strengths."
Scheduling
For monitoring workflows, schedule the actor daily, weekly, or monthly.
Common schedules include:
- Weekly sponsor monitoring.
- Monthly ingredient portfolio exports.
- Quarterly regulatory database refreshes.
Data quality notes
The actor reports the data returned by openFDA Drugs@FDA.
It does not provide medical advice.
Always verify regulatory decisions against official FDA systems and primary records.
Legality and responsible use
This actor uses public FDA/openFDA data.
You are responsible for how you use exported data, including compliance with your organizationβs regulatory, medical, and legal review processes.
FAQ and troubleshooting
Why did my search return no rows?
Try a different search mode. For example, use ingredient for active ingredients and application_number for NDA/ANDA/BLA identifiers.
Why are patent arrays empty?
The MVP uses the reliable openFDA Drugs@FDA API. Patent/exclusivity download pages may be unavailable or blocked from automated environments, so the actor marks those fields unavailable when the source does not provide them.
How do I get the original FDA JSON?
Set includeRawRecord to true.
Related scrapers
Other Automation Lab actors that can support healthcare and regulatory workflows:
- https://apify.com/automation-lab/fda-warning-letters-scraper
- https://apify.com/automation-lab/clinicaltrials-gov-scraper
- https://apify.com/automation-lab/healthcare-contact-finder
Changelog
Initial version:
- Public openFDA Drugs@FDA search.
- Brand, generic, ingredient, sponsor, application-number, and raw query modes.
- Application, product, submission, and openFDA identifier fields.
Support
If you need a missing field, include an example application number and describe the workflow you are trying to automate.
Final note
FDA Orange Book Scraper is designed for practical, repeatable exports, not one-off manual lookups.
Use it whenever your team needs FDA drug application data in a dataset, scheduled job, or API pipeline.
