VOOZH about

URL: https://apify.com/parseforge/paleobiology-fossils-scraper

โ‡ฑ Paleobiology Database Fossils Scraper ยท Apify


๐Ÿ‘ Paleobiology Database Fossils Scraper avatar

Paleobiology Database Fossils Scraper

Pricing

from $6.00 / 1,000 results

Go to Apify Store

Paleobiology Database Fossils Scraper

Search the Paleobiology Database by taxon name and pull every fossil occurrence beneath it. Returns taxon, rank, occurrence and collection IDs, geologic interval, early and late age in millions of years, country, coordinates, and formation. Filter by interval or country.

Pricing

from $6.00 / 1,000 results

Rating

0.0

(0)

Developer

๐Ÿ‘ ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

24 days ago

Last modified

Share

๐Ÿ‘ ParseForge Banner

๐Ÿฆด Paleobiology Database Fossils Scraper

๐Ÿš€ Pull fossil occurrence records in seconds. Search any taxon and get back where it was found, how old it is, and which collection it came from, straight from the Paleobiology Database (PBDB).

๐Ÿ•’ Last updated: 2026-06-05 ยท ๐Ÿ“Š 24 fields per record ยท Global coverage ยท Keyless public data service

The Paleobiology Database is a public, scientist-curated archive of the fossil record, holding millions of occurrence records contributed by paleontologists worldwide. This Actor queries the PBDB data service by taxonomic name and returns clean, structured fossil occurrence records ready for analysis.

Coverage: Every taxonomic level is searchable, from a single genus like Tyrannosaurus up to an entire class like Mammalia. Each occurrence carries its taxon identification, geologic age in millions of years, country and coordinates, and the rock formation it was recovered from. Filter by geologic interval (period, epoch, or age) and by country to focus a query.

๐ŸŽฏ Target Audience๐Ÿ’ก Primary Use Cases
Paleontologists and geologistsBuild occurrence datasets for a taxon or clade
Researchers and studentsMap fossil distribution across time and place
Data scientists and educatorsFeed biodiversity and macroevolution models
Museum and collection staffCross-reference specimens with literature

๐Ÿ“‹ What the Paleobiology Database Fossils Scraper does

It calls the PBDB occurrences endpoint for a taxon name you provide and walks the returned records, returning one row per fossil occurrence. For every occurrence it captures the taxon name and rank, the occurrence and collection identifiers, the geologic interval and early/late age in millions of years (Ma), the reference identifier, the country, latitude and longitude, and the rock formation. Optional interval and country filters narrow the search.

๐ŸŽฌ Full Demo (๐Ÿšง Coming soon)

โš™๏ธ Input

FieldTypeRequiredDescription
baseNamestringNo (defaults to Tyrannosaurus)Taxonomic name to search. Hierarchical, so a genus, family, or class returns every occurrence beneath it.
intervalstringNoNamed geologic interval such as Cretaceous, Maastrichtian, or Miocene.
countryselectNoRestrict to one country by ISO code (for example US, CA, MN).
maxItemsintegerNoFree users limited to 10. Paid users up to 1,000,000.

Example 1 โ€” all Tyrannosaurus occurrences:

{
"baseName":"Tyrannosaurus",
"maxItems":50
}

Example 2 โ€” Cretaceous dinosaurs from the United States:

{
"baseName":"Dinosauria",
"interval":"Cretaceous",
"country":"US",
"maxItems":200
}

โš ๏ธ Good to Know: PBDB queries are hierarchical. A broad name like Mammalia or Trilobita can match tens of thousands of occurrences, so set maxItems to keep runs focused. Geographic fields such as formation, group, member, and county are filled only when the original collection recorded them, so they may be empty on some occurrences.

๐Ÿ“Š Output

๐Ÿท FieldDescription
๐Ÿฆด taxonNameAccepted taxonomic name of the occurrence
๐Ÿท taxonRankRank of the name (species, genus, family, and so on)
๐Ÿ”Ž identifiedNameName as originally identified, when it differs
๐Ÿ†” occurrenceIdPBDB occurrence identifier
๐Ÿ—‚ collectionIdPBDB collection identifier
๐Ÿงฌ taxonIdPBDB taxon identifier
๐Ÿงซ phylumPhylum classification
๐Ÿพ classClass classification
๐Ÿ“š orderOrder classification
๐Ÿ‘ช familyFamily classification
๐Ÿฆ• genusGenus classification
โณ earlyIntervalEarliest geologic interval for the occurrence
โŒ› lateIntervalLatest geologic interval, when bounded
โฌ† earlyAgeMaEarly age boundary in millions of years
โฌ‡ lateAgeMaLate age boundary in millions of years
๐ŸŒ countryCountry code where the fossil was found
๐Ÿ—บ stateState or province
๐Ÿ˜ countyCounty, when recorded
๐Ÿ“ latLatitude in decimal degrees
๐Ÿ“ lngLongitude in decimal degrees
๐Ÿชจ formationGeologic formation
๐Ÿ—ป geologicGroupGeologic group, when recorded
๐Ÿงฑ memberGeologic member, when recorded
๐Ÿ“– referenceIdPBDB bibliographic reference identifier
๐Ÿ•’ scrapedAtTimestamp the record was collected

Real sample records:

[
{
"taxonName":"Tyrannosaurus rex",
"taxonRank":"species",
"occurrenceId":"occ:139292",
"collectionId":"col:11917",
"phylum":"Chordata",
"class":"Reptilia",
"family":"Tyrannosauridae",
"earlyInterval":"Late Maastrichtian",
"earlyAgeMa":72.2,
"lateAgeMa":66,
"country":"CA",
"state":"Alberta",
"formation":"Scollard",
"lat":51.906399,
"lng":-113.0289,
"referenceId":"ref:4218"
},
{
"taxonName":"Tyrannosaurus rex",
"taxonRank":"species",
"occurrenceId":"occ:139293",
"collectionId":"col:11918",
"phylum":"Chordata",
"class":"Reptilia",
"family":"Tyrannosauridae",
"earlyInterval":"Late Maastrichtian",
"earlyAgeMa":72.2,
"lateAgeMa":66,
"country":"CA",
"state":"Alberta",
"formation":"Scollard",
"lat":51.933334,
"lng":-113.23333,
"referenceId":"ref:4205"
},
{
"taxonName":"Tyrannosaurus rex",
"taxonRank":"species",
"occurrenceId":"occ:220009",
"collectionId":"col:22657",
"phylum":"Chordata",
"class":"Reptilia",
"family":"Tyrannosauridae",
"earlyInterval":"Late Campanian",
"earlyAgeMa":83.6,
"lateAgeMa":72.2,
"country":"CA",
"state":"Alberta",
"formation":"Dinosaur Park",
"lat":50.727234,
"lng":-111.524582,
"referenceId":"ref:5721"
}
]

โœจ Why choose this Actor

  • Direct line to the PBDB data service, the reference archive for the fossil record.
  • One clean row per occurrence, with classification, age, location, and formation already separated into fields.
  • Hierarchical taxon search, so one query can pull a genus or an entire class.
  • Interval and country filters to scope a study without post-processing.
  • Ages returned numerically in millions of years, ready for plotting and modeling.

๐Ÿ“ˆ How it compares to alternatives

ApproachEffortStructured outputFilters
This ActorEnter a taxon nameYes, 24 fieldsInterval and country
Manual PBDB web downloadBuild query strings by handPartialManual
Copying from publicationsVery highNoNone

๐Ÿš€ How to use

  1. Create a free Apify account using this sign-up link.
  2. Open the Paleobiology Database Fossils Scraper.
  3. Enter a taxon name in baseName, for example Triceratops, Canis, or Trilobita.
  4. Optionally set an interval and a country, then choose maxItems.
  5. Run the Actor and collect your fossil occurrence dataset.

๐Ÿ’ผ Business use cases

Research and academia

NeedHow it helps
Build a taxon occurrence datasetPull every record for a clade in one run
Study diversity over timeAge fields support range and turnover analysis

Education

NeedHow it helps
Teach the fossil recordReal occurrences with ages and locations
Student projectsReady data for maps and timelines

Collections and museums

NeedHow it helps
Cross-reference holdingsMatch specimens to PBDB collections
Trace literatureReference identifiers link back to sources

Data and analytics

NeedHow it helps
Feed biodiversity modelsClean, typed fields per occurrence
Build dashboardsCoordinates and ages drive maps and charts

๐Ÿ”Œ Automating Paleobiology Database Fossils Scraper

Connect runs to Make, Zapier, Slack, Airbyte, GitHub Actions, or Google Drive through the Apify API and integrations to schedule queries and route fresh occurrence data wherever your team works.

๐ŸŒŸ Beyond business use cases

  • Research: assemble occurrence sets for macroevolution and biogeography studies.
  • Personal: explore where your favorite prehistoric animals once lived.
  • Non-profit: support science outreach and museum education programs.
  • Experimentation: prototype paleo data visualizations and maps.

๐Ÿค– Ask an AI assistant

Paste your dataset into ChatGPT, Claude, Perplexity, or Microsoft Copilot and ask it to summarize age ranges, cluster occurrences by formation, or draft a methods paragraph.

โ“ Frequently Asked Questions

Is the data official? Yes, it comes from the Paleobiology Database public data service, curated by paleontologists.

Do I need an API key? No. The PBDB data service is public and keyless.

What does baseName accept? Any taxonomic name. The search is hierarchical and includes everything below the name.

How do the age fields work? earlyAgeMa and lateAgeMa give the older and younger boundaries of the occurrence in millions of years.

Why are some location fields empty? Formation, group, member, and county appear only when the original collection recorded them.

Can I filter by time? Yes, set interval to a period, epoch, or age such as Jurassic or Maastrichtian.

Can I filter by place? Yes, set country to an ISO code such as US, CN, or MN.

How many records can I get? Free runs return up to 10 records. Paid plans return up to 1,000,000.

What if a taxon has no occurrences? The run finishes with no records and a note to broaden the search.

Can I run it on a schedule? Yes, use Apify scheduling or any connected automation platform.

๐Ÿ”Œ Integrate with any app

Use the Apify API, webhooks, and native integrations to push occurrence data into your own pipelines, databases, and notebooks.

๐Ÿ”— Recommended Actors

๐Ÿ’ก Pro Tip: browse the complete ParseForge collection.

๐Ÿ†˜ Need Help? Open our contact form

โš ๏ธ Disclaimer: independent tool, not affiliated with the Paleobiology Database. Only publicly available data is collected.

You might also like

GBIF Occurrence Search Scraper

parseforge/gbif-occurrence-search-scraper

Search 1.5B+ species occurrence records from the Global Biodiversity Information Facility. Filter by taxon, country, year, dataset, basis of record. Pulls coordinates, taxonomy, dates, recorder, IUCN status, license per record.

GBIF Species Occurrences Scraper

compute-edge/gbif-species-occurrences-scraper

Extract species occurrence records from GBIF, the Global Biodiversity Information Facility (2B+ records). Filter by scientific name, country, taxon, year, and basis of record. Returns taxonomy, coordinates, dates, dataset provenance, and collector metadata for biodiversity and ESG research.

GBIF Biodiversity Data Scraper

parseforge/gbif-biodiversity-scraper

Export species records and occurrence data from GBIF, the Global Biodiversity Information Facility. 2.5B+ occurrences across every species on Earth. Search by scientific name or fetch occurrences by location, taxon, or collector. Pull taxonomy, coordinates, and dataset metadata.

iNaturalist Observations Scraper

parseforge/inaturalist-observations-scraper

Pull citizen science wildlife sightings from iNaturalist. Filter by taxon name, place, and quality grade (research, needs ID, casual). Returns taxon, geojson location, observer, photos, identification count, captive flag, and license. Useful for biodiversity mapping and field studies.

GBIF Species Occurrence Scraper โ€” Biodiversity Records API

compute-edge/gbif-occurrences-scraper

Extract species occurrence records from GBIF (Global Biodiversity Information Facility). Filter by scientific name, country, date range, and basis of record. Outputs taxonomy, geolocation, and dataset attribution.

Macrostrat Geology Units Scraper

parseforge/macrostrat-geology-units-scraper

Pull the full geologic column for any latitude and longitude from Macrostrat. Returns unit and formation names, top and bottom ages in millions of years, lithology, depositional environment, mapped thickness, and column id. Search by point or stratigraphic name for research and mapping.

iNaturalist Scraper

parseforge/inaturalist-scraper

Scrape wildlife observations from iNaturalist. Get species identifications, photos, GPS coordinates, observer info, and quality grades for millions of sightings worldwide. Filter by taxon name, location, date range, quality grade, and photo availability. Ideal for biodiversity research.