VOOZH about

URL: https://apify.com/jungle_synthesizer/fishbase-species-database-api-scraper

โ‡ฑ FishBase Species Database Scraper ยท Apify


Pricing

Pay per event

Go to Apify Store

FishBase Species Database Scraper

Scrapes FishBase โ€” 35,000+ fish species with taxonomy, ecology, size/weight, trophic level, IUCN status, game-fish flag, depth range, and common names.

Pricing

Pay per event

Rating

0.0

(0)

Developer

๐Ÿ‘ BowTiedRaccoon

BowTiedRaccoon

Maintained by Community

Actor stats

0

Bookmarked

3

Total users

0

Monthly active users

a day ago

Last modified

Share

Scrape FishBase, the global reference database for fish species, and extract structured records for 35,000+ species covering taxonomy, ecology, size and weight, trophic level, IUCN Red List status, depth range, and common names. Returns clean JSON per species pulled from the public summary pages.


FishBase Scraper Features

  • Indexes 35,000+ fish species โ€” every validated entry in the FishBase catalog
  • Extracts full taxonomy: scientific name, genus, species, family, order
  • Returns ecology fields: environment, climate zone, distribution, depth range
  • Captures size and life-history stats: max length, common length, max weight, max age
  • Pulls trophic level, IUCN Red List status, game-fish flag, and dangerousness classification
  • Includes primary English common name plus a source URL back to the FishBase summary page
  • No API key, no proxy, no captcha โ€” just respectful crawling at the site's robots.txt rate
  • Pure HTTP scraping. Hits the ValidNameList index page once, then walks /summary/{ID} for each species

Who Uses FishBase Data?

  • Marine biologists and ichthyologists โ€” Pull species-level data into research notebooks without scraping each summary page by hand
  • Fisheries management agencies โ€” Cross-reference IUCN status and trophic level across stocks for assessment models
  • AI training datasets โ€” Build species classification or natural-language Q&A datasets grounded in the canonical reference
  • Aquarium and reef hobbyists โ€” Generate compatibility data driven by climate zone, depth range, and dangerousness classification
  • Game-fishing apps โ€” Filter on is_game_fish and species distribution to power location-aware species guides
  • EdTech and museum apps โ€” Populate species cards with verified taxonomy and conservation data

How the FishBase Scraper Works

  1. Set maxItems โ€” Pick a sample size or leave it at 0 to crawl all 35,000+ species
  2. The scraper fetches the ValidNameList index โ€” A single page that lists every species ID and scientific name on the site
  3. It walks one summary page per species at the polite rate FishBase requests in robots.txt
  4. Returns one normalized JSON record per species with taxonomy, ecology, size, and conservation fields

The scraper respects the 10-second crawl-delay and uses a single connection. That means full-catalog runs take a while โ€” but you only need to do it once, and the result is a complete snapshot of FishBase's public data.


Input

{
"maxItems":100
}
FieldTypeDefaultDescription
maxItemsinteger10Maximum number of species to scrape. Leave at 0 for the full 35,000+ catalog.

FishBase Scraper Output Fields

{
"scientific_name":"Carcharodon carcharias",
"genus":"Carcharodon",
"species":"carcharias",
"common_name":"Great white shark",
"family":"Lamnidae",
"order":"Lamniformes",
"environment":"Marine; brackish; pelagic-oceanic; oceanodromous",
"climate_zone":"Subtropical",
"distribution":"Cosmopolitan in all tropical and temperate seas. Most common in the southern oceans.",
"max_length_cm":"720.0",
"common_length_cm":"488.0",
"max_weight_g":"3324000",
"max_age_years":"73",
"trophic_level":"4.5",
"iucn_status":"Vulnerable",
"is_game_fish":"yes",
"is_dangerous":"Traumatogenic",
"depth_range_m":"0 - 1280",
"source_url":"https://www.fishbase.se/summary/751",
"scraped_at":"2026-05-27T03:14:22.000Z"
}
FieldTypeDescription
scientific_namestringFull binomial scientific name (Genus species).
genusstringGenus name.
speciesstringSpecies epithet.
common_namestringPrimary English common name.
familystringTaxonomic family.
orderstringTaxonomic order.
environmentstringHabitat descriptors, semicolon-separated (e.g. Marine; brackish; benthopelagic).
climate_zonestringClimate zone (Tropical, Temperate, Polar, Boreal, Subtropical, Deep-water).
distributionstringGeographic distribution text.
max_length_cmstringMaximum recorded length in cm.
common_length_cmstringCommon or typical length in cm.
max_weight_gstringMaximum recorded weight in grams.
max_age_yearsstringMaximum recorded age in years.
trophic_levelstringTrophic level (e.g. 3.4).
iucn_statusstringIUCN Red List status (Least Concern, Vulnerable, Endangered, etc.).
is_game_fishstringyes or no โ€” whether listed as a game/sport fish.
is_dangerousstringDanger classification (Harmless, Traumatogenic, Venomous, etc.).
depth_range_mstringDepth range in meters (e.g. 0 - 364).
source_urlstringSource URL on FishBase.
scraped_atstringISO 8601 timestamp of the scrape.

FAQ

How do I scrape FishBase species data?

FishBase Scraper fetches the ValidNameList.php index, parses every species ID and scientific name from the embedded JSON, then walks one detail page per species. No API key, no proxy, no anti-bot to clear โ€” just polite scraping at the rate the site asks for.

How many species can I get?

FishBase Scraper covers 35,000+ validated species โ€” every entry in the public FishBase catalog. Set maxItems to 0 to pull the full snapshot, or pass a number to sample.

How much does the FishBase Scraper cost to run?

FishBase Scraper is priced per record returned via the pay-per-event model. The full catalog is large enough that most users only run it once and cache the result.

Does this need proxies?

FishBase Scraper does not need proxies. FishBase serves public data without anti-bot protection โ€” datacenter IPs work fine. The scraper observes the site's 10-second crawl-delay and runs single-threaded out of respect for a small academic resource.

Why is a full-catalog run slow?

FishBase Scraper crawls one species at a time with a 10-second delay between requests, per the site's robots.txt. That's not a limit you can tune โ€” it's the rate the site asks for. The good news is the database is fairly stable, so a single full run gives you usable data for months.


Need More Features?

Need additional FishBase fields (genetic data, food items, predator lists), region-filtered crawls, or incremental updates? File an issue or get in touch.

Why Use the FishBase Scraper?

  • Comprehensive coverage โ€” All 35,000+ validated species, every taxonomic rank, every ecology field FishBase exposes
  • Clean schema โ€” Normalized field names and consistent types across taxonomy, size, and conservation data. Drop it straight into a Pandas frame.
  • Polite by default โ€” Respects the robots.txt crawl-delay so the source site stays healthy. Useful when you'd rather not have a community resource throttle you.

You might also like

IGFA World Record Fish Scraper

jungle_synthesizer/igfa-world-record-fish-scraper

Scrape all IGFA world record fish data. Returns current, pending, and vacant records across all categories โ€” all-tackle, line-class, fly-rod, and junior โ€” with species, angler, weight, location, and status.

๐Ÿ‘ User avatar

BowTiedRaccoon

2

GBIF Scraper

crawlergang/gbif-scraper

Scrape GBIF (Global Biodiversity Information Facility), 2.7B+ biodiversity occurrence records and 10M+ species. Search species taxonomy, browse occurrence records, explore contributing datasets, or fetch species details with vernacular names.

2

5.0

GBIF Scraper

crawlerbros/gbif-scraper

Scrape GBIF (Global Biodiversity Information Facility), 2.7B+ biodiversity occurrence records and 10M+ species. Search species taxonomy, browse occurrence records, explore contributing datasets, or fetch species details with vernacular names.

iNaturalist Species & Observations Scraper

crawlerbros/inaturalist-species-observations

Scrape iNaturalist - the world's largest nature observation platform with 200M+ wildlife sightings. Search observations by species name, browse taxa, filter by location and date, find common species in a region. Completely free, no API key required.