VOOZH about

URL: https://apify.com/parseforge/figshare-scraper

โ‡ฑ Figshare Research Data Scraper ยท Apify


Pricing

from $8.24 / 1,000 result items

Go to Apify Store

Figshare Research Data Scraper

Export open research data from Figshare. 6M+ datasets, papers, figures, posters, and code from universities and publishers worldwide. Search by keyword, item type, institution, or category. Pull titles, authors, DOIs, download counts, licenses, file metadata, and citations.

Pricing

from $8.24 / 1,000 result items

Rating

0.0

(0)

Developer

๐Ÿ‘ ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a month ago

Last modified

Share

๐Ÿ‘ ParseForge Banner

๐Ÿš€ Figshare Research Data Scraper

๐Ÿš€ Export 6M+ research datasets, papers, figures from Figshare. Filter by keyword, type, institution, category.

๐Ÿ•’ Last updated: 2026-04-24 ยท ๐Ÿ“Š 11+ fields per record ยท ๐Ÿ” 5 filters ยท ๐Ÿšซ No auth required

Export open research data from Figshare. 6M+ datasets, papers, figures, posters, and code from universities and publishers worldwide. Search by keyword, item type, institution, or category.

Pull titles, authors, DOIs, download counts.


๐Ÿ“‹ What the Figshare Research Data Scraper does

  • ๐ŸŽฏ Targeted filtering. Use the input schema to narrow results to what you need.
  • ๐Ÿ“ฆ Structured output. Clean, typed records with every field documented.
  • ๐Ÿ”„ Live data. Every run fetches fresh data at runtime, no cached responses.
  • ๐Ÿ”Œ Easy integration. Consume via Apify API, webhooks, or direct dataset export.
  • ๐Ÿ“Š Scale on demand. Run once or run on a schedule, the same way.

๐Ÿ’ก Why it matters: teams that rely on this source no longer need to babysit a custom crawler. Set up your filters once, get updated data on demand.


โš™๏ธ Input

Send a JSON body with any of the documented input fields. All fields are optional unless the schema marks them required.

FieldTypeNameDescription
maxItemsintegerMax ItemsFree users: Limited to 10 items (preview). Paid users: Optional, max 1,000,000
searchstringSearch QueryFree text search across titles and descriptions.
itemTypeintegerItem TypeFilter by Figshare item type. 1=Figure, 2=Media, 3=Dataset, 4=Fileset, 5=Poster, 6=Paper, 7=Presentation, 8=Thesis, 9=Code, 10=Metadata, 11=Preprint, 13=Book, 14=Chapter.
categoryIdintegerCategory IDFigshare category ID. See https://docs.figshare.com/#private_list_categories.
institutionIdintegerInstitution IDRestrict results to an institutional repository.
publishedSincestringPublished Since (YYYY-MM-DD)Only include items published on or after this date.

โš ๏ธ Good to Know: free users are limited to 10 items per run for preview purposes. Upgrade to Apify paid plans for higher limits.


๐Ÿ“Š Output

The dataset returns one structured record per item. Each record includes identifiers, descriptive fields, and a link back to the source. Consume the dataset as JSON, CSV, Excel, XML, or RSS via the Apify console or API.


๐Ÿ’ผ Business use cases

๐Ÿ“Š Analysts and researchers

  • Build longitudinal datasets for trend analysis
  • Benchmark across sources and regions
  • Feed BI tools and custom dashboards
  • Enrich existing pipelines with fresh data

๐Ÿ› ๏ธ Engineers and operators

  • Power internal APIs without building your own crawler
  • Schedule weekly deltas to a database
  • Plug into existing ETL stacks via Apify webhooks
  • Skip the infra work, get clean structured output

๐ŸŽฏ Growth and sales teams

  • Discover new leads and accounts at scale
  • Monitor competitor coverage and positioning
  • Build outbound lists keyed to real signals
  • Prioritize outreach with structured context

๐Ÿงช Product and data teams

  • Prototype features against live data
  • A/B test ranking or matching logic
  • Train or evaluate domain-specific models
  • Validate hypotheses before committing engineering

๐ŸŒŸ Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

๐ŸŽ“ Research and academia

  • Empirical datasets for papers, thesis work, and coursework
  • Longitudinal studies tracking changes across snapshots
  • Reproducible research with cited, versioned data pulls
  • Classroom exercises on data analysis and ethical scraping

๐ŸŽจ Personal and creative

  • Side projects, portfolio demos, and indie app launches
  • Data visualizations, dashboards, and infographics
  • Content research for bloggers, YouTubers, and podcasters
  • Hobbyist collections and personal trackers

๐Ÿค Non-profit and civic

  • Transparency reporting and accountability projects
  • Advocacy campaigns backed by public-interest data
  • Community-run databases for local issues
  • Investigative journalism on public records

๐Ÿงช Experimentation

  • Prototype AI and machine-learning pipelines with real data
  • Validate product-market hypotheses before engineering spend
  • Train small domain-specific models on niche corpora
  • Test dashboard concepts with live input

โœจ Why choose this Actor

Capability
๐ŸŽฏBuilt for the job. Scoped specifically to this data source so you skip the parser engineering entirely.
๐Ÿ”–Structured output. Clean, typed fields ready for analysis, dashboards, or downstream pipelines.
โšกFast. Optimized request patterns return results in seconds, not minutes.
๐Ÿ”Always fresh. Every run pulls live data, so the dataset reflects the source as of run time.
๐ŸŒNo infra to manage. Apify handles proxies, retries, scaling, scheduling, and storage.
๐Ÿ›ก๏ธReliable. Battle-tested across many runs and edge cases, with graceful error handling.
๐ŸšซNo code required. Configure in the UI, run from CLI, schedule via cron, or call from any language with the Apify SDK.

๐Ÿ“Š Production-grade structured data without the engineering overhead of building and maintaining your own scraper.


๐Ÿ“ˆ How it compares to alternatives

ApproachCostCoverageRefreshFiltersSetup
โญ Figshare Research Data Scraper (this Actor)$5 free credit, then pay-per-useFull source coverageLive per runSource-native filters supportedโšก 2 min
Build your own scraperEngineering hoursFull once builtWhenever you maintain itCustom code๐Ÿข Days to weeks
Paid managed APIs$$$ monthlyVendor-definedLiveVendor-definedโณ Hours
Third-party data dumpsVariesSubset, often stalePeriodicNone๐Ÿ•’ Variable

Pick this Actor when you want broad coverage, server-side filtering, and no pipeline maintenance.


๐Ÿš€ How to use

  1. ๐Ÿ“ Create a free account. Sign up at console.apify.com to get $5 in credits.
  2. ๐Ÿ” Open the actor. Paste your filters into the input schema in the Apify console.
  3. โ–ถ๏ธ Click Start. Wait a few seconds for the first records to land.
  4. ๐Ÿ“ค Export the data. Download JSON/CSV or pipe to webhooks, Google Sheets, or Zapier.
  5. ๐Ÿ”„ Schedule it. Apify Schedules let you rerun on a cron cadence for free.

โฑ๏ธ Total time to first data: about 60 seconds.


๐Ÿค– Ask an AI assistant about this scraper

Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:


โ“ Frequently Asked Questions

๐Ÿ” What does the Figshare Research Data do?

Export open research data from Figshare. 6M+ datasets, papers, figures, posters, and code from universities and publishers worldwide. Search by keyword, item type, institution, or category. Pull titles, authors, DOIs, download counts.. Pass your filters via the input schema and run the actor.

๐Ÿ› ๏ธ How do I get started?

Open the actor in Apify, fill in the input fields, and click Start. The dataset appears on your run page within seconds.

๐Ÿ’ฐ How much does it cost?

Free Apify users can run the actor and preview up to 10 records. Paid plans remove the preview cap. See the Apify pricing page for details.

๐Ÿ“… How fresh is the data?

Every run scrapes live from the source at runtime. No cached responses, no pre-loaded dumps. You get the snapshot visible to the source when the actor starts.

๐Ÿ—‚๏ธ What filters are supported?

The input schema exposes search, itemType, categoryId, institutionId, publishedSince. Combine them to narrow results. If a filter is empty, the default ordering from the source is used.

๐Ÿ” Do I need an API key, account, or authentication?

No. The actor runs against public endpoints using Apify residential proxies. You just need your Apify account to launch the run.

๐Ÿงพ What fields are returned per record?

Each record includes the primary identifiers, descriptive fields, URLs to the source page, and any structured data the source exposes. Exact fields depend on the source and are documented in the output schema.

โšก How fast is a run?

Most runs return a first batch of records within a minute. Throughput depends on source rate limits and the number of filters stacked, not on Apify.

๐Ÿ“ค Can I export the dataset?

Yes. Apify exposes the dataset as JSON, CSV, XML, Excel, or RSS via the UI or API. You can also stream new records into webhooks, Google Sheets, Airtable, and more.

๐Ÿงญ Can I schedule recurring runs?

Yes. Apify Schedules let you run this actor on a cron cadence and deliver fresh data to your destination. No extra code is required.

๐Ÿ›ก๏ธ Is scraping this source legal for commercial use?

This actor only retrieves publicly available information. You are responsible for complying with the source website terms and any applicable privacy and competition rules in your jurisdiction.

๐Ÿค What if a run fails or returns fewer items than expected?

Open the run log for the exact error. Most failures come from source rate limits or filter combinations with no matches. Retry with a broader filter or contact support via the Tally form below.


๐Ÿ”Œ Integrate with any app

Connect the Figshare Research Data Scraper to cloud services via Apify integrations:


๐Ÿ”— Recommended Actors

Pair the Figshare Research Data Scraper with related actors:

๐Ÿ’ก Pro Tip: browse the complete ParseForge collection for more niche actors.


๐Ÿ†˜ Need Help? Open our contact form


โš ๏ธ Disclaimer: This actor retrieves data from publicly available sources. You are responsible for complying with the source website's terms of service and applicable laws in your jurisdiction. ParseForge is not affiliated with the data source.

You might also like

Figshare Research Articles Scraper

parseforge/figshare-articles-scraper

Search Figshare for shared research articles, datasets, posters, theses, and code. Filter by item type and free text query to retrieve article IDs, DOIs, titles, authors, descriptions, license info, and publication dates. Useful for scholarly discovery and open research tracking.

Figshare Scraper

crawlerbros/figshare-scraper

This actor extracts metadata and content information from Figshare, one of the world's largest open research data repositories. It supports full-text keyword search, direct article ID lookup, and institution-specific article browsing across all Figshare content types.

Zenodo Research Repository Scraper

parseforge/zenodo-scraper

Export records from Zenodo, CERN's open research data repository. 5M+ datasets, publications, software, posters, and presentations with DOIs. Search by keyword, community, creator, resource type, or license. Pull titles, authors, abstracts, files, DOIs, and download counts.

CORE Open Research Scraper

crawlerbros/core-open-research-scraper

Search millions of open-access research papers from CORE - the world's largest aggregator of open access research. Search by topic, author, or institution, or browse recent papers. Returns title, abstract, authors, DOI, download URL, and more. No API key required.

CKAN Open Data Exporter: Government Datasets & Files

doggo/ckan-opendata-exporter

Search and download open data from any CKAN portal: data.gov, data.gov.uk, the EU Data Portal, and thousands of national and city catalogs. Find datasets by keyword and export their metadata and file-download links, or pull data rows, to Excel, CSV, or JSON.

Academic Research & Papers Scraper (OpenAlex)

rupom888/academic-research-scraper

Search 200M+ academic papers, researchers, and institutions via OpenAlex API. Completely free, no API key needed. Get paper titles, abstracts, DOIs, citations, authors, open access links, and concepts. Filter by year, paper type, open access, and field of study.