Figshare Research Data Scraper

Pricing

from $8.24 / 1,000 result items

Figshare Research Data Scraper

Export open research data from Figshare. 6M+ datasets, papers, figures, posters, and code from universities and publishers worldwide. Search by keyword, item type, institution, or category. Pull titles, authors, DOIs, download counts, licenses, file metadata, and citations.

Pricing

from $8.24 / 1,000 result items

Rating

0.0

(0)

Developer

👁 ParseForge

ParseForge

Maintained by Community

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

🚀 Figshare Research Data Scraper

🚀 Export 6M+ research datasets, papers, figures from Figshare. Filter by keyword, type, institution, category.

🕒 Last updated: 2026-04-24 · 📊 11+ fields per record · 🔍 5 filters · 🚫 No auth required

Export open research data from Figshare. 6M+ datasets, papers, figures, posters, and code from universities and publishers worldwide. Search by keyword, item type, institution, or category.

Pull titles, authors, DOIs, download counts.

📋 What the Figshare Research Data Scraper does

🎯 Targeted filtering. Use the input schema to narrow results to what you need.
📦 Structured output. Clean, typed records with every field documented.
🔄 Live data. Every run fetches fresh data at runtime, no cached responses.
🔌 Easy integration. Consume via Apify API, webhooks, or direct dataset export.
📊 Scale on demand. Run once or run on a schedule, the same way.

💡 Why it matters: teams that rely on this source no longer need to babysit a custom crawler. Set up your filters once, get updated data on demand.

⚙️ Input

Send a JSON body with any of the documented input fields. All fields are optional unless the schema marks them required.

Field	Type	Name	Description
`maxItems`	integer	Max Items	Free users: Limited to 10 items (preview). Paid users: Optional, max 1,000,000
`search`	string	Search Query	Free text search across titles and descriptions.
`itemType`	integer	Item Type	Filter by Figshare item type. 1=Figure, 2=Media, 3=Dataset, 4=Fileset, 5=Poster, 6=Paper, 7=Presentation, 8=Thesis, 9=Code, 10=Metadata, 11=Preprint, 13=Book, 14=Chapter.
`categoryId`	integer	Category ID	Figshare category ID. See https://docs.figshare.com/#private_list_categories.
`institutionId`	integer	Institution ID	Restrict results to an institutional repository.
`publishedSince`	string	Published Since (YYYY-MM-DD)	Only include items published on or after this date.

⚠️ Good to Know: free users are limited to 10 items per run for preview purposes. Upgrade to Apify paid plans for higher limits.

📊 Output

The dataset returns one structured record per item. Each record includes identifiers, descriptive fields, and a link back to the source. Consume the dataset as JSON, CSV, Excel, XML, or RSS via the Apify console or API.

💼 Business use cases

📊 Analysts and researchers

Build longitudinal datasets for trend analysis
Benchmark across sources and regions
Feed BI tools and custom dashboards
Enrich existing pipelines with fresh data

🛠️ Engineers and operators

Power internal APIs without building your own crawler
Schedule weekly deltas to a database
Plug into existing ETL stacks via Apify webhooks
Skip the infra work, get clean structured output

🎯 Growth and sales teams

Discover new leads and accounts at scale
Monitor competitor coverage and positioning
Build outbound lists keyed to real signals
Prioritize outreach with structured context

🧪 Product and data teams

Prototype features against live data
A/B test ranking or matching logic
Train or evaluate domain-specific models
Validate hypotheses before committing engineering

🌟 Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

🎓 Research and academia

Empirical datasets for papers, thesis work, and coursework
Longitudinal studies tracking changes across snapshots
Reproducible research with cited, versioned data pulls
Classroom exercises on data analysis and ethical scraping

🎨 Personal and creative

Side projects, portfolio demos, and indie app launches
Data visualizations, dashboards, and infographics
Content research for bloggers, YouTubers, and podcasters
Hobbyist collections and personal trackers

🤝 Non-profit and civic

Transparency reporting and accountability projects
Advocacy campaigns backed by public-interest data
Community-run databases for local issues
Investigative journalism on public records

🧪 Experimentation

Prototype AI and machine-learning pipelines with real data
Validate product-market hypotheses before engineering spend
Train small domain-specific models on niche corpora
Test dashboard concepts with live input

✨ Why choose this Actor

	Capability
🎯	Built for the job. Scoped specifically to this data source so you skip the parser engineering entirely.
🔖	Structured output. Clean, typed fields ready for analysis, dashboards, or downstream pipelines.
⚡	Fast. Optimized request patterns return results in seconds, not minutes.
🔁	Always fresh. Every run pulls live data, so the dataset reflects the source as of run time.
🌐	No infra to manage. Apify handles proxies, retries, scaling, scheduling, and storage.
🛡️	Reliable. Battle-tested across many runs and edge cases, with graceful error handling.
🚫	No code required. Configure in the UI, run from CLI, schedule via cron, or call from any language with the Apify SDK.

📊 Production-grade structured data without the engineering overhead of building and maintaining your own scraper.

📈 How it compares to alternatives

Approach	Cost	Coverage	Refresh	Filters	Setup
⭐ Figshare Research Data Scraper (this Actor)	$5 free credit, then pay-per-use	Full source coverage	Live per run	Source-native filters supported	⚡ 2 min
Build your own scraper	Engineering hours	Full once built	Whenever you maintain it	Custom code	🐢 Days to weeks
Paid managed APIs	$$$ monthly	Vendor-defined	Live	Vendor-defined	⏳ Hours
Third-party data dumps	Varies	Subset, often stale	Periodic	None	🕒 Variable

Pick this Actor when you want broad coverage, server-side filtering, and no pipeline maintenance.

🚀 How to use

📝 Create a free account. Sign up at console.apify.com to get $5 in credits.
🔍 Open the actor. Paste your filters into the input schema in the Apify console.
▶️ Click Start. Wait a few seconds for the first records to land.
📤 Export the data. Download JSON/CSV or pipe to webhooks, Google Sheets, or Zapier.
🔄 Schedule it. Apify Schedules let you rerun on a cron cadence for free.

⏱️ Total time to first data: about 60 seconds.

🤖 Ask an AI assistant about this scraper

Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:

❓ Frequently Asked Questions

🔍 What does the Figshare Research Data do?

🛠️ How do I get started?

Open the actor in Apify, fill in the input fields, and click Start. The dataset appears on your run page within seconds.

💰 How much does it cost?

Free Apify users can run the actor and preview up to 10 records. Paid plans remove the preview cap. See the Apify pricing page for details.

📅 How fresh is the data?

Every run scrapes live from the source at runtime. No cached responses, no pre-loaded dumps. You get the snapshot visible to the source when the actor starts.

🗂️ What filters are supported?

The input schema exposes search, itemType, categoryId, institutionId, publishedSince. Combine them to narrow results. If a filter is empty, the default ordering from the source is used.

🔐 Do I need an API key, account, or authentication?

No. The actor runs against public endpoints using Apify residential proxies. You just need your Apify account to launch the run.

🧾 What fields are returned per record?

Each record includes the primary identifiers, descriptive fields, URLs to the source page, and any structured data the source exposes. Exact fields depend on the source and are documented in the output schema.

⚡ How fast is a run?

Most runs return a first batch of records within a minute. Throughput depends on source rate limits and the number of filters stacked, not on Apify.

📤 Can I export the dataset?

Yes. Apify exposes the dataset as JSON, CSV, XML, Excel, or RSS via the UI or API. You can also stream new records into webhooks, Google Sheets, Airtable, and more.

🧭 Can I schedule recurring runs?

Yes. Apify Schedules let you run this actor on a cron cadence and deliver fresh data to your destination. No extra code is required.

🛡️ Is scraping this source legal for commercial use?

This actor only retrieves publicly available information. You are responsible for complying with the source website terms and any applicable privacy and competition rules in your jurisdiction.

🤝 What if a run fails or returns fewer items than expected?

Open the run log for the exact error. Most failures come from source rate limits or filter combinations with no matches. Retry with a broader filter or contact support via the Tally form below.

🔌 Integrate with any app

Connect the Figshare Research Data Scraper to cloud services via Apify integrations:

Make - visual automation builder
Zapier - 5000+ app connectors
Google Sheets - pipe rows directly
Airbyte - ingest into data warehouses
Slack - receive run alerts
HTTP webhooks - custom downstream

🔗 Recommended Actors

Pair the Figshare Research Data Scraper with related actors:

🌐 Website Content Crawler - crawl any page at scale
🔍 Google Search Scraper - harvest SERPs
📄 Article Extractor - extract clean article text
📊 Google Trends Scraper - capture demand signals
📸 Screenshot URL - render any page to image

💡 Pro Tip: browse the complete ParseForge collection for more niche actors.

🆘 Need Help? Open our contact form

⚠️ Disclaimer: This actor retrieves data from publicly available sources. You are responsible for complying with the source website's terms of service and applicable laws in your jurisdiction. ParseForge is not affiliated with the data source.

👁 Figshare Research Articles Scraper avatar

Figshare Research Articles Scraper

parseforge/figshare-articles-scraper

Search Figshare for shared research articles, datasets, posters, theses, and code. Filter by item type and free text query to retrieve article IDs, DOIs, titles, authors, descriptions, license info, and publication dates. Useful for scholarly discovery and open research tracking.

👁 User avatar

ParseForge

👁 Figshare Scraper avatar

Figshare Scraper

crawlerbros/figshare-scraper

This actor extracts metadata and content information from Figshare, one of the world's largest open research data repositories. It supports full-text keyword search, direct article ID lookup, and institution-specific article browsing across all Figshare content types.

👁 User avatar

Crawler Bros

👁 Zenodo Research Repository Scraper avatar

Zenodo Research Repository Scraper

parseforge/zenodo-scraper

Export records from Zenodo, CERN's open research data repository. 5M+ datasets, publications, software, posters, and presentations with DOIs. Search by keyword, community, creator, resource type, or license. Pull titles, authors, abstracts, files, DOIs, and download counts.

👁 User avatar

ParseForge

👁 CORE Open Research Scraper avatar

CORE Open Research Scraper

crawlerbros/core-open-research-scraper

Search millions of open-access research papers from CORE - the world's largest aggregator of open access research. Search by topic, author, or institution, or browse recent papers. Returns title, abstract, authors, DOI, download URL, and more. No API key required.

👁 User avatar

Crawler Bros

arXiv Papers Scraper Pro — Research Papers, Authors, Citations

diverse_venture/arxiv-papers-scraper

Search and scrape arXiv research papers. Returns titles, abstracts, authors, categories, DOIs, and PDF download links. Filter by keywords (cat:cs.LG, all:transformer, au:author_name). Up to 500 papers per run. No auth required. Ideal for AI researchers and academic data mining.

👁 User avatar

Chak Man Fung

Academic Research MCP — Papers, DOIs & Citations

saturday/academic-research-mcp

MCP server + scraper for AI research agents: search 400M+ papers across Crossref, OpenAlex & arXiv, fetch DOIs, and trace both references and forward citations - plus author metrics. Built for literature review.

👁 User avatar

Josh Compton

arXiv Paper Scraper - AI ML Research Papers

openclawmara/arxiv-paper-scraper

Scrape arXiv research papers by keyword, category, or author. Extracts titles, abstracts, authors, citations, and metadata. Perfect for AI/ML research monitoring, literature reviews, and LLM training data collection.

👁 User avatar

OpenClaw Mara

👁 CKAN Open Data Exporter: Government Datasets & Files avatar

CKAN Open Data Exporter: Government Datasets & Files

doggo/ckan-opendata-exporter

Search and download open data from any CKAN portal: data.gov, data.gov.uk, the EU Data Portal, and thousands of national and city catalogs. Find datasets by keyword and export their metadata and file-download links, or pull data rows, to Excel, CSV, or JSON.

👁 User avatar

Doggo

👁 Academic Research & Papers Scraper (OpenAlex) avatar

Academic Research & Papers Scraper (OpenAlex)

rupom888/academic-research-scraper

Search 200M+ academic papers, researchers, and institutions via OpenAlex API. Completely free, no API key needed. Get paper titles, abstracts, DOIs, citations, authors, open access links, and concepts. Filter by year, paper type, open access, and field of study.

👁 User avatar

Syed Rupom

Crossref Scraper - DOI, Citations, Academic Papers

gio21/crossref-scraper

Search and fetch academic article metadata (DOIs, authors, citations, journals) from the Crossref REST API. No key required.

👁 User avatar

Gio

URL: https://apify.com/parseforge/figshare-scraper