URL: https://apify.com/parseforge/openaire-publications-scraper

OpenAIRE Publications Scraper · Apify


Pricing

from $7.50 / 1,000 results

OpenAIRE Publications Scraper

Search OpenAIRE open scholarship records by keyword and acceptance date. Returns title, publisher, resource type, year, DOI, language, full text URL, and repository name. Useful for open access discovery, bibliometrics, and research intelligence across European scholarly outputs.

Rating: 0.0 (0)

Developer: ParseForge (Maintained by Community)

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 24 days ago

📖 OpenAIRE Publications Scraper

🚀 Export European open-access publications from OpenAIRE in seconds. Titles, authors, publishers, DOIs, full-text URLs, and repositories, direct from the public OpenAIRE Graph API.

🕒 Last updated. 2026-06-05 · 📊 12 fields per record · tens of millions of open-access publications aggregated from European research repositories · Public API · No login required

The OpenAIRE Publications Scraper turns the OpenAIRE Graph API public endpoint into a clean, structured dataset. It queries the source live, normalizes the response into one row per record, and pushes the result into an Apify dataset you can download or pipe to your warehouse.

The source aggregates tens of millions of open-access publications from European research repositories. Each run returns up to maxItems matching records, with stable field names and null-safe parsing.

| 🎯 Target Audience | 💡 Primary Use Cases |
| --- | --- |
| 🔬 Researchers | Build open-access reading lists |
| 🎓 Universities | Audit national repository contributions |
| 📊 Bibliometricians | Track open-access uptake across Europe |
| 🤖 ML teams | Train scientific NLP on open corpora |

📋 What the OpenAIRE Publications Scraper does

  • Calls the public OpenAIRE Graph API endpoint with the parameters you supply.
  • Parses the response and flattens each record into a single dataset row.
  • Casts numeric fields to numbers where applicable for clean spreadsheet imports.
  • Surfaces rate-limit or upstream errors as a single-row error record instead of crashing.
  • Exports to every Apify dataset format supported in the UI.
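The flattening and error-record steps listed above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the actor's actual source: the raw record shape (keys that already match the output names, lists for multi-valued fields) and the helper names `flatten_record` and `error_row` are hypothetical.

```python
from datetime import datetime, timezone
from typing import Any, Dict, Optional

# Output columns as listed in this README; "error" is kept last.
FIELDS = ["title", "authors", "publisher", "type", "year", "doi", "oaftype",
          "language", "fulltextUrl", "repository", "scrapedAt", "error"]


def flatten_record(raw: Dict[str, Any], scraped_at: Optional[str] = None) -> Dict[str, Any]:
    """Map one raw record onto a flat row, using None for missing values."""
    # Pre-fill every column so the key order is fixed and gaps become None.
    row: Dict[str, Any] = {name: None for name in FIELDS}
    for name in FIELDS:
        value = raw.get(name)  # assumption: raw keys match the output names
        if isinstance(value, list):
            # Join multi-valued fields (for example several authors) into one cell.
            value = "; ".join(str(v) for v in value if v is not None) or None
        if value is not None:
            row[name] = value
    # The timestamp is set locally rather than read from the raw record.
    row["scrapedAt"] = scraped_at or datetime.now(timezone.utc).isoformat()
    return row


def error_row(message: str, scraped_at: Optional[str] = None) -> Dict[str, Any]:
    """Build the single-row error record used instead of raising."""
    row = flatten_record({}, scraped_at)
    row["error"] = message
    return row
```

Pre-filling every column is what keeps the row shape identical for complete records, sparse records, and error records alike.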

💡 Why it matters. The raw OpenAIRE Graph API response is great for API consumers but awkward for spreadsheets and BI tools. This actor normalizes the shape so the data drops straight into pandas, BigQuery, or a Google Sheet.

🎬 Full Demo

🚧 Coming soon.

⚙️ Input

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| query | string | No | OpenAIRE keywords search. Example. climate change adaptation. |
| fromDateAccepted | string | No | Filter by acceptance date (YYYY-MM-DD). Leave empty for no lower bound. |
| maxItems | integer | No | Free users. 10. Paid users. up to 1,000,000. Prefill. 10. |

Example 1.

{
  "query": "climate change adaptation",
  "fromDateAccepted": "2024-01-01",
  "maxItems": 10
}

Example 2.

{
  "query": "climate change adaptation",
  "fromDateAccepted": "2024-01-01",
  "maxItems": 50
}
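Before submitting an input object, it can help to check it against the constraints in the input table. The sketch below is one way to do that locally. The function name `build_input`, the lower bound of 1 for maxItems, and the decision to reject malformed dates client-side are assumptions for illustration; the actor's own validation may differ.

```python
import re
from datetime import date
from typing import Any, Dict, Optional

PAID_TIER_MAX_ITEMS = 1_000_000  # upper limit stated in the input table


def build_input(query: Optional[str] = None,
                from_date_accepted: Optional[str] = None,
                max_items: int = 10) -> Dict[str, Any]:
    """Return an input object, raising ValueError on malformed values."""
    run_input: Dict[str, Any] = {}
    if query:
        run_input["query"] = query.strip()
    if from_date_accepted:
        # Require the exact YYYY-MM-DD layout, then check it is a real date.
        if not re.fullmatch(r"\d{4}-\d{2}-\d{2}", from_date_accepted):
            raise ValueError("fromDateAccepted must look like YYYY-MM-DD")
        date.fromisoformat(from_date_accepted)  # raises ValueError for e.g. 2024-02-30
        run_input["fromDateAccepted"] = from_date_accepted
    if not 1 <= max_items <= PAID_TIER_MAX_ITEMS:
        raise ValueError("maxItems must be between 1 and 1,000,000")
    run_input["maxItems"] = max_items
    return run_input
```

Omitted optional fields are left out of the object entirely, which matches the "leave empty for no lower bound" behaviour described for fromDateAccepted.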

⚠️ Good to Know. This actor calls the public OpenAIRE Graph API endpoint with no authentication required. Upstream rate limits apply; if the source returns a limit notice, you will see it as a single error record in your dataset.

📊 Output

Each record is a flat object. error is always last.

| Field | Type | Description |
| --- | --- | --- |
| 🔹 title | string | Field from the OpenAIRE Graph API response. |
| 🔹 authors | string | Field from the OpenAIRE Graph API response. |
| 🔹 publisher | string | Field from the OpenAIRE Graph API response. |
| 🔹 type | string | Field from the OpenAIRE Graph API response. |
| 🔹 year | string | Field from the OpenAIRE Graph API response. |
| 🔹 doi | string | Field from the OpenAIRE Graph API response. |
| 🔹 oaftype | string | Field from the OpenAIRE Graph API response. |
| 🔹 language | string | Field from the OpenAIRE Graph API response. |
| 🔹 fulltextUrl | string | Field from the OpenAIRE Graph API response. |
| 🔹 repository | string | Field from the OpenAIRE Graph API response. |
| 🔹 scrapedAt | string | Field from the OpenAIRE Graph API response. |
| 🔹 error | string | Set if the upstream response was an error or rate-limit. |

Sample record.

{
"title":"sample_title",
"authors":"sample_authors",
"publisher":"sample_publisher",
"type":"sample_type",
"year":"sample_year",
"doi":"sample_doi",
"oaftype":"sample_oaftype",
"language":"sample_language",
"fulltextUrl":"sample_fulltextUrl",
"repository":"sample_repository",
"scrapedAt":"sample_scrapedAt",
"error":null
}
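Because upstream failures arrive as ordinary rows with a populated error field, downstream code usually wants to separate them before analysis. The following is a small sketch of that step over already-downloaded dataset items. The helper names `split_items` and `year_as_int` are hypothetical, and parsing year as an integer assumes the string holds a plain four-digit year, which the source does not confirm.

```python
from typing import Any, Dict, Iterable, List, Optional, Tuple

Record = Dict[str, Any]


def split_items(items: Iterable[Record]) -> Tuple[List[Record], List[Record]]:
    """Separate usable rows from error records."""
    rows: List[Record] = []
    errors: List[Record] = []
    for item in items:
        # A non-empty "error" value marks an upstream error or rate-limit notice.
        (errors if item.get("error") else rows).append(item)
    return rows, errors


def year_as_int(item: Record) -> Optional[int]:
    """Parse the year column, returning None when it is missing or malformed."""
    try:
        return int(item.get("year"))
    except (TypeError, ValueError):
        return None


items = [
    {"title": "sample_title", "year": "2021", "error": None},
    {"title": None, "year": None, "error": "rate limit notice"},
]
rows, errors = split_items(items)
print(len(rows), len(errors), year_as_int(rows[0]))
```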

✨ Why choose this Actor

| 🆓 | Works with the public OpenAIRE Graph API endpoint. No API key, no signup. |
| 🧹 | Clean field names, ready for BI tools. |
| 🔢 | Numeric strings cast to real numbers where it makes sense. |
| 🛟 | Upstream errors and rate limits surface as a clean error record. |
| 🔌 | One-click export to every Apify dataset format. |
| 💾 | Push to dataset, then pipe to BigQuery, Snowflake, Postgres, or Google Sheets. |

📈 How it compares to alternatives

| Approach | Setup time | Clean shape | Pagination | Error handling |
| --- | --- | --- | --- | --- |
| Roll your own fetch | 30 min + | ❌ | manual | manual |
| Copy-paste from the browser | 5 min, fragile | ❌ | ❌ | ❌ |
| This Actor | 5 sec, no install | ✅ | ✅ | ✅ |

🚀 How to use

  1. Click Try for free.
  2. Fill in the input (or leave defaults).
  3. Click Start.
  4. Within seconds, the dataset is ready for download or integration.
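The same steps can also be run outside the UI. The sketch below uses only the Python standard library against the Apify REST API's synchronous run endpoint, which starts a run and returns its dataset items in one response. The actor ID is taken from this page's URL with the slash written as a tilde; the function names, the `APIFY_TOKEN` environment variable, and the timeout value are assumptions, and long runs may exceed what a synchronous call allows.

```python
import json
import urllib.request
from typing import Any, Dict, List

# Actor ID from this page's URL, in the username~actor-name form used by the API.
ACTOR_ID = "parseforge~openaire-publications-scraper"
RUN_SYNC_URL = f"https://api.apify.com/v2/acts/{ACTOR_ID}/run-sync-get-dataset-items"


def build_request(run_input: Dict[str, Any], token: str) -> urllib.request.Request:
    """Prepare the POST request without sending it."""
    return urllib.request.Request(
        RUN_SYNC_URL,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def run_actor(run_input: Dict[str, Any], token: str, timeout: float = 300.0) -> List[Dict[str, Any]]:
    """Start a run, wait for it to finish, and return the dataset items."""
    with urllib.request.urlopen(build_request(run_input, token), timeout=timeout) as resp:
        return json.load(resp)
```

A call such as `run_actor({"query": "climate change adaptation", "maxItems": 10}, os.environ["APIFY_TOKEN"])` would then return the rows as a list of dictionaries, error record included if the source reported one.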

💼 Business use cases

📊 Analytics. Pipe records into your warehouse and join against internal data for cross-source dashboards.

🤖 Automation. Trigger this actor on a schedule, then push results to Slack, Airtable, or Google Sheets.

🧪 Research. Snapshot the public state of OpenAIRE Graph API on a date and archive it for reproducible studies.

📰 Editorial. Verify quotes, numbers, or records cited in stories with a one-click fresh pull.

🔌 Automating OpenAIRE Publications Scraper

  • Make / Zapier. Trigger this actor on a schedule, push results to Slack, Airtable, Google Sheets, or anywhere else.
  • Cron schedule. Use the native Apify scheduler to run on any cadence.
  • Webhooks. Get a POST to your endpoint the moment a run finishes.
  • Pipe to BigQuery / Snowflake / Postgres. Native Apify integrations move datasets straight into your warehouse.
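For the webhook option, the receiving endpoint typically needs to find the finished run's dataset and fetch its items. The sketch below shows that lookup, assuming the webhook uses Apify's default payload template (an `eventType` string plus a `resource` object describing the run). The function name `dataset_items_url` is hypothetical, and a custom payload template would need different keys.

```python
from typing import Any, Dict, Optional
from urllib.parse import urlencode


def dataset_items_url(payload: Dict[str, Any], fmt: str = "json") -> Optional[str]:
    """Return the download URL for a finished run's dataset, or None."""
    # Only successful runs are expected to have a complete dataset.
    if payload.get("eventType") != "ACTOR.RUN.SUCCEEDED":
        return None
    dataset_id = (payload.get("resource") or {}).get("defaultDatasetId")
    if not dataset_id:
        return None
    query = urlencode({"format": fmt})
    return f"https://api.apify.com/v2/datasets/{dataset_id}/items?{query}"


example_payload = {
    "eventType": "ACTOR.RUN.SUCCEEDED",
    "resource": {"defaultDatasetId": "abc123"},  # placeholder ID for illustration
}
print(dataset_items_url(example_payload, fmt="csv"))
```

Returning None for anything other than a successful run keeps failed or aborted runs from triggering a download of a partial dataset.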

🌟 Beyond business use cases

🎓 Education. Build classroom datasets without paying for a commercial feed.

🧪 Personal research. Track changes in the source over time on your own schedule.

🤝 Non-profit and open data. Build public dashboards without writing client code.

🧰 Tinkering and prototyping. Wire up a fresh data feed in seconds to test a new chart or model.

🤖 Ask an AI assistant about this scraper

Pop this README into ChatGPT, Claude, or any AI assistant and ask it to map your specific workflow to the actor's inputs. The schema, examples, and field list above contain everything an LLM needs to design a working pipeline.

❓ Frequently Asked Questions

❓ Do I need an API key? No. This actor calls the public OpenAIRE Graph API endpoint with no authentication required.

❓ Is there a rate limit? The upstream source may rate-limit aggressive use. If you hit a limit, the actor pushes a single error record rather than crashing.

❓ Which formats can I download? Every format Apify's dataset UI supports.

❓ Are values cast to numbers? Where the source returns numeric strings for numeric fields, yes.

❓ How do you handle upstream errors? A single record with a populated error field is pushed, then the actor exits cleanly.

❓ Can I schedule runs? Yes. Use Apify's native scheduler, Make, Zapier, or cron.

❓ Is this scraping or API? API. The OpenAIRE Graph API endpoint is fully public; this actor only normalizes the response.

❓ Will the schema change? Core fields are stable. Optional fields surface as null when the source omits them.

❓ How fresh is the data? Each run hits the live endpoint, so the data is as fresh as the source allows.

❓ Can I filter the output? Yes. The input fields above let you narrow the result set before it lands in your dataset.

🔌 Integrate with any app

Apify ships native integrations with Make, Zapier, Slack, Discord, Google Drive, Google Sheets, Gmail, Airbyte, Keboola, Telegram, GitHub, and any REST API or webhook endpoint. Trigger runs from a calendar event, a form submission, a cron job, or pipe results straight into BigQuery, Snowflake, or a Postgres warehouse.

🔗 Recommended Actors

| Actor | What it does |
| --- | --- |
| ParseForge OurAirports Scraper | Global airport database. |
| ParseForge Alpha Vantage Scraper | Stocks, FX, crypto, and indicators. |
| ParseForge CurseForge Mods Scraper | Public mod metadata from CurseForge. |
| ParseForge NBA Stats Scraper | Player and team stats from NBA.com. |

💡 Pro Tip. Browse the complete ParseForge collection for 900+ production-grade scrapers across business intelligence, real estate, e-commerce, sports, finance, and public records.


Disclaimer. This actor scrapes only publicly available data. ParseForge is not affiliated with, endorsed by, or sponsored by any of the third-party services referenced. Users are responsible for complying with the target site's terms of service and applicable law.

Create a free account with $5 credit.
