URL: https://apify.com/scrapeworks/arxiv-search

arXiv Search & Paper Scraper - Abstracts, Authors, PDF · Apify


Pricing

from $1.00 / 1,000 results

arXiv Search & Paper Scraper

Search arXiv and get clean structured JSON for each paper: title, authors, abstract, categories, DOI, PDF link, and dates. Built for research, datasets, and AI pipelines.

Rating

0.0 (0)

Developer

๐Ÿ‘ Nicolas van Arkens

Nicolas van Arkens

Maintained by Community

Actor stats

Bookmarked: 0
Total users: 2
Monthly active users: 1
Last modified: 22 days ago

arXiv Search & Paper Scraper 📚

Search arXiv and get clean, structured JSON for every paper: title, authors, abstract, categories, DOI, journal reference, PDF link, and dates. The arXiv API returns awkward Atom XML; this actor does the parsing for you and hands back tidy records ready for analysis, datasets, citation management, or feeding papers to an LLM.

Why use it

  • 🔎 Flexible search: by keywords, author, arXiv category, or title
  • 👥 Authors as a clean list: not a blob of XML
  • 🏷️ Categories split out: primary category plus all cross-listed ones
  • 🔗 Direct PDF + abstract links: and DOI / journal reference when available
  • 📅 Parsed dates: published and last-updated
  • 🧹 Normalized text: abstracts cleaned of the API's messy whitespace
  • ↕️ Sort by relevance, last updated, or submission date
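
To make the parsing step concrete, here is a minimal sketch of turning one Atom entry into a record shaped like the actor's output. It is not the actor's actual code: the sample feed is hand-written for illustration, and `clean` and `parse_entry` are hypothetical helper names. The two XML namespaces are the ones the arXiv API uses.

```python
import xml.etree.ElementTree as ET

# Namespaces used by arXiv's Atom feed.
NS = {
    "atom": "http://www.w3.org/2005/Atom",
    "arxiv": "http://arxiv.org/schemas/atom",
}

# Hand-written sample (an assumption for illustration, not a real API response).
SAMPLE_FEED = """<feed xmlns="http://www.w3.org/2005/Atom"
      xmlns:arxiv="http://arxiv.org/schemas/atom">
  <entry>
    <id>http://arxiv.org/abs/1706.03762v7</id>
    <updated>2023-08-02T00:41:18Z</updated>
    <published>2017-06-12T17:57:34Z</published>
    <title>Attention Is All
  You Need</title>
    <summary>  The dominant sequence transduction models
are based on...
</summary>
    <author><name>Ashish Vaswani</name></author>
    <author><name>Noam Shazeer</name></author>
    <arxiv:primary_category term="cs.CL"/>
    <category term="cs.CL"/>
    <category term="cs.LG"/>
  </entry>
</feed>"""


def clean(text):
    """Collapse runs of whitespace and newlines into single spaces."""
    return " ".join((text or "").split())


def parse_entry(entry):
    """Convert one <entry> element into a flat dictionary."""
    abs_url = entry.findtext("atom:id", "", NS)
    primary = entry.find("arxiv:primary_category", NS)
    return {
        "arxivId": abs_url.rsplit("/abs/", 1)[-1],
        "title": clean(entry.findtext("atom:title", "", NS)),
        "summary": clean(entry.findtext("atom:summary", "", NS)),
        "authors": [clean(a.findtext("atom:name", "", NS))
                    for a in entry.findall("atom:author", NS)],
        "primaryCategory": primary.get("term") if primary is not None else None,
        "categories": [c.get("term") for c in entry.findall("atom:category", NS)],
        "published": entry.findtext("atom:published", None, NS),
        "updated": entry.findtext("atom:updated", None, NS),
        "doi": entry.findtext("arxiv:doi", None, NS),  # absent for many papers
        "absUrl": abs_url,
    }


records = [parse_entry(e) for e in ET.fromstring(SAMPLE_FEED).findall("atom:entry", NS)]
```

Optional fields such as the DOI come back as `None` when the entry does not carry them, which matches the "when available" wording above.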

Use cases

  • Literature reviews & research: pull every recent paper in a field
  • Building datasets: assemble structured corpora of papers and abstracts
  • LLM / RAG pipelines: feed clean abstracts and metadata to models
  • Trend monitoring: track new submissions in a category over time
  • Citation & reference tooling: grab DOIs and journal refs at scale

Input

Field              Description
Search query       Free-text keywords across all fields.
Author             Restrict to an author (phrase match).
Category           arXiv code, e.g. cs.LG, cs.CL, stat.ML.
Title contains     Restrict by title phrase.
Sort by / order    Relevance, last updated, or submitted; asc/desc.
Maximum papers     How many to return.
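
The listing does not document how these fields are translated into an arXiv request, so the following is only a sketch of one plausible mapping onto the public arXiv API query syntax (`all:`, `au:`, `cat:`, `ti:`, plus `sortBy`, `sortOrder`, and `max_results`). The function name `build_arxiv_url`, its parameter names, and the choice to AND the keywords together are assumptions made for illustration.

```python
from urllib.parse import urlencode

# Assumed mapping from the input labels to arXiv API parameter values.
SORT_FIELDS = {
    "relevance": "relevance",
    "updated": "lastUpdatedDate",
    "submitted": "submittedDate",
}
SORT_ORDERS = {"asc": "ascending", "desc": "descending"}


def build_arxiv_url(query=None, author=None, category=None, title=None,
                    sort_by="relevance", order="desc", max_papers=10):
    """Build an arXiv API URL from input fields like those listed above."""
    clauses = []
    if query:
        # Assumption: require every keyword, each searched across all fields.
        clauses.extend(f"all:{word}" for word in query.split())
    if author:
        clauses.append(f'au:"{author}"')   # quoted for a phrase match
    if category:
        clauses.append(f"cat:{category}")
    if title:
        clauses.append(f'ti:"{title}"')
    if not clauses:
        raise ValueError("at least one search field is required")

    params = {
        "search_query": " AND ".join(clauses),
        "sortBy": SORT_FIELDS[sort_by],
        "sortOrder": SORT_ORDERS[order],
        "start": 0,
        "max_results": max_papers,
    }
    return "http://export.arxiv.org/api/query?" + urlencode(params)


url = build_arxiv_url(query="transformer", category="cs.CL",
                      sort_by="submitted", max_papers=5)
```

`urlencode` takes care of escaping spaces, colons, and quotes, so the clauses can be written in readable form.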

Output

{
  "arxivId": "1706.03762v7",
  "version": 7,
  "title": "Attention Is All You Need",
  "summary": "The dominant sequence transduction models are based on...",
  "authors": ["Ashish Vaswani", "Noam Shazeer", "Niki Parmar"],
  "authorCount": 3,
  "primaryCategory": "cs.CL",
  "categories": ["cs.CL", "cs.LG"],
  "published": "2017-06-12T17:57:34Z",
  "updated": "2023-08-02T00:41:18Z",
  "doi": "10.5555/3295222.3295349",
  "journalRef": "NeurIPS 2017",
  "pdfUrl": "http://arxiv.org/pdf/1706.03762v7",
  "absUrl": "http://arxiv.org/abs/1706.03762v7"
}
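
As a rough illustration of consuming such a record downstream, the sketch below splits the versioned ID and turns the timestamps into timezone-aware `datetime` objects. The helper names `split_arxiv_id` and `parse_timestamp` are hypothetical, and the record is trimmed to the fields used here.

```python
import re
from datetime import datetime, timezone

# A trimmed copy of the example record above.
record = {
    "arxivId": "1706.03762v7",
    "version": 7,
    "published": "2017-06-12T17:57:34Z",
    "updated": "2023-08-02T00:41:18Z",
}


def split_arxiv_id(arxiv_id):
    """Split '1706.03762v7' into ('1706.03762', 7); version is None if absent."""
    match = re.fullmatch(r"(.+?)v(\d+)", arxiv_id)
    if match:
        return match.group(1), int(match.group(2))
    return arxiv_id, None


def parse_timestamp(value):
    """Parse a 'YYYY-MM-DDTHH:MM:SSZ' string into a UTC-aware datetime."""
    return datetime.strptime(value, "%Y-%m-%dT%H:%M:%SZ").replace(tzinfo=timezone.utc)


base_id, version = split_arxiv_id(record["arxivId"])
published = parse_timestamp(record["published"])
updated = parse_timestamp(record["updated"])
was_revised = updated > published  # True when a later version exists
```

The version-free ID is handy as a stable key when the same paper shows up across runs with different version suffixes.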

Export to JSON, CSV, or Excel, or pull via the Apify API. Connect to Sheets, Notion, Slack, Zapier, or Make.
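
Apify handles these exports itself. If you already hold the records as JSON and want to flatten them locally, a sketch along these lines may help. The function name `records_to_csv` and the `"; "` list separator are illustrative choices, not part of the actor.

```python
import csv
import io


def records_to_csv(records, list_separator="; "):
    """Serialize paper records to CSV text, joining list fields into one cell."""
    # Collect column names in first-seen order across all records.
    fieldnames = []
    for rec in records:
        for key in rec:
            if key not in fieldnames:
                fieldnames.append(key)

    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    for rec in records:
        writer.writerow({
            key: list_separator.join(map(str, value)) if isinstance(value, list) else value
            for key, value in rec.items()
        })
    return buffer.getvalue()


csv_text = records_to_csv([
    {
        "arxivId": "1706.03762v7",
        "authors": ["Ashish Vaswani", "Noam Shazeer"],
        "version": 7,
    }
])
```

Joining `authors` and `categories` into a single cell keeps one row per paper; a separator other than a comma avoids extra quoting in the output.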

Notes

  • Uses the official public arXiv API. Independent tool, not affiliated with arXiv or Cornell University.
  • Please be considerate with large jobs; the actor paces requests to respect arXiv's API guidelines.
  • arXiv category reference: see arxiv.org/category_taxonomy for the full list of codes.
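
The pacing mentioned above is not described in detail. If you call the arXiv API from your own code, one simple way to space requests out is sketched below. The generator name `paced` and the 3-second default are assumptions; check arXiv's current API terms for the recommended delay.

```python
import time


def paced(items, min_interval=3.0, clock=time.monotonic, sleep=time.sleep):
    """Yield items so that consecutive yields are at least min_interval seconds apart.

    clock and sleep are injectable so the pacing can be tested without waiting.
    """
    last = None
    for item in items:
        if last is not None:
            wait = min_interval - (clock() - last)
            if wait > 0:
                sleep(wait)
        last = clock()
        yield item


# Usage sketch: iterate over page offsets, fetching one page per iteration.
# for start in paced(range(0, 300, 100)):
#     fetch_page(start)  # hypothetical function that performs the request
```

Because the delay is measured from the previous yield, time spent processing a page counts toward the interval and no extra waiting is added on top.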
