arXiv Scraper - Scientific Papers, Abstracts & PDFs

Pricing

Pay per usage

arXiv Scraper for the official arXiv API. Search 2M+ scientific papers in CS, physics, math and biology by keyword, title, author, abstract or category. Extract title, authors, abstract, categories, DOI, dates and PDF links. For AI/ML research, literature reviews and RAG datasets.

Developer

ben

Maintained by Community

Last modified: 8 hours ago

arXiv Scraper — Scientific Papers, Abstracts & PDFs

Search arXiv.org — 2M+ open-access scientific papers in physics, CS, math, biology, economics and more — via the official arXiv API.

Built for AI/ML research, literature reviews, RAG datasets, and research analytics. Keyless, fast and reliable — no proxy or browser needed.

What you get

Per paper:

title, arxiv_id
authors, author_count
abstract (full text)
primary_category, categories
published, updated
doi, journal_ref, comment
pdf_url, abstract_url
scraped_at

Why this Actor

	arXiv Scraper	Manual search	Raw arXiv API
Clean flat JSON output	Yes	—	Atom XML to parse
Search + filters + paging	Yes	Slow	DIY
PDF + abstract links	Yes	Manual	Yes
Pay per result	Yes	—	—
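
To illustrate the "Atom XML to parse" cell in the table above, here is a minimal sketch of flattening one raw arXiv API entry into a record shaped like this Actor's output. It is not the Actor's own code: the helper names (`flatten_entry`, `parse_feed`) are hypothetical, the embedded feed is a hand-written fragment that reuses the values from this page's sample output, and building the URLs from the ID is an assumption.

```python
import xml.etree.ElementTree as ET

# XML namespaces used by arXiv API responses.
NS = {
    "atom": "http://www.w3.org/2005/Atom",
    "arxiv": "http://arxiv.org/schemas/atom",
}

# Hand-written fragment for illustration; a real response has more elements.
SAMPLE_FEED = """<feed xmlns="http://www.w3.org/2005/Atom"
      xmlns:arxiv="http://arxiv.org/schemas/atom">
  <entry>
    <id>http://arxiv.org/abs/2605.30351v1</id>
    <published>2026-05-28T17:59:57Z</published>
    <title>VideoMLA: Low-Rank Latent KV Cache
      for Minute-Scale Video</title>
    <summary>Long-rollout causal video diffusion...</summary>
    <author><name>Hidir Yesiltepe</name></author>
    <author><name>Jiazhen Hu</name></author>
    <arxiv:primary_category term="cs.CV"/>
    <category term="cs.CV"/>
    <category term="cs.AI"/>
  </entry>
</feed>"""


def _text(entry, path):
    """Return whitespace-normalized text of a child element, or None."""
    node = entry.find(path, NS)
    if node is None or node.text is None:
        return None
    return " ".join(node.text.split())


def flatten_entry(entry):
    """Map one Atom <entry> to a flat dict (assumed field mapping)."""
    raw_id = _text(entry, "atom:id") or ""
    arxiv_id = raw_id.rsplit("/abs/", 1)[-1]
    authors = [
        " ".join(name.text.split())
        for name in entry.findall("atom:author/atom:name", NS)
        if name.text
    ]
    primary = entry.find("arxiv:primary_category", NS)
    return {
        "arxiv_id": arxiv_id,
        "title": _text(entry, "atom:title"),
        "authors": authors,
        "author_count": len(authors),
        "abstract": _text(entry, "atom:summary"),
        "primary_category": primary.get("term") if primary is not None else None,
        "categories": [c.get("term") for c in entry.findall("atom:category", NS)],
        "published": _text(entry, "atom:published"),
        "doi": _text(entry, "arxiv:doi"),
        # Assumption: links are rebuilt from the ID rather than read from <link>.
        "pdf_url": f"https://arxiv.org/pdf/{arxiv_id}",
        "abstract_url": f"https://arxiv.org/abs/{arxiv_id}",
    }


def parse_feed(xml_text):
    """Parse an Atom feed string into a list of flat records."""
    root = ET.fromstring(xml_text)
    return [flatten_entry(entry) for entry in root.findall("atom:entry", NS)]
```

Calling `parse_feed(SAMPLE_FEED)` returns a one-element list whose record mirrors the sample output on this page, with `doi` set to `None` because the fragment has no DOI element.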

Input

Use the simple fields below, or pass a raw `searchQuery` for full arXiv query syntax; when set, `searchQuery` overrides the simple fields.

Field	Type	Description
`allFields`	string	Keyword across title/abstract/authors
`title`	string	Title contains
`author`	string	Author name
`abstract`	string	Abstract contains
`category`	string	arXiv category (e.g. `cs.LG`, `cs.CL`, `cs.AI`)
`searchQuery`	string	Advanced raw query (overrides the above)
`sortBy`	string	Relevance / Newest / Recently updated
`maxResults`	integer	Max papers to return
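
As a rough illustration of how the simple fields in the table could translate into arXiv query syntax, the sketch below maps each field to the corresponding arXiv prefix (`all:`, `ti:`, `au:`, `abs:`, `cat:`). This page does not document how the Actor combines fields, so the `AND` joining, the phrase quoting, and the function name `build_search_query` are assumptions for illustration only.

```python
# (keyword argument, arXiv field prefix) pairs, in output order.
PREFIXES = (
    ("all_fields", "all"),
    ("title", "ti"),
    ("author", "au"),
    ("abstract", "abs"),
    ("category", "cat"),
)


def _clause(prefix, value):
    """Build one `prefix:value` clause, quoting multi-word phrases."""
    value = value.strip()
    if " " in value:
        value = f'"{value}"'
    return f"{prefix}:{value}"


def build_search_query(search_query=None, **fields):
    """Return an arXiv query string; a raw search_query overrides the rest."""
    if search_query:
        return search_query
    clauses = [
        _clause(prefix, fields[name])
        for name, prefix in PREFIXES
        if fields.get(name)
    ]
    if not clauses:
        raise ValueError("at least one search field is required")
    # Assumption: all supplied fields must match, hence AND.
    return " AND ".join(clauses)
```

For example, `build_search_query(abstract="retrieval augmented", category="cs.CL")` yields `abs:"retrieval augmented" AND cat:cs.CL`.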

Example: newest LLM papers

{
  "allFields": "large language models",
  "sortBy": "newest",
  "maxResults": 100
}

Example: filter by category with advanced syntax

{
  "searchQuery": "cat:cs.CL AND abs:\"retrieval augmented\"",
  "sortBy": "newest",
  "maxResults": 200
}
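
The inputs above can also be submitted programmatically. The following is a minimal sketch, assuming the `apify-client` Python package is installed and that its `ApifyClient.actor(...).call(...)` and `dataset(...).iterate_items()` methods behave as in its 1.x releases. The Actor ID is taken from this page's URL; the `APIFY_TOKEN` environment variable and the `fetch_papers` helper are illustrative names, not part of the Actor.

```python
import os

# Actor ID as it appears in this page's URL.
ACTOR_ID = "benthepythondev/arxiv-scraper"

# Same input as the "newest LLM papers" example.
RUN_INPUT = {
    "allFields": "large language models",
    "sortBy": "newest",
    "maxResults": 100,
}


def fetch_papers(run_input, token=None):
    """Run the Actor and return its dataset items as a list of dicts."""
    # Imported lazily so this module loads without apify-client installed.
    from apify_client import ApifyClient

    # Assumption: the API token is supplied directly or via APIFY_TOKEN.
    client = ApifyClient(token or os.environ["APIFY_TOKEN"])
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

With a valid token in the environment, `fetch_papers(RUN_INPUT)` would start a run, wait for it to finish, and return the scraped records.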

Sample output

{
  "arxiv_id": "2605.30351v1",
  "title": "VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Video",
  "authors": ["Hidir Yesiltepe", "Jiazhen Hu"],
  "primary_category": "cs.CV",
  "categories": ["cs.CV", "cs.AI"],
  "published": "2026-05-28T17:59:57Z",
  "abstract": "Long-rollout causal video diffusion...",
  "pdf_url": "https://arxiv.org/pdf/2605.30351v1",
  "abstract_url": "https://arxiv.org/abs/2605.30351v1"
}
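
Two fields in the sample record often need light post-processing: `arxiv_id` carries a version suffix (`v1`) and `published` is an ISO 8601 string ending in `Z`. The sketch below shows one way to handle both with the standard library; the helper names `split_version` and `parse_timestamp` are hypothetical and not part of the Actor.

```python
import re
from datetime import datetime, timezone

# Matches a trailing version suffix such as "v1" or "v12".
_VERSION_RE = re.compile(r"^(?P<base>.+?)v(?P<version>\d+)$")


def split_version(arxiv_id):
    """Split "2605.30351v1" into ("2605.30351", 1); version is None if absent."""
    match = _VERSION_RE.match(arxiv_id)
    if match is None:
        return arxiv_id, None
    return match.group("base"), int(match.group("version"))


def parse_timestamp(value):
    """Parse a timestamp like "2026-05-28T17:59:57Z" into an aware datetime."""
    # datetime.fromisoformat() accepts a trailing "Z" only from Python 3.11 on,
    # so normalize it to an explicit UTC offset first.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))
```

Stripping the version is useful when de-duplicating papers across runs, since a revised paper keeps its base ID and only the suffix changes.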

Use cases

AI/ML research — track the latest papers in a field or category
RAG / LLM datasets — build corpora of abstracts + PDF links by topic
Literature reviews — gather and rank relevant papers fast
Research analytics — analyse output by category, author and time
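
For the RAG use case, one possible way to turn returned records into a JSON Lines corpus is sketched below. The document layout (`id`, `text`, `metadata`) and the helper names are assumptions chosen for illustration; adapt them to whatever your indexing pipeline expects.

```python
import json


def to_rag_document(paper):
    """Convert one Actor record into a simple retrieval document."""
    return {
        "id": paper["arxiv_id"],
        # Title and abstract together form the text to embed.
        "text": f'{paper["title"]}\n\n{paper["abstract"]}',
        "metadata": {
            "categories": paper.get("categories", []),
            "published": paper.get("published"),
            "pdf_url": paper.get("pdf_url"),
        },
    }


def write_jsonl(papers, fp):
    """Write one JSON document per line to fp; return the number written."""
    count = 0
    for paper in papers:
        fp.write(json.dumps(to_rag_document(paper), ensure_ascii=False) + "\n")
        count += 1
    return count
```

`write_jsonl` accepts any writable text file object, for example one opened with `open("corpus.jsonl", "w", encoding="utf-8")`.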

Pricing

Pay-per-result. You are charged only for the papers returned — empty runs cost nothing.

Notes & legal

Uses the official arXiv API. Please respect arXiv's API terms and rate limits (the Actor waits between requests).
Use data only for lawful purposes.
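
If you query the raw arXiv API yourself, a similar wait between requests is worth building in. The sketch below is not the Actor's implementation: `page_params` and `Throttle` are hypothetical helpers, and the 3-second default reflects arXiv's published guidance as understood at the time of writing, so check the current API terms before relying on it.

```python
import time


def page_params(total, page_size=100):
    """Yield (start, max_results) pairs that together cover `total` results."""
    for start in range(0, total, page_size):
        yield start, min(page_size, total - start)


class Throttle:
    """Enforce a minimum interval between consecutive calls to wait()."""

    def __init__(self, min_interval=3.0, clock=time.monotonic, sleep=time.sleep):
        self._min_interval = min_interval
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous call, None before the first

    def wait(self):
        """Sleep just long enough to respect the minimum interval."""
        now = self._clock()
        if self._last is not None:
            remaining = self._min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now
```

A paging loop would call `throttle.wait()` before each request and pass each `(start, max_results)` pair from `page_params` as the API's `start` and `max_results` query parameters.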

Related actors

More scrapers from the same author:

OpenAlex Scraper — academic papers & citations
PubMed Scraper — biomedical literature & citations
Reddit Archive Scraper — years of historical posts & comments

URL: https://apify.com/benthepythondev/arxiv-scraper