ArXiv Paper Scraper

Pricing

from $2.00 / 1,000 results

ArXiv Paper Scraper

Search and extract scientific papers from ArXiv.org across any field. Returns title, authors, full abstract, PDF link, arXiv ID, categories, and submission date. Ideal for AI research monitoring, RAG pipelines, literature reviews, and academic trend analysis. No API key needed.

Pricing

from $2.00 / 1,000 results

Rating

0.0

(0)

Developer

👁 Sheshinmcfly

Sheshinmcfly

Maintained by Community

Actor stats

Bookmarked

Total users

Monthly active users

9 days ago

Last modified

What data does it extract?

Field	Description	Example
`arxivId`	ArXiv paper ID	`"2604.18584"`
`title`	Full paper title	`"MathNet: a Global Multimodal Benchmark..."`
`authors`	List of authors	`["Shaden Alshammari", "Kevin Wen"]`
`abstract`	Full abstract text	`"Mathematical problem solving remains..."`
`categories`	ArXiv subject tags	`["cs.AI", "cs.LG", "cs.IR"]`
`primaryCategory`	Primary category	`"cs.AI"`
`submittedDate`	Submission date	`"20 April, 2026"`
`comments`	Author comments	`"ICLR 2026; 30 pages"`
`journalRef`	Journal reference	`"Proceedings of ICLR, 2026"`
`pdfUrl`	Direct PDF link	`"https://arxiv.org/pdf/2604.18584"`
`url`	ArXiv abstract page	`"https://arxiv.org/abs/2604.18584"`
`query`	Search query used	`"large language models"`
`extractedAt`	Extraction timestamp	`"2026-04-21T12:00:00Z"`

Use cases

RAG pipelines: Feed domain-specific papers into retrieval-augmented AI systems
AI research monitoring: Track the latest publications in LLMs, computer vision, NLP
Academic trend analysis: Identify hot topics and emerging research areas
Literature review automation: Collect papers for a specific topic at scale
LLM fine-tuning data: High-quality scientific text for model training
Competitive intelligence: Monitor what research competitors are publishing

How to use

Open the actor and configure:
- Search queries: One or more search terms (e.g. "diffusion models", "reinforcement learning")
- Search field: All fields, title only, abstract only, or author
- Sort by: Newest first or by relevance
- Max results: Number of papers per query
Click Start
Download results as JSON, CSV, or Excel

Example output (JSON)

{
"arxivId":"2604.18584",
"title":"MathNet: a Global Multimodal Benchmark for Mathematical Reasoning and Retrieval",
"authors":["Shaden Alshammari","Kevin Wen","Antonio Torralba"],
"abstract":"Mathematical problem solving remains a challenging test of reasoning...",
"categories":["cs.AI","cs.DL","cs.IR","cs.LG"],
"primaryCategory":"cs.AI",
"submittedDate":"20 April, 2026",
"comments":"ICLR 2026; Website: http://mathnet.mit.edu",
"journalRef":"Proceedings of ICLR, 2026",
"pdfUrl":"https://arxiv.org/pdf/2604.18584",
"url":"https://arxiv.org/abs/2604.18584",
"query":"large language models",
"extractedAt":"2026-04-21T12:00:00.000Z"
}

Pricing

This actor charges $0.002 USD per paper extracted. Extracting 100 papers costs approximately $0.20 USD.

Keywords

arxiv scraper, scientific paper extractor, research paper scraper, arxiv API, AI paper scraper, academic data extractor, preprint scraper, NLP research data, LLM training data, arxiv search scraper

Legal Disclaimer

This actor extracts publicly available open-access data only from ArXiv.org, in compliance with Chilean Law 19.628 on the Protection of Private Life (Ley 19.628 sobre Protección de la Vida Privada).

ArXiv is an open-access repository operated by Cornell University. All papers and metadata extracted are freely and publicly accessible without authentication.

What this actor does NOT collect:

Names or personal data of any private individuals
User accounts, submissions portals, or private information
Any data not freely visible to anonymous visitors

What this actor collects:

Paper titles, abstracts, and author names (public academic data)
Subject categories and submission dates
Public URLs and PDF links

Users are solely responsible for ensuring their use of this data complies with applicable laws and ArXiv's terms of use.

Other actors you may like

Stack Overflow Scraper — questions, answers and tags.
Reddit Thread Scraper — threads, posts and comments.
FinViz Stock Screener — stock screener — gainers, losers, most active and more.
Numbeo Cost of Living Scraper — cost-of-living indices by city.

👁 arXiv Research Paper Scraper avatar

arXiv Research Paper Scraper

crawlerbros/arxiv-research-paper-scraper

Scrape research papers from arXiv.org - search by query, category, or author; lookup by arXiv ID. Returns title, authors, abstract, PDF URL, DOI, categories, and more. Uses the public arXiv Atom API. No login or proxy required.

👁 User avatar

Crawler Bros

ArXiv Academic Paper Scraper

fortuitous_pirate/arxiv-scraper

Scrape academic papers from ArXiv. Extract titles, authors, abstracts, categories, and PDF links. Essential for research and literature reviews.

👁 User avatar

Fortuitous Pirate

👁 ArXiv Paper Search avatar

ArXiv Paper Search

gentle_cloud/arxiv-paper-search

Search and extract academic papers from ArXiv. Find papers by keyword, author, or category with full metadata including title, authors, abstract, categories, and PDF links.

👁 User avatar

Monkey Coder

👁 Arxiv Paper Intelligence avatar

Arxiv Paper Intelligence

viralanalyzer/arxiv-paper-intelligence

Search and extract ArXiv papers, abstracts, authors, and citations. Track research trends across any scientific field. AI-powered analysis.

👁 User avatar

viralanalyzer

5.0

arXiv Paper Scraper

cloud9_ai/arxiv-paper-scraper

Scrape academic papers from arXiv.org. Search by keyword, browse categories, or get latest papers. Extract titles, abstracts, authors, PDF links, and citation data via arXiv API.

👁 User avatar

cloud9

arXiv Research Papers Tracker

wsgcjj/arxiv-papers-scraper

Search and extract academic papers from arXiv by category, keyword, date range. Returns paper title, authors, abstract, categories, published date, PDF URL. Ideal for AI/ML research monitoring and training data collection.

👁 User avatar

陈俊杰

👁 arXiv Paper Scraper avatar

arXiv Paper Scraper

plantane/arxiv-scraper

Scrape research papers from arXiv by search query or category. Get titles, abstracts, authors, categories, and PDF links via the public arXiv API.

👁 User avatar

Daniel

arXiv Paper Scraper

skystone_labs/arxiv-scraper

Extract research papers from arXiv using the official API. Get titles, authors, abstracts, PDF URLs, categories, and more. Perfect for research datasets and literature reviews.

👁 User avatar

Skystone

Arxiv Paper Scraper - Scientific Research

vernacular_reservoir/arxiv-org-paper-scraper

Search and extract scientific papers from Arxiv.org. Get paper title, authors, abstract, categories, publication date and PDF link. Filter by topic, category (cs.AI, physics, math) and sort by relevance or date. No API key required.

👁 User avatar

Aleksandrs

👁 arXiv Scraper - Scientific Papers, Abstracts & PDFs avatar

arXiv Scraper - Scientific Papers, Abstracts & PDFs

benthepythondev/arxiv-scraper

arXiv Scraper for the official arXiv API. Search 2M+ scientific papers in CS, physics, math and biology by keyword, title, author, abstract or category. Extract title, authors, abstract, categories, DOI, dates and PDF links. For AI/ML research, literature reviews and RAG datasets.

👁 User avatar

ben

URL: https://apify.com/sheshinmcfly/arxiv-paper-scraper

⇱ ArXiv Paper Scraper - Scientific Research Data · Apify

ArXiv Paper Scraper

What data does it extract?

Use cases

How to use

Example output (JSON)

Pricing

Keywords

Legal Disclaimer

Other actors you may like

You might also like

arXiv Research Paper Scraper

ArXiv Academic Paper Scraper

ArXiv Paper Search

Arxiv Paper Intelligence

arXiv Paper Scraper

arXiv Research Papers Tracker

arXiv Paper Scraper

arXiv Paper Scraper

Arxiv Paper Scraper - Scientific Research

arXiv Scraper - Scientific Papers, Abstracts & PDFs