HuggingFace Papers Scraper

Pricing

$20.00 / 1,000 results

Scrape trending HuggingFace Papers by day, week, or month. Get titles, dates, submitters, organizations, upvotes, abstracts, summaries, PDFs, project links, and agent-ready commands for AI agents, RAG pipelines, research monitoring, and automation.

Developer

Marco Rodrigues

Maintained by Community

Last modified: a month ago

🤗 HuggingFace Papers Scraper

Track the latest AI research from HuggingFace Papers and turn trending papers into clean, structured data for agents, RAG systems, dashboards, and research workflows.

Choose a period (Daily, Weekly, or Monthly) and an end date, then scrape up to 100 papers per run. Each record includes the title, dates, submitter details, organization, upvotes, abstract, summary, PDF link, project page, and the HuggingFace CLI command an agent can use to read the paper. The actor starts at the end date and paginates toward older papers.

💡 Perfect For

🤖 AI Agents: Give agents fresh, structured research context with direct pdf_url, project_url, and agent_command fields.
📚 RAG Pipelines: Index abstracts, summaries, metadata, and source URLs so assistants can answer questions about recent AI papers with citations.
🔬 Research Monitoring: Track emerging models, benchmarks, datasets, and methods across daily, weekly, or monthly HuggingFace trends.
📈 Trend Analysis: Compare upvotes, organizations, publication dates, and topics to spot fast-moving areas in AI.
⚙️ Automation Workflows: Feed new paper metadata into Slack bots, Discord alerts, newsletters, spreadsheets, or internal agent workflows.

✨ Why This Actor Matters

AI agents are only as useful as the context they can reliably access. HuggingFace Papers shows what the AI community is reading right now, but agents and pipelines need structured fields, stable links, and normalized dates rather than raw HTML.

This actor turns that fast-moving research feed into data that is easy to search, rank, summarize, embed, and route into automated systems.

📦 What's Inside The Data?

For every paper, the actor returns:

Core metadata: url, title, published_date, submitted_date
Submitter details: submitted_by, submitted_by_url
Organization details: organization, organization_url
Engagement: upvotes
Research content: abstract, summary
Useful links: pdf_url, project_url
Agent-ready command: agent_command, for example hf papers read 2605.29486
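
The field list above can be mirrored in a small typed record, which makes downstream code easier to check. This is a minimal sketch, not the actor's own code: the `Paper` class and `paper_from_record` helper are hypothetical names, and which fields may be missing or null is an assumption (the sample output only shows `project_url` as null).

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Any, Dict, Optional


@dataclass(frozen=True)
class Paper:
    """One scraped record; field names mirror the list above."""
    url: str
    title: str
    published_date: Optional[datetime]
    submitted_date: Optional[datetime]
    submitted_by: Optional[str]
    submitted_by_url: Optional[str]
    organization: Optional[str]
    organization_url: Optional[str]
    upvotes: int
    abstract: Optional[str]
    summary: Optional[str]
    pdf_url: Optional[str]
    project_url: Optional[str]
    agent_command: Optional[str]


def _parse_date(value: Optional[str]) -> Optional[datetime]:
    # Dates in the sample output look like "2026-05-28T00:00:00".
    return datetime.fromisoformat(value) if value else None


def paper_from_record(record: Dict[str, Any]) -> Paper:
    """Build a Paper from one dataset item (hypothetical helper)."""
    return Paper(
        url=record["url"],
        title=record["title"],
        published_date=_parse_date(record.get("published_date")),
        submitted_date=_parse_date(record.get("submitted_date")),
        submitted_by=record.get("submitted_by"),
        submitted_by_url=record.get("submitted_by_url"),
        organization=record.get("organization"),
        organization_url=record.get("organization_url"),
        # Assumption: a missing or null upvote count is treated as 0.
        upvotes=int(record.get("upvotes") or 0),
        abstract=record.get("abstract"),
        summary=record.get("summary"),
        pdf_url=record.get("pdf_url"),
        project_url=record.get("project_url"),
        agent_command=record.get("agent_command"),
    )
```

Parsing the dates once at load time means later sorting and filtering can compare `datetime` values instead of strings.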

🚀 Quick Start

Open the actor on Apify or run it locally.
Choose the period: Daily, Weekly, or Monthly.
Choose end_date. If omitted or set in the future, the actor uses the current date.
Set max_papers to the number of papers you want, from 10 to 100 (default 100).
Start the actor and export the results as JSON, CSV, Excel, or through the Apify API.
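
For the Apify API route in the last step, one option is the `apify-client` Python package. The sketch below assumes that package is installed and that a run returns a dictionary with a `defaultDatasetId` key, as it does in the client versions I am aware of. The actor ID is taken from this page's URL; `build_run_input` and `run_scraper` are hypothetical helper names.

```python
from typing import Any, Dict, Iterator, Optional

# Actor ID as it appears in the actor's Apify Store URL.
ACTOR_ID = "dadhalfdev/huggingface-papers-scraper"


def build_run_input(
    period: str = "Daily",
    end_date: Optional[str] = None,
    max_papers: int = 100,
) -> Dict[str, Any]:
    """Assemble the actor input; end_date is left out when not given."""
    run_input: Dict[str, Any] = {"period": period, "max_papers": max_papers}
    if end_date is not None:
        run_input["end_date"] = end_date  # expected format: YYYY-MM-DD
    return run_input


def run_scraper(token: str, run_input: Dict[str, Any]) -> Iterator[Dict[str, Any]]:
    """Start the actor, wait for it to finish, and yield dataset items."""
    # Third-party dependency, imported lazily: pip install apify-client
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    if run is None:
        raise RuntimeError("Actor run did not return a result")
    yield from client.dataset(run["defaultDatasetId"]).iterate_items()


# Example usage (needs a valid Apify API token and network access):
#   for paper in run_scraper(token, build_run_input("Weekly", "2026-06-01", 25)):
#       print(paper["upvotes"], paper["title"])
```

Leaving `end_date` out of the input lets the actor fall back to the current date, as described in the steps above.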

🧑‍💻 Tech Details

Input Example:

{
  "period": "Daily",
  "end_date": "2026-06-01",
  "max_papers": 100
}

The actor builds the HuggingFace Papers URL from period and end_date, then paginates to older papers:

Daily + 2026-06-01 -> https://huggingface.co/papers/date/2026-06-01
Weekly + 2026-06-01 -> https://huggingface.co/papers/week/2026-W23
Monthly + 2026-06-01 -> https://huggingface.co/papers/month/2026-06
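
The three mappings above can be reproduced in a few lines. This is a sketch under stated assumptions, not the actor's source: it assumes the weekly URL uses ISO 8601 week numbering (which matches the `2026-W23` example), and `previous_end_date` is only a guess at how stepping to an older period might work. Both function names are hypothetical.

```python
from datetime import date, timedelta

BASE_URL = "https://huggingface.co/papers"


def listing_url(period: str, end_date: date) -> str:
    """Map a period and end date to a HuggingFace Papers listing URL."""
    if period == "Daily":
        return f"{BASE_URL}/date/{end_date.isoformat()}"
    if period == "Weekly":
        # Assumption: ISO 8601 week numbering, zero-padded to two digits.
        iso_year, iso_week, _ = end_date.isocalendar()
        return f"{BASE_URL}/week/{iso_year}-W{iso_week:02d}"
    if period == "Monthly":
        return f"{BASE_URL}/month/{end_date:%Y-%m}"
    raise ValueError(f"Unknown period: {period!r}")


def previous_end_date(period: str, end_date: date) -> date:
    """Step back one period (hypothetical pagination rule)."""
    if period == "Daily":
        return end_date - timedelta(days=1)
    if period == "Weekly":
        return end_date - timedelta(weeks=1)
    if period == "Monthly":
        # Last day of the previous month.
        return end_date.replace(day=1) - timedelta(days=1)
    raise ValueError(f"Unknown period: {period!r}")
```

Using the ISO year from `isocalendar()` rather than `end_date.year` matters around New Year, when the first days of January can belong to the last ISO week of the previous year.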

Output Example:

{
  "url": "https://huggingface.co/papers/2605.29486",
  "title": "PhoneWorld: Scaling Phone-Use Agent Environments",
  "published_date": "2026-05-28T00:00:00",
  "submitted_date": "2026-05-29T00:00:00",
  "submitted_by": "Zhengyang Tang",
  "submitted_by_url": "https://huggingface.co/tangzhy",
  "organization": "shanghai ailab",
  "organization_url": "https://huggingface.co/ShanghaiAiLab",
  "upvotes": 2,
  "abstract": "PhoneWorld is a pipeline that transforms real GUI trajectories and screenshots into controllable mobile environments, executable tasks, and automated verifiers, enabling scalable creation of phone-use benchmarks.",
  "summary": "A central bottleneck for phone-use agents is that controllable, reproducible environments covering real mobile behavior are hard to build at scale...",
  "pdf_url": "https://arxiv.org/pdf/2605.29486",
  "project_url": null,
  "agent_command": "hf papers read 2605.29486"
}
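
A record shaped like the one above can be turned into an embedding-ready document for a RAG index, with source links kept as metadata for citations. The sketch below is one possible layout, not something the actor produces: the `to_rag_document` and `top_by_upvotes` helpers and the `id`/`text`/`metadata` structure are assumptions for illustration.

```python
from typing import Any, Dict, Iterable, List

# Fields kept alongside the text so answers can cite their source.
METADATA_KEYS = (
    "url", "pdf_url", "project_url", "published_date", "organization", "upvotes",
)


def to_rag_document(record: Dict[str, Any]) -> Dict[str, Any]:
    """Combine title, abstract, and summary into one text chunk."""
    parts = [record.get("title"), record.get("abstract"), record.get("summary")]
    text = "\n\n".join(p.strip() for p in parts if p and p.strip())
    # The last URL path segment serves as the id, e.g. "2605.29486".
    doc_id = record["url"].rstrip("/").rsplit("/", 1)[-1]
    metadata = {key: record.get(key) for key in METADATA_KEYS}
    return {"id": doc_id, "text": text, "metadata": metadata}


def top_by_upvotes(records: Iterable[Dict[str, Any]], n: int = 10) -> List[Dict[str, Any]]:
    """Return the n most-upvoted records; a missing count sorts as 0."""
    return sorted(records, key=lambda r: r.get("upvotes") or 0, reverse=True)[:n]
```

Keeping `url` and `pdf_url` in the metadata lets an assistant link back to the paper it quotes.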

Parameters:

Parameter	Type	Required	Description
`period`	string	No	HuggingFace Papers period to scrape: `Daily`, `Weekly`, or `Monthly`. Default: `Daily`.
`end_date`	string	No	Latest date to scrape from. Format: `YYYY-MM-DD`. The actor paginates to older papers from this date. If omitted or in the future, the actor uses the current date.
`max_papers`	integer	No	Number of papers to collect from the listing. Min 10, max 100, default 100.
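
The defaults and limits in the table can be mirrored on the client side before a run is started. This is a sketch under assumptions: the table states the limits but not how the actor treats out-of-range values, so clamping `max_papers` into the 10 to 100 range here is a choice made for illustration. `normalize_input` is a hypothetical helper name.

```python
from datetime import date
from typing import Any, Dict, Optional

PERIODS = ("Daily", "Weekly", "Monthly")
MIN_PAPERS, MAX_PAPERS = 10, 100


def normalize_input(raw: Dict[str, Any], today: Optional[date] = None) -> Dict[str, Any]:
    """Apply the documented defaults and limits to a raw input dict."""
    today = today or date.today()

    period = raw.get("period") or "Daily"
    if period not in PERIODS:
        raise ValueError(f"period must be one of {PERIODS}, got {period!r}")

    # A missing or future end_date falls back to the current date.
    end_raw = raw.get("end_date")
    end = date.fromisoformat(end_raw) if end_raw else today
    if end > today:
        end = today

    # Assumption: out-of-range values are clamped rather than rejected.
    max_papers = int(raw.get("max_papers", MAX_PAPERS))
    max_papers = max(MIN_PAPERS, min(MAX_PAPERS, max_papers))

    return {"period": period, "end_date": end.isoformat(), "max_papers": max_papers}
```

Passing `today` explicitly keeps the function deterministic, which is convenient in scheduled jobs and tests.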

URL: https://apify.com/dadhalfdev/huggingface-papers-scraper
