URL: https://apify.com/muhammadafzal/google-scholar-scraper

Google Scholar Scraper: Academic Papers & Citations · Apify


Pricing: from $5.00 / 1,000 results

Extract academic paper titles, authors, abstracts, citation counts, publication details, and PDF links from Google Scholar. Fast, reliable, no browser overhead. Search by keyword, topic, or author name. MCP-optimized for AI agents.

Rating: 0.0 (0)

Developer: Muhammad Afzal (Maintained by Community)

Actor stats

  • Bookmarked: 0
  • Total users: 1
  • Monthly active users: 0
  • Last modified: 12 days ago

Google Scholar Scraper: Extract Academic Papers, Citations & Author Data

Extract academic paper metadata from Google Scholar including titles, authors, abstracts, citation counts, publication venues, PDF links, and publication years. Search by keyword, research topic, or author name. Returns structured JSON ready for literature reviews, bibliometric analysis, and AI research pipelines.

Export scraped data, run the scraper via API, schedule and monitor runs, or integrate with other tools.

Features

  • 50 papers in ~10 seconds: API-based extraction with no browser overhead
  • Rich academic metadata: title, authors, abstract, venue, year, citation count, PDF link
  • Citation tracking: citation counts as displayed on Google Scholar, for research impact measurement
  • Year filtering: narrow results to a specific publication year range
  • Author search: find all papers by a specific researcher (e.g., "Geoffrey Hinton")
  • Multi-query support: search multiple keywords or authors in a single run
  • Up to 500 results per query: deep search coverage with automatic pagination
  • Direct PDF links: extracts direct PDF download URLs when available
  • Journal and conference data: venue name extracted from publication info
  • Source type detection: classifies results as PDF, book, or HTML

Use Cases

| Use Case | Description |
| --- | --- |
| Literature reviews | Systematically collect papers for academic research and systematic reviews |
| Bibliometric analysis | Measure research impact, track citation trends, map collaboration networks |
| Competitor research | Monitor competitor publications and R&D directions |
| Grant writing | Find related work, citation context, and research gaps for proposals |
| AI knowledge graphs | Feed structured academic data to LLMs for summarization and classification |
| Content creation | Generate research-backed articles, newsletters, and educational materials |
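
As an illustration of the bibliometric use case, the sketch below totals papers and citations per publication year. It assumes records carrying the `year` and `citationCount` fields described in the Output section; `citationsByYear` and the sample data are hypothetical and not part of the actor.

```javascript
// Hypothetical helper: aggregate paper and citation totals per publication year.
function citationsByYear(records) {
  const totals = new Map();
  for (const { year, citationCount } of records) {
    // Skip records without a usable year.
    if (!Number.isInteger(year)) continue;
    const entry = totals.get(year) ?? { papers: 0, citations: 0 };
    entry.papers += 1;
    entry.citations += citationCount ?? 0;
    totals.set(year, entry);
  }
  // Return one row per year, sorted ascending, for easy tabulation.
  return [...totals.entries()]
    .sort(([a], [b]) => a - b)
    .map(([year, t]) => ({ year, ...t }));
}

// Illustrative values only, not real scraped data.
const trendSample = [
  { title: 'Paper A', year: 2020, citationCount: 10 },
  { title: 'Paper B', year: 2020, citationCount: 5 },
  { title: 'Paper C', year: 2018, citationCount: 1 },
];
console.table(citationsByYear(trendSample));
```

Sorting by year keeps the rows ready for plotting a trend line or exporting to a spreadsheet.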

Input

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| searchQueries | string[] | ["machine learning"] | Keywords or topics to search |
| authorUrls | string[] | [] | Author names (e.g., "Geoffrey Hinton") or Scholar profile URLs to search |
| maxResults | integer | 20 | Max papers per query (1–500) |
| yearFrom | integer | none | Filter papers from this year onward |
| yearTo | integer | none | Filter papers up to this year |
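
The input fields above can be assembled and sanity-checked on the client side before starting a run. The following is a minimal sketch: `buildInput` is a hypothetical helper, and the checks only mirror the documented 1–500 range plus the assumption that `yearFrom` should not be later than `yearTo`. How the actor itself validates input is not documented here.

```javascript
// Hypothetical helper: build an input object using the documented field names.
function buildInput({ searchQueries = [], authorUrls = [], maxResults = 20, yearFrom, yearTo } = {}) {
  // Documented range for maxResults is 1 to 500.
  if (!Number.isInteger(maxResults) || maxResults < 1 || maxResults > 500) {
    throw new RangeError('maxResults must be an integer from 1 to 500.');
  }
  // Assumption: an inverted year range is a caller mistake.
  if (yearFrom !== undefined && yearTo !== undefined && yearFrom > yearTo) {
    throw new RangeError('yearFrom must not be later than yearTo.');
  }
  const input = { searchQueries, authorUrls, maxResults };
  // The year bounds have no default, so include them only when set.
  if (yearFrom !== undefined) input.yearFrom = yearFrom;
  if (yearTo !== undefined) input.yearTo = yearTo;
  return input;
}

const exampleInput = buildInput({
  searchQueries: ['graph neural networks'],
  authorUrls: ['Geoffrey Hinton'],
  maxResults: 100,
  yearFrom: 2020,
  yearTo: 2023,
});
console.log(exampleInput);
```

The resulting object has the same shape as the argument passed to the actor call in the API Usage section.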

Output

Each record represents one academic paper:

{
  "title": "Attention Is All You Need",
  "authors": ["Ashish Vaswani", "Noam Shazeer", "Niki Parmar"],
  "publicationInfo": "Advances in neural information processing systems, 2017",
  "abstract": "The dominant sequence transduction models are based on complex recurrent or convolutional neural networks...",
  "citationCount": 98432,
  "paperUrl": "https://proceedings.neurips.cc/paper/2017/hash/...",
  "pdfUrl": "https://arxiv.org/pdf/1706.03762.pdf",
  "sourceType": "PDF",
  "year": 2017,
  "citationsUrl": "https://scholar.google.com/scholar?cites=...",
  "relatedUrl": "https://scholar.google.com/scholar?q=related:...",
  "scrapedAt": "2025-08-01T12:00:00.000Z",
  "searchQuery": "transformer attention mechanism"
}
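
Records in this shape are straightforward to post-process. The sketch below ranks papers by `citationCount` and keeps those with a `pdfUrl`. The helper names and sample values are hypothetical, and it assumes a missing PDF link shows up as `null` or an absent field, which the listing does not specify.

```javascript
// Illustrative records in the output shape (values are made up).
const papers = [
  { title: 'Paper A', year: 2017, citationCount: 120, pdfUrl: 'https://example.org/a.pdf' },
  { title: 'Paper B', year: 2021, citationCount: 45, pdfUrl: null },
  { title: 'Paper C', year: 2019, citationCount: 300 },
];

// Return the n most-cited papers without mutating the input array.
function topCited(records, n) {
  return [...records]
    .sort((a, b) => (b.citationCount ?? 0) - (a.citationCount ?? 0))
    .slice(0, n);
}

// Keep only records that carry a non-empty direct PDF link.
function withPdf(records) {
  return records.filter((r) => typeof r.pdfUrl === 'string' && r.pdfUrl.length > 0);
}

console.log(topCited(papers, 2).map((p) => p.title).join(', ')); // Paper C, Paper A
console.log(withPdf(papers).map((p) => p.title).join(', ')); // Paper A
```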

API Usage

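import { ApifyClient } from 'apify-client';

// Authenticate with your Apify API token.
const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });

// Start the actor and wait for the run to finish.
// Replace USERNAME with the actor owner's username.
const run = await client.actor('USERNAME/google-scholar-scraper').call({
  searchQueries: ['large language models', 'RLHF reinforcement learning'],
  maxResults: 50,
  yearFrom: 2022,
});

// Read the scraped papers from the run's default dataset.
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(`Found ${items.length} papers`);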
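
When a run covers several queries, each record carries a `searchQuery` field, so results can be split per query after download. The sketch below also drops repeated titles, on the assumption that the same paper may be returned for more than one query. `groupByQuery`, `dedupeByTitle`, and the sample items are hypothetical.

```javascript
// Hypothetical helper: bucket dataset items by the query that produced them.
function groupByQuery(items) {
  const groups = new Map();
  for (const item of items) {
    const key = item.searchQuery ?? '(unknown)';
    if (!groups.has(key)) groups.set(key, []);
    groups.get(key).push(item);
  }
  return groups;
}

// Hypothetical helper: keep the first record for each case-insensitive title.
function dedupeByTitle(items) {
  const seen = new Set();
  return items.filter((item) => {
    const key = (item.title ?? '').trim().toLowerCase();
    if (seen.has(key)) return false;
    seen.add(key);
    return true;
  });
}

// Illustrative items only, standing in for the dataset contents.
const sampleItems = [
  { title: 'Paper A', searchQuery: 'large language models' },
  { title: 'Paper B', searchQuery: 'large language models' },
  { title: 'paper a', searchQuery: 'RLHF reinforcement learning' },
];

for (const [query, group] of groupByQuery(sampleItems)) {
  console.log(`${query}: ${group.length} papers`);
}
console.log(`Unique titles: ${dedupeByTitle(sampleItems).length}`); // Unique titles: 2
```

Title matching is a deliberately simple heuristic; distinct papers with identical titles would be merged.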

Pricing

This actor charges per paper returned.

| Volume | Estimated Cost |
| --- | --- |
| 100 papers | ~$0.10 |
| 1,000 papers | ~$1.00 |
| 5,000 papers | ~$5.00 |
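
As a rough worked example, the estimates above correspond to about $1.00 per 1,000 papers, whereas the store listing quotes "from $5.00 / 1,000 results". The sketch below therefore takes the rate as a parameter instead of assuming either figure. `estimateCost` is a hypothetical helper, not part of the actor or the Apify client.

```javascript
// Hypothetical helper: estimated charge in USD for a given number of papers.
// pricePer1000 is supplied by the caller; check the current store price before relying on it.
function estimateCost(paperCount, pricePer1000) {
  if (paperCount < 0 || pricePer1000 < 0) {
    throw new RangeError('paperCount and pricePer1000 must be non-negative.');
  }
  return (paperCount * pricePer1000) / 1000;
}

// Reproduce the table rows at an assumed rate of $1.00 per 1,000 papers.
for (const count of [100, 1000, 5000]) {
  console.log(`${count} papers: ~$${estimateCost(count, 1.0).toFixed(2)}`);
}
```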

FAQ

Q: Does this require a Google account or API key?
A: No. The scraper uses Google Scholar's public data, so no credentials are needed.

Q: How accurate are the citation counts?
A: Citation counts are extracted directly from the counts Google Scholar displays, so they match what you see on the website.

Q: Can I search for papers by a specific author?
A: Yes. Use the authorUrls field with the author's name (e.g., "Yann LeCun") or their Scholar profile URL.

Q: Does it extract full paper text?
A: No. It extracts the abstract or snippet shown on Google Scholar. For full text, use the pdfUrl field when available.
