VOOZH about

URL: https://apify.com/parseforge/open-library-scraper

⇱ Open Library Scraper - Books, Authors, Subjects Β· Apify


Pricing

Pay per event

Go to Apify Store

Open Library Scraper

Comprehensive scraper for Open Library to extract books, authors, subjects, and list data from the Internet Archive’s platform. Supports multiple search types and ebook filtering, providing automated, structured access to Open Library’s extensive bibliographic collection.

Pricing

Pay per event

Rating

5.0

(1)

Developer

πŸ‘ ParseForge

ParseForge

Maintained by Community

Actor stats

1

Bookmarked

12

Total users

0

Monthly active users

25 days ago

Last modified

Share

πŸ‘ ParseForge Banner

πŸ“š Open Library Scraper

πŸš€ Extract book data from Open Library in seconds. Search by title, author, or subject with ebook filtering. No coding, no API keys required.

πŸ•’ Last updated: 2026-04-23 Β· πŸ“Š 20 fields Β· πŸ” 5 search types Β· πŸ“– Supports books, authors, subjects, lists, and full-text search

Open Library is the Internet Archive's free, open catalog of every book ever published. This scraper connects to Open Library's public API and returns structured book data including titles, authors, ISBNs, publishers, cover images, ratings, descriptions, page counts, and download links. It supports 5 search types (Books, Authors, Search Inside, Subjects, and Lists), handles pagination automatically, and exports data as JSON, CSV, or Excel.

Whether you are building a book recommendation engine, tracking ISBN availability, or conducting bibliographic research, this actor delivers structured data for up to 1,000,000 records per run for paid users. Each result includes cover images, publication dates, publisher names, available formats (PDF, EPUB, AZW3), community ratings, edition counts, and subject classifications. No manual searching, copying, or format conversion needed.

🎯 Target AudienceπŸ’‘ Use Cases
LibrariansBuild digital catalogs with cover images and ISBNs
PublishersResearch publication history and edition counts
Book bloggersGenerate reading lists with ratings and descriptions
Data scientistsAnalyze publishing trends by subject and year
App developersFeed book metadata into recommendation engines
EducatorsCurate subject-specific reading lists for courses

πŸ“‹ What the Open Library Scraper does

  • πŸ” Keyword search across books, authors, subjects, lists, and full-text content
  • πŸ“– Ebook filtering to show only books available as free digital downloads
  • πŸ–ΌοΈ Cover image extraction with URLs for small, medium, and large sizes
  • πŸ“Š Edition and rating data including community ratings and total edition counts
  • πŸ“₯ Download link collection for PDF, EPUB, and AZW3 formats when available
  • 🌐 Direct URL support to scrape any Open Library search results page

The scraper sends your query to Open Library's public API, retrieves matching records, and extracts full metadata for each item. For book searches, it collects titles, authors, ISBNs, publishers, page counts, descriptions, cover images, ratings, available formats, and download links. For author searches, it returns author profiles with their works. Every record is timestamped and includes a direct link to the Open Library entry.

πŸ’‘ Why it matters: Open Library contains metadata for millions of books, but browsing and exporting data manually is tedious. This scraper automates collection and delivers clean, structured data ready for databases, spreadsheets, or applications.


🎬 Full Demo

🚧 Coming soon...


βš™οΈ Input

FieldTypeRequiredDescription
searchQuerystringNoSearch term for books, authors, or subjects (e.g., "Space")
searchTypestringNoWhat to search: books, authors, searchInside, subjects, or lists
ebooksOnlybooleanNoShow only ebooks (only works with "books" search type)
startUrlstringNoDirect Open Library search URL (overrides search filters)
maxItemsintegerNoMax results to collect. Free: up to 10. Paid: up to 1,000,000

Example 1: Basic book search

{
"searchQuery":"Space",
"searchType":"books",
"maxItems":10
}

Example 2: Ebook-only author search

{
"searchQuery":"Isaac Asimov",
"searchType":"books",
"ebooksOnly":true,
"maxItems":50
}

⚠️ Good to Know: Use either a Start URL or search filters, not both. If you provide a Start URL, search filters are ignored. The ebooks-only filter only works with the "books" search type.


πŸ“Š Output

🧾 Schema

EmojiFieldTypeDescription
πŸ–ΌοΈcoverImagestringBook cover image URL
πŸ“titlestringFull book title
πŸ‘€authorstringAuthor name
πŸ”—detailUrlstringDirect link to the Open Library entry
πŸ“…publicationDatestringFirst publication date
🏒publisherstringPublisher name
πŸ“ŠisbnstringISBN identifier
🌍languagestringAvailable languages
πŸ“–pageCountnumberNumber of pages
πŸ“„descriptionstringFull book description
🏷️subjectsarraySubject classifications
🏷️subjectTagsarraySubject tag identifiers
πŸ–ΌοΈcoverImagesobjectCover image URLs in multiple sizes
πŸ’ΎavailableFormatsarrayEbook formats (PDF, EPUB, AZW3)
πŸ“₯downloadLinksarrayDirect download links for ebooks
⭐ratingnumberCommunity rating score
πŸ“ŠeditionCountnumberNumber of published editions
πŸ†”itemIdstringOpen Library work identifier
⏰scrapedAtstringTimestamp of when the record was collected
⚠️errorstringError message if processing failed

πŸ“¦ Sample records


✨ Why choose this Actor

FeatureDetails
πŸ” 5 search typesBooks, Authors, Search Inside, Subjects, and Lists
πŸ“– Ebook filteringShow only books available for free digital download
πŸ–ΌοΈ Cover imagesURLs in small, medium, and large sizes
πŸ“₯ Download linksDirect links to PDF, EPUB, and AZW3 files
⭐ Community ratingsRating scores and edition counts
πŸ“Š Full metadataISBNs, publishers, page counts, descriptions, and subjects
πŸ“¦ Flexible exportJSON, CSV, or Excel output for any use case

πŸ“Š Collect up to 1,000,000 book records per run with cover images, ISBNs, download links, and ratings.


πŸ“ˆ How it compares to alternatives

FeatureThis ActorManual BrowsingGeneric Scrapers
5 search typesβœ…βœ…βŒ
Ebook-only filteringβœ…βœ…βŒ
Cover image URLsβœ…βŒVaries
Download link extractionβœ…Manual❌
Bulk collection (1M+ records)βœ…βŒβŒ
Structured JSON/CSV outputβœ…βŒVaries
Scheduled runsβœ…βŒβŒ

Get structured book data at scale without manual browsing or copy-pasting.


πŸš€ How to use

  1. Create an Apify account - Sign up free with $5 credit
  2. Open the Open Library Scraper - Navigate to the actor page on Apify
  3. Enter your search query - Type a title, author name, or subject
  4. Select search type and filters - Choose Books, Authors, Subjects, etc. and enable ebook filtering if needed
  5. Click Start - The actor collects matching records and delivers structured data

⏱️ A typical run with 10 books completes in under 30 seconds.


πŸ’Ό Business use cases

πŸ“š Library Management
  • Build digital catalogs with ISBNs and cover images
  • Track edition availability across languages
  • Identify books available as free ebooks
  • Cross-reference holdings with Open Library records
πŸ“Š Publishing Research
  • Analyze edition counts and publication histories
  • Track subject trends by publication year
  • Compare publisher catalogs across genres
  • Monitor new releases in specific subject areas
πŸ–₯️ App Development
  • Feed book metadata into recommendation engines
  • Populate product databases with cover images and ISBNs
  • Build reading list applications with ratings
  • Create book discovery tools with subject filtering
πŸŽ“ Education
  • Curate subject-specific reading lists for courses
  • Find free ebook versions of required textbooks
  • Build bibliographies with full publication details
  • Track available formats for accessibility planning


🌟 Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

πŸŽ“ Research and academia

  • Empirical datasets for papers, thesis work, and coursework
  • Longitudinal studies tracking changes across snapshots
  • Reproducible research with cited, versioned data pulls
  • Classroom exercises on data analysis and ethical scraping

🎨 Personal and creative

  • Side projects, portfolio demos, and indie app launches
  • Data visualizations, dashboards, and infographics
  • Content research for bloggers, YouTubers, and podcasters
  • Hobbyist collections and personal trackers

🀝 Non-profit and civic

  • Transparency reporting and accountability projects
  • Advocacy campaigns backed by public-interest data
  • Community-run databases for local issues
  • Investigative journalism on public records

πŸ§ͺ Experimentation

  • Prototype AI and machine-learning pipelines with real data
  • Validate product-market hypotheses before engineering spend
  • Train small domain-specific models on niche corpora
  • Test dashboard concepts with live input

πŸ€– Ask an AI assistant about this scraper

Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:

❓ Frequently Asked Questions

πŸ”Œ Automating Open Library Scraper

Integrate the Open Library Scraper into your workflow using the Apify API or client libraries.

Node.js:

import{ ApifyClient }from'apify-client';
const client =newApifyClient({token:'YOUR_API_TOKEN'});
const run =await client.actor("parseforge/open-library-scraper").call({
searchQuery:"Space",
searchType:"books",
maxItems:50
});
const{ items }=await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

Python:

from apify_client import ApifyClient
client = ApifyClient("YOUR_API_TOKEN")
run = client.actor("parseforge/open-library-scraper").call(run_input={
"searchQuery":"Space",
"searchType":"books",
"maxItems":50
})
items =list(client.dataset(run["defaultDatasetId"]).iterate_items())
print(items)

Schedules: Set up recurring runs to track new book additions, monitor ebook availability, or build growing datasets of book metadata. Configure daily, weekly, or monthly schedules from the Apify Console.

πŸ”Œ Integrate with any app

  • πŸ”— Make (Integromat) - Connect book data to Google Sheets, Airtable, or any of 1,500+ apps
  • πŸ”— Zapier - Trigger workflows when new book records are collected
  • πŸ”— Slack - Get notified when a book data run completes
  • πŸ”— Airbyte - Stream book metadata into your data warehouse
  • πŸ”— GitHub - Store book datasets in repositories for version control
  • πŸ”— Google Drive - Automatically save CSV exports to shared folders

πŸ”— Recommended Actors

ActorDescription
PubMed Citation ScraperExtract publication metadata from PubMed for research analysis
Crossref ScraperExtract DOI metadata for 155M+ research publications
NASA Reports ScraperCollect technical reports from NASA's NTRS database
US Census Bureau ScraperExtract demographic and economic data from the Census Bureau
ROR ScraperCollect research organization data from the Research Organization Registry

πŸ’‘ Pro Tip: Combine the Open Library Scraper with the Crossref Scraper to match book ISBNs with DOI metadata and citation counts.


πŸ†˜ Need Help? Open our contact form and we will get back to you within 24 hours. We are happy to help with custom setups, integrations, or feature requests.


Disclaimer: This actor is not affiliated with, endorsed by, or connected to Open Library or the Internet Archive. It accesses publicly available data through Open Library's public API. Use responsibly and in accordance with applicable terms of service.

You might also like

Open Library Scraper

crawlerbros/openlibrary-scraper

Scrape Open Library, Internet Archive's open catalog of 50M+ books. Search by title/author/subject, fetch by ISBN or work ID, get full bibliographic metadata, cover images, ratings, and edition counts.

Open Library Scraper

viralanalyzer/open-library-scraper

Search and extract book data from Open Library: titles, authors, editions, subjects, and availability. Literary research at scale.

2

4.7

Open Library Scraper β€” Book Metadata in Bulk

devilscrapes/openlibrary-books-scraper

Search the Open Library API (the Internet Archive's open book catalogue) and export structured book metadata β€” title, authors, ISBNs, subjects, publish year, cover URL, edition count, OpenLibrary ID β€” to JSON or CSV. We handle pagination and retries across 30M+ works.

Open Library Books Scraper

gio21/openlibrary-books-scraper

Search and scrape books on Open Library by title, author, subject, or ISBN. Returns title, authors, first publish year, edition count, ISBNs, cover image, language, ebook access status. Pay per book returned.

Open Library Book Scraper – Cheap πŸ“šπŸŒπŸ”

scrapestorm/open-library-book-scraper---cheap

Easily collect books, authors & reading lists from Open Library Extract structured book and literary data from OpenLibrary.org, the world’s largest open book database maintained by the Internet Archive. Collect book titles, authors, subjects, editions, availability, reading lists, and more πŸ“šπŸŒ

2

Open Library Book Scraper

moving_beacon-owner1/my-actor-80

Extract book data from Open Library, the Internet Archive's open book database featuring over 20 million books, more than 10 million authors, and 40 million editions. Gather titles, authors, cover images, ISBNs, publishers, subjects, ratings, reading statistics, and more.

2