VOOZH about

URL: https://apify.com/scrapepilot/amazon-book-scraper----books-data-metadata-extractor

⇱ Amazon Book Scraper β€” Books Data & Metadata Extractor Β· Apify


πŸ‘ Amazon Book Scraper β€” Books Data & Metadata Extractor avatar

Amazon Book Scraper β€” Books Data & Metadata Extractor

Pricing

$8.99/month + usage

Go to Apify Store

Amazon Book Scraper β€” Books Data & Metadata Extractor

Scrape Amazon books data from any keyword, URL, or ASIN list. Get full book metadata β€” title, author, rating, reviews, price, publisher, pages, language, and cover image. Supports 7 Amazon marketplaces. No login. $8.99/month. 2-hour free trial.

Pricing

$8.99/month + usage

Rating

0.0

(0)

Developer

πŸ‘ Scrape Pilot

Scrape Pilot

Maintained by Community

Actor stats

0

Bookmarked

17

Total users

4

Monthly active users

2 months ago

Last modified

Share

πŸ“š Amazon Book Scraper β€” Books Data & Metadata Extractor

The most complete Amazon Book Scraper on Apify. Extract full Amazon books data from any keyword search, direct book URL, or bulk ASIN list β€” title, author, rating, reviews, price, description, publisher, publication date, page count, language, availability status, and high-resolution cover image. Supports 7 Amazon marketplaces. No login. No API key. Instant structured output.


πŸ“Œ Table of Contents


πŸ” What Is This Actor?

Amazon Book Scraper is a production-ready Apify actor that extracts complete Amazon books data and Amazon book metadata from any keyword search, direct Amazon book URL, or bulk list of ASINs β€” across 7 Amazon marketplaces.

Provide a search keyword like "machine learning" or "Stephen King", paste a direct Amazon book URL, or supply a list of ASINs β€” and receive back a clean, structured dataset for every book found: title, author, star rating, review count, price, full description, publisher, publication date, page count, language, availability status, and cover image URL.

This Amazon book scraper handles keyword search with sort options, pagination across multiple result pages, direct detail page extraction with automatic fallback URL formats, and partial record recovery when full data is unavailable β€” making it the most reliable Amazon books data tool on Apify.


πŸš€ Why Use This Amazon Book Scraper?

FeatureThis ActorManual ResearchAmazon APIOther Scrapers
Keyword search β†’ bulk books dataβœ… Paginated❌ Slow⚠️ Limited⚠️
Direct URL + ASIN bulk inputβœ… Both modesβŒβœ…βš οΈ
Full Amazon book metadataβœ… 15 fields❌⚠️ Partial⚠️
Publisher, pub date, pages, languageβœ…βŒβŒβš οΈ
Price & availability statusβœ…βœ… Manual⚠️⚠️
High-resolution cover imageβœ…βŒβš οΈβš οΈ
7 Amazon marketplacesβœ…βŒβš οΈβŒ
Sort by bestseller, new, reviewsβœ… Built-in❌❌❌
No login or API keyβœ…N/A❌ Requiredβœ…
Export to CSV / Excelβœ… Via Apify❌❌❌

Bottom line: This Amazon book scraper is the only actor that combines keyword search with sort options, direct ASIN lookup, multi-marketplace support, and a complete 15-field Amazon book metadata record β€” all in one tool with no credentials needed.


🌍 Supported Marketplaces

CodeMarketplaceDomain
usUnited Statesamazon.com
ukUnited Kingdomamazon.co.uk
deGermanyamazon.de
inIndiaamazon.in
caCanadaamazon.ca
auAustraliaamazon.com.au
jpJapanamazon.co.jp

Simply set the country input to the marketplace code. The actor automatically targets the correct Amazon domain and currency for that market.


🎯 Use Cases

πŸ“Š Book Market Research & Publishing Intelligence

  • Scrape Amazon books data for any genre or topic to analyze pricing trends, rating distributions, and review volumes
  • Identify bestselling books in a category with full metadata for competitive publishing research
  • Track new releases, publication dates, and publisher activity across Amazon marketplaces

πŸ›’ Price Monitoring & Comparison

  • Monitor Amazon book prices across multiple marketplaces (US, UK, DE, IN) for arbitrage or pricing strategy
  • Track price changes on a watchlist of ASINs by scheduling regular scraper runs
  • Compare paperback, hardcover, and Kindle pricing using the same ASIN list

πŸ€– AI & Recommendation Systems

  • Build book recommendation datasets by scraping Amazon book metadata β€” title, author, genre, description, ratings
  • Collect training data for NLP models using book descriptions, titles, and category tags
  • Extract cover images and metadata for visual book recommendation interfaces

πŸͺ E-Commerce & Affiliate Integrations

  • Populate a book directory, affiliate site, or comparison platform with structured Amazon books data
  • Automate product catalog updates by re-scraping ASINs on a schedule
  • Build structured book listings with prices, ratings, and descriptions for content sites

πŸŽ“ Academic & Library Research

  • Collect publication metadata (publisher, date, pages, language) for academic bibliographic research
  • Build structured datasets of books in specific fields for literature review automation
  • Study Amazon review and rating patterns across genres for consumer behavior research

πŸ“° Content Creation & Journalism

  • Research book topics, authors, and publication histories at scale for editorial content
  • Gather structured book metadata for book review sites, newsletters, or media platforms
  • Track Amazon bestseller rankings and new releases for publishing industry reporting

βš™οΈ Input Parameters

{
"keyword":"machine learning",
"url":"",
"urls":[],
"country":"us",
"sort_by":"bestseller",
"max_results":20,
"proxyConfiguration":{
"useApifyProxy":true,
"apifyProxyGroups":["RESIDENTIAL"]
}
}
ParameterTypeDefaultDescription
keywordstring""Search keyword β€” any book title, author name, topic, or ISBN (e.g. "python programming", "Stephen King")
urlstring""Single Amazon book URL or ASIN β€” processed as a direct detail page lookup
urlsarray or string[]Multiple Amazon book URLs or bare ASINs for bulk extraction β€” newline-separated string also accepted
countrystring"us"Target Amazon marketplace β€” "us", "uk", "de", "in", "ca", "au", "jp"
sort_bystring"relevance"Search result sort order β€” "relevance", "bestseller", "new", "avg_review"
max_resultsinteger20Maximum books to return across all input modes
proxyConfigurationobjectResidentialApify proxy config β€” residential proxy strongly recommended for Amazon

Tip: You can combine keyword and urls in the same run. Keyword search results are processed first, then direct URLs and ASINs are scraped individually. All results are merged into a single output dataset.


πŸ“‹ Output Fields

Every record from this Amazon book scraper includes complete Amazon book metadata:

FieldTypeDescriptionExample
urlstringFull Amazon book page URL"https://www.amazon.com/dp/B08XY..."
asinstringAmazon Standard Identification Number"B08XY12345"
titlestringFull book title (max 500 chars)"Deep Learning with Python"
authorstringAuthor name(s)"FranΓ§ois Chollet"
ratingfloatAverage star rating4.6
reviews_countintegerTotal customer review count2841
pricestringListed price with currency symbol"$39.99"
descriptionstringFull book description (max 2000 chars)"The definitive guide to..."
imagestringHigh-resolution cover image URL"https://images-na.ssl-images-amazon.com/..."
publisherstringPublisher name"Manning Publications"
pub_datestringPublication date"October 14, 2021"
pages_countintegerTotal page count504
languagestringBook language"English"
statusstringAvailability status"available", "out_of_stock", "unavailable"
scraped_atstringExtraction timestamp (ISO 8601 UTC)"2024-03-15T10:30:00Z"

πŸ“¦ Example Input & Output

Input β€” keyword search:

{
"keyword":"deep learning",
"country":"us",
"sort_by":"bestseller",
"max_results":3
}

Output (one record):

{
"url":"https://www.amazon.com/dp/B08XY12345/",
"asin":"B08XY12345",
"title":"Deep Learning with Python, Second Edition",
"author":"FranΓ§ois Chollet",
"rating":4.6,
"reviews_count":2841,
"price":"$39.99",
"description":"The definitive guide to deep learning using Python and Keras. Revised and updated to cover the latest deep learning techniques...",
"image":"https://images-na.ssl-images-amazon.com/images/I/81abc123.jpg",
"publisher":"Manning Publications",
"pub_date":"October 14, 2021",
"pages_count":504,
"language":"English",
"status":"available",
"attempt":1,
"scraped_at":"2024-03-15T10:30:00Z"
}

πŸ’° Pricing & Free Trial

PlanPriceIncludes
Free Trial$02 hours full access β€” no credit card required
Monthly$8.99 / monthUnlimited runs, all input modes, all 7 marketplaces

Everything included in every plan:

  • βœ… Keyword search with pagination and sort options
  • βœ… Direct URL and bulk ASIN extraction
  • βœ… Complete Amazon book metadata β€” 15 fields per book
  • βœ… 7 Amazon marketplaces (US, UK, DE, IN, CA, AU, JP)
  • βœ… Sort by relevance, bestseller, newest, or average review
  • βœ… Availability status per book
  • βœ… High-resolution cover image URL
  • βœ… JSON + CSV + Excel export from Apify dataset
  • βœ… Scheduled runs for automated price and metadata monitoring

Start your 2-hour free trial now β€” no credit card needed. Click Try for free at the top of this page.


⚑ Performance & Limits

ModeCountEstimated Time
Single book URL or ASIN1~8–20 seconds
Keyword search20 books~3–6 minutes
Bulk ASINs20 books~4–8 minutes
Keyword search (paginated)50 books~12–20 minutes
  • Results are pushed to the Apify dataset in real time as each book is processed
  • Partial records are saved for books where full detail page extraction is blocked
  • Per-URL retry logic with automatic alternate URL format fallback
  • Residential proxy strongly recommended for reliable Amazon access at any volume

❓ FAQ

Q: Can I scrape books from multiple Amazon marketplaces in one run? A: Each run targets one marketplace via the country input. For multi-marketplace scraping, run the actor multiple times with different country values, or use Apify's task scheduling to run them in parallel.

Q: Can I input bare ASINs without a full URL? A: Yes. The urls field accepts bare 10-character ASINs (e.g. B08XY12345) directly β€” the actor automatically builds the correct Amazon URL for the selected marketplace.

Q: What does status: "partial_search_only" mean? A: When a book detail page cannot be fetched (blocked or slow), the actor saves a partial record using the data available from the search results page β€” title, author, rating, reviews, price, and image β€” rather than losing the record entirely.

Q: Why is price null for some books? A: Some Amazon book listings do not display a public price β€” this is common for books sold exclusively through third-party sellers, pre-order titles, or marketplace-only listings. The field returns null when no price is present on the page.

Q: Can I sort search results by bestseller or newest release? A: Yes. Use the sort_by parameter: "bestseller" for Amazon bestseller rank order, "new" for newest releases first, "avg_review" for highest rated, or "relevance" for default search relevance.

Q: Is residential proxy required? A: Amazon actively blocks datacenter IP addresses. A residential proxy makes requests appear to come from regular home internet connections, which significantly improves reliability β€” especially for keyword searches and high-volume ASIN lookups. It is strongly recommended.

Q: Can I export results to Excel or CSV? A: Yes. All results are pushed to the Apify dataset, which can be exported to JSON, CSV, Excel, and more directly from the Apify Console after each run completes.

Q: What happens if Amazon blocks a specific book page? A: The actor automatically retries with alternate URL formats, rotates browser fingerprints, and applies backoff delays. If all attempts fail, a partial record with available data is saved and the run continues with the remaining books.


πŸ“œ Changelog

v2.0.0 (Current)

  • βœ… Three input modes: keyword search, direct URL, and bulk ASIN list
  • βœ… Full Amazon book metadata β€” 15 fields per record
  • βœ… Publisher, publication date, page count, and language extraction
  • βœ… Availability status detection per book
  • βœ… High-resolution cover image URL
  • βœ… 7 Amazon marketplace support (US, UK, DE, IN, CA, AU, JP)
  • βœ… Sort options: relevance, bestseller, newest, average review
  • βœ… Automatic pagination across multiple search result pages
  • βœ… Alternate URL format fallback for 404 and blocked pages
  • βœ… Partial record recovery from search results when detail page fails
  • βœ… Proxy rotation support for high-volume runs
  • βœ… Real-time dataset push as each book is processed

v1.0.0

  • Initial release with basic keyword search and core field extraction

🏷️ Tags

amazon book scraper amazon books data amazon book metadata amazon scraper book data extractor amazon asin scraper book price tracker amazon search scraper book metadata amazon product scraper book research tool amazon bestseller scraper


βš–οΈ Legal & Terms of Use

This actor accesses publicly visible Amazon book listing pages in the same way a regular user browses the Amazon website.

Please note:

  • Use extracted Amazon books data only for lawful purposes β€” research, price monitoring, content creation, affiliate marketing, and academic use are common legitimate applications
  • Do not use this Amazon book scraper to systematically copy Amazon's catalog for redistribution or to build a competing retail platform
  • Respect Amazon's Terms of Service β€” do not use this tool at volumes designed to overload or disrupt Amazon's infrastructure
  • Book descriptions, cover images, and metadata are Amazon's intellectual property β€” always credit the source appropriately in your application
  • The actor developer is not responsible for how extracted Amazon book metadata is used

🀝 Support & Feedback

  • Bug report? Contact us via the Apify actor page
  • Feature request? Post in the Apify Community forum
  • Loving it? Please leave a ⭐ review β€” it helps other users find this actor!

Built with ❀️ on Apify
The most complete Amazon Book Scraper β€” full metadata, 7 marketplaces, keyword search & bulk ASIN

πŸ’° $8.99/month Β· πŸ†“ 2-hour free trial Β· No credit card required

You might also like

Amazon book scraper

datapilot/amazon-book-scraper

Amazon Book Scraper uses residential proxies to extract book details from Amazon product pages. It collects title, author, price, rating, reviews, ASIN, publisher, publication date, pages, language, description, and image. Outputs structured JSON for e-commerce analysis and research.

21

3.0

Amazon Scraper

dtrungtin/amazon-scraper

Extract structured product data from [Amazon.com](https://www.amazon.com) at scale. Provide one or more Amazon search or category URLs and this Actor will crawl through all result pages, visit each product listing, and return a clean dataset with prices, images, reviews, dimensions, and more.

AbeBooks Scraper - Used & Rare Books

lulzasaur/abebooks-scraper

Scrape AbeBooks used and rare book listings. Extract titles, authors, prices, ISBNs, publishers, conditions, editions, and formats. Search by keyword with pagination support. Perfect for book collectors, dealers, and price comparison.

Amazon Bestsellers Scraper

automation-lab/amazon-bestsellers-scraper

Scrape Amazon Best Sellers rankings from any category. Extract product names, prices, ratings, reviews, ASIN codes, and thumbnails. 10 marketplaces. Export to JSON, CSV, Excel, or API.

πŸ‘ User avatar

Stas Persiianenko

53

Amazon Scraper

automation-lab/amazon-scraper

Scrape Amazon search results for price monitoring: current/list prices, ratings, reviews, seller info, Prime status, availability, and images across 10 marketplaces. Export to JSON, CSV, Excel.

πŸ‘ User avatar

Stas Persiianenko

156

Amazon BSR Scraper

marketplace-scrapers/amazon-bsr-scraper

Scrape Amazon BSR rank, Buy Box winner, price, and offer count for any list of ASINs across the US, UK, DE, and JP marketplaces. Time-series snapshots accumulate in a named Apify dataset keyed by date. PA-API sunset replacement; no seller account needed; bring your own cookies.

Market Place Scrapers

9

Amazon Product Research Scraper β€” Jungle Scout Alternative

samstorm/amazon-competitor-research-scraper

Bulk Amazon product research by ASIN. Get BSR rank, price, reviews, seller count, FBA status, and more. Supports 10 Amazon marketplaces.

15

Amazon Product Scraper - Prices, ASIN, BSR & Reviews

harvestlab/amazon-scraper

Amazon product scraper for prices, ASINs, ratings, review counts, BSR, availability, sellers, images, and rank alerts across 19 marketplaces. Built for seller research, competitor monitoring, price tracking, and MCP connector alerts.

Book & Product Metadata Scraper Pro: Amazon, GBooks, OpenLib

scrapepilot/google-search-scraper

Scrape complete book data from Amazon, Google Books, Open Library and WorldCat. Accepts ISBN, ASIN, Amazon URL or keyword. Returns price, rating, reviews, description, cover image and all metadata. Exports CSV and Excel

Magazineluiza Product Search Scraper

stealth_mode/magazineluiza-product-search-scraper

Extract product listings from MagazineLuiza.com.br search pages. Gather prices, ratings, seller information, shipping costs, and product details from Brazil's top e-commerce platform. Ideal for price monitoring, market research, and competitive intelligence.