Amazon book scraper

Pricing

$8.00/month + usage

Amazon book scraper

Amazon Book Scraper uses residential proxies to extract book details from Amazon product pages. It collects title, author, price, rating, reviews, ASIN, publisher, publication date, pages, language, description, and image. Outputs structured JSON for e-commerce analysis and research.

Pricing

$8.00/month + usage

Rating

3.0

(1)

Developer

👁 Data Pilot

Data Pilot

Maintained by Community

Actor stats

Bookmarked

Total users

Monthly active users

5.1 hours

Issues response

3 months ago

Last modified

Overview

The Amazon Book Scraper is an Apify Actor that extracts detailed book metadata directly from Amazon product pages using browser automation. Provide one or more Amazon book URLs and the actor returns structured data including title, author, price, rating, description, publisher, and cover image. Whether you're building an Amazon book database, monitoring Amazon book prices, or conducting publishing research, this actor delivers accurate Amazon book data efficiently.

With residential proxy support and anti-detection techniques, the Amazon Book Scraper ensures reliable and undetected access to Amazon book pages — even on slow or restricted networks.

Features

Full Amazon Book Extraction – Scrapes detailed Amazon book data including title, author, price, rating, reviews count, description, publisher, and more.
Playwright Browser Automation – Uses a real Chromium browser to render JavaScript-heavy Amazon book pages accurately.
Anti-Detection – Rotates user agents and disables automation fingerprints to avoid Amazon bot detection.
Retry Logic – Automatically retries up to 3 times with multiple navigation strategies for reliable Amazon book scraping.
ASIN Extraction – Automatically extracts the Amazon ASIN from the book URL.
Proxy Support – Uses Apify residential proxies to bypass Amazon IP restrictions.
Anti-Blocking Delays – Adds random delays between requests to mimic human browsing behavior.
Error Handling – Logs errors per URL and continues processing remaining Amazon book pages.
Dataset Integration – Automatically pushes all Amazon book data to your Apify dataset for easy export.

How It Works

Input – Provide a list of Amazon book page URLs.
Browser Launch – The actor launches a headless Chromium browser with anti-detection settings.
Page Navigation – It navigates to each Amazon book URL using multiple fallback strategies (domcontentloaded → load → commit).
Data Extraction – Extracts Amazon book fields using targeted CSS selectors.
Build Output – Structures all extracted data into a clean record and pushes it to the dataset.
Repeat – Processes all URLs with random delays between requests.

Input

Field	Type	Default	Description
`urls`	String or Array	Required	Amazon book page URLs, one per line or as a JSON array.
`useApifyProxy`	Boolean	`true`	Whether to use Apify proxy for Amazon book scraping.
`apifyProxyGroups`	Array of strings	`["RESIDENTIAL"]`	Proxy groups to use (e.g., `["RESIDENTIAL"]`).

Example input:

{
"urls":[
"https://www.amazon.com/dp/0735211299",
"https://www.amazon.com/dp/0062316117"
],
"useApifyProxy":true,
"apifyProxyGroups":["RESIDENTIAL"]
}

Output

Each Amazon book record is pushed as a separate dataset item with the following fields:

Field	Type	Description
`url`	string	Original Amazon book page URL.
`asin`	string	Amazon Standard Identification Number (ASIN).
`title`	string	Amazon book title.
`author`	string	Amazon book author name.
`rating`	string	Average Amazon book rating (e.g., `"4.7 out of 5 stars"`).
`reviews_count`	string	Total number of Amazon book customer reviews.
`price`	string	Amazon book price (e.g., `"$14.99"`).
`image`	string	Direct URL to the Amazon book cover image.
`description`	string	Amazon book description/synopsis.
`publisher`	string	Amazon book publisher name.
`pub_date`	string	Amazon book publication date.
`pages_count`	string	Number of pages in the Amazon book.
`language`	string	Language of the Amazon book.
`status`	string	`"success"` or `"error"`.
`attempt`	integer	Number of attempts made to scrape this Amazon book page.
`scraped_at`	string	ISO 8601 UTC timestamp of when the Amazon book was scraped.

Example output (success):

{
"url":"https://www.amazon.com/dp/0735211299",
"asin":"0735211299",
"title":"Atomic Habits",
"author":"James Clear",
"rating":"4.8 out of 5 stars",
"reviews_count":"112,000 ratings",
"price":"$14.99",
"image":"https://m.media-amazon.com/images/I/513Y5o-DYtL.jpg",
"description":"No matter your goals, Atomic Habits offers a proven framework...",
"publisher":"Avery",
"pub_date":"October 16, 2018",
"pages_count":"320",
"language":"English",
"status":"success",
"attempt":1,
"scraped_at":"2025-03-22T12:34:56Z"
}

Example output (error):

{
"url":"https://www.amazon.com/dp/XXXXXXXXXX",
"status":"error",
"message":"All navigation strategies timed out",
"attempt":3
}

Use Cases

Amazon Book Databases – Build and maintain a structured catalog of Amazon book listings.
Price Monitoring – Track Amazon book prices over time for deals and trends.
Publishing Research – Analyze Amazon book ratings, reviews, and publisher data.
E-commerce Enrichment – Enrich product listings with Amazon book descriptions and cover images.
Competitor Analysis – Monitor Amazon book rankings and pricing for competitive intelligence.
Academic Research – Collect structured Amazon book data for literature or data science projects.
Recommendation Engines – Power Amazon book recommendation systems using ratings and metadata.

Quick Start

Open on Apify – Visit the actor page and click Try for free.
Set Input – Paste your Amazon book page URLs into the urls field.
Enable Proxy – Keep useApifyProxy enabled for reliable Amazon book scraping.
Run the Actor – Start the run and monitor progress in the logs.
Download Results – Export the Amazon book dataset as JSON, CSV, or Excel once finished.

Technical Stack

Browser Automation – (Chromium) for JavaScript-rendered Amazon book pages
Anti-Detection – Random user agents, disabled webdriver fingerprint
HTTP Navigation – Multi-strategy page loading (domcontentloaded, load, commit)
Proxy – Apify Proxy (residential) for bypassing Amazon restrictions
Platform – Apify Actor — serverless, scalable, integrated with Dataset and Key-Value Store

Related Tools

Actor	Description
Goodreads Scraper	Extracts Amazon book ratings, reviews, and metadata from Goodreads.
Book Metadata Scraper	Extracts rich Amazon book metadata from the Open Library database.
Google Books Scraper	Fetches Amazon book metadata and previews via the Google Books API.
ISBN Lookup Tool	Looks up detailed Amazon book info by ISBN from multiple data sources.
Book Price Comparator	Compares Amazon book prices across major online retailers.

Changelog

v1.0.0 – Initial Release

-based Amazon book page scraping
Title, author, price, rating, and reviews extraction
ASIN extraction from Amazon book URLs
Publisher, publication date, and page count extraction
Cover image URL extraction
Residential proxy configuration support
Anti-detection user agent rotation
Retry logic with multiple navigation strategies
Random anti-blocking delays
Dataset integration with error handling

Pricing

Free for basic usage on Apify (up to certain compute limits).
Paid plans available for higher volume, priority support, and longer runs.
Proxy credits consumed if residential proxies are enabled.

Support & Feedback

Issues & Ideas – Open a ticket on the Apify Actor issue tracker.
Documentation – Visit Apify Docs for platform guides.
Scraping Notes – Always use residential proxies when scraping Amazon book pages to avoid blocks.

Disclaimer: This actor scrapes publicly visible data from Amazon book pages. Please ensure your usage complies with Amazon's terms of service. This actor is intended for research and informational purposes only.

👁 Amazon Book Scraper — Books Data & Metadata Extractor avatar

Amazon Book Scraper — Books Data & Metadata Extractor

scrapepilot/amazon-book-scraper----books-data-metadata-extractor

Scrape Amazon books data from any keyword, URL, or ASIN list. Get full book metadata — title, author, rating, reviews, price, publisher, pages, language, and cover image. Supports 7 Amazon marketplaces. No login. $8.99/month. 2-hour free trial.

👁 User avatar

Scrape Pilot

👁 AbeBooks Scraper - Used & Rare Books avatar

AbeBooks Scraper - Used & Rare Books

lulzasaur/abebooks-scraper

Scrape AbeBooks used and rare book listings. Extract titles, authors, prices, ISBNs, publishers, conditions, editions, and formats. Search by keyword with pagination support. Perfect for book collectors, dealers, and price comparison.

👁 User avatar

lulz bot

👁 KDP & Amazon Book Niche Analyzer avatar

KDP & Amazon Book Niche Analyzer

sarginstudio/kdp-amazon-book-niche-analyzer

Find profitable KDP niches for low content book publishing. Enter any keyword and get scored Amazon book results ranked by competition, price, and ratings. Perfect for self-publishing authors and print-on-demand creators researching niche opportunities. Automated niche research for KDP success.

👁 User avatar

Osman Sargin

👁 Amazon Scraper avatar

Amazon Scraper

dtrungtin/amazon-scraper

Extract structured product data from [Amazon.com](https://www.amazon.com) at scale. Provide one or more Amazon search or category URLs and this Actor will crawl through all result pages, visit each product listing, and return a clean dataset with prices, images, reviews, dimensions, and more.

👁 User avatar

Tin

👁 Amazon Search Scraper avatar

Amazon Search Scraper

axesso_data/amazon-search-scraper

Our Amazon Search Results Scraper extracts real-time search results from Amazon, including product titles, prices, ratings and other key details. Download the data in formats such as JSON, CSV, Excel, XML, or HTML. Fast, accurate, and designed for detailed analysis and actionable insights.

👁 User avatar

AxCrawler

785

4.5

👁 Amazon Bestsellers Scraper avatar

Amazon Bestsellers Scraper

automation-lab/amazon-bestsellers-scraper

Scrape Amazon Best Sellers rankings from any category. Extract product names, prices, ratings, reviews, ASIN codes, and thumbnails. 10 marketplaces. Export to JSON, CSV, Excel, or API.

👁 User avatar

Stas Persiianenko

👁 Amazon Scraper avatar

Amazon Scraper

automation-lab/amazon-scraper

Scrape Amazon search results for price monitoring: current/list prices, ratings, reviews, seller info, Prime status, availability, and images across 10 marketplaces. Export to JSON, CSV, Excel.

👁 User avatar

Stas Persiianenko

155

👁 Amazon Bestsellers Scraper avatar

Amazon Bestsellers Scraper

junglee/amazon-bestsellers

Scrape the Amazon Best Sellers categories and extract details on top 100 most popular items on Amazon. Download product name, price, URL, and thumbnail image. Best used on .com, .co.uk, .de, .fr, .es, and .it domains. Download your data in various formats: HTML table, JSON, CSV, Excel, and more.

👁 User avatar

Junglee

3.7K

5.0

👁 Amazon Product Details Scraper avatar

Amazon Product Details Scraper

igview-owner/amazon-product-data-scraper

Fetch Amazon product details from ASINs or product URLs in one click. Get title, price, rating, images, availability and key product info for multiple marketplaces. Perfect for product research, catalog enrichment, price tracking and analytics.

👁 User avatar

Sachin Kumar Yadav

243

1.0

👁 Amazon ASINs Scraper avatar

Amazon ASINs Scraper

junglee/amazon-asins-scraper

Gets you product data from Amazon. Unofficial API. Scrapes and downloads product information without using the Amazon API, including reviews, prices, descriptions.

👁 User avatar

Junglee

1.7K

5.0

URL: https://apify.com/datapilot/amazon-book-scraper

⇱ Amazon book scraper · Apify

Amazon book scraper

Overview

Features

How It Works

Input

Output

Use Cases

Quick Start

Technical Stack

Related Tools

Changelog

Pricing

Support & Feedback

You might also like

Amazon Book Scraper — Books Data & Metadata Extractor

AbeBooks Scraper - Used & Rare Books

KDP & Amazon Book Niche Analyzer

Amazon Scraper

Amazon Search Scraper

Amazon Bestsellers Scraper

Amazon Scraper

Amazon Bestsellers Scraper

Amazon Product Details Scraper

Amazon ASINs Scraper