VOOZH about

URL: https://apify.com/tom2turnt/product-extractor

โ‡ฑ Product Data Extractor (price, stock, rating) ยท Apify


๐Ÿ‘ Product Data Extractor (price, stock, rating) avatar

Product Data Extractor (price, stock, rating)

Pricing

Pay per usage

Go to Apify Store

Product Data Extractor (price, stock, rating)

Extract clean, normalized product data โ€” name, price, currency, availability, brand, rating, SKU/GTIN, image โ€” from public product pages via JSON-LD, microdata, and OpenGraph. HTML-only, fast, structured output.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

๐Ÿ‘ Tommy G

Tommy G

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

0

Monthly active users

13 days ago

Last modified

Categories

Share

Product Data Extractor (Apify Actor)

Give it public product page URLs, get back clean, normalized product data โ€” name, price, currency, availability, in-stock, brand, rating, SKU/GTIN/MPN, image โ€” pulled from JSON-LD, microdata, and OpenGraph. HTML-only (no headless browser) so it's fast and cheap. Ideal for price monitoring, competitor tracking, catalog enrichment, and feed building.

Why it's useful (and money-first)

Price/stock monitoring is one of the most-demanded scraping jobs. This actor turns messy product markup (which comes in dozens of shapes โ€” Offer vs AggregateOffer, price as string vs number, 1.299,00 vs $1,299.00, availability URLs vs text) into one stable, tidy record.

Input

{"startUrls":[{"url":"https://scrapeme.live/shop/Bulbasaur/"}],"maxConcurrency":5,"maxPages":100}

maxPages capped at 200, maxConcurrency at 20 (cost guard).

Output โ€” one STABLE record per URL (ok and error rows share the shape)

{
"status":"ok",
"requested_url":"https://shop.example.com/widget",
"final_url":"https://shop.example.com/widget",
"http_status":200,
"found":true,
"source":"json-ld",
"name":"Acme Widget",
"brand":"Acme",
"price":19.99,
"currency":"USD",
"availability":"InStock",
"in_stock":true,
"rating_value":4.5,
"rating_count":231,
"sku":"AW-1",
"gtin":"0123456789012",
"mpn":null,
"image":"https://cdn.example.com/w.jpg",
"description":"...",
"offers_count":1,
"extracted_at":"2026-05-29T..."
}

source is json-ld | microdata | opengraph | none. found:false means no product data was present in the page markup (e.g. a blog or a JS-rendered shop). Failed fetches return the same keys with status:"error" + error.

Run locally / test

npminstall
npmtest# unit tests on the pure extractor (node:test)

Publish to Apify (account-holder's step)

npminstall-g apify-cli
apify login # free Apify account
apify push # from this directory

Keep it free initially; enable pricing later via the adult account-holder once it shows repeat organic usage and clears a margin gate.

Notes / safety

  • SSRF-guarded (scheme + private/metadata IP block + redirect re-check), robots-respecting, rate-limited, cost-capped โ€” all via the shared src/lib/actor_runner.js.
  • Stores only derived product fields โ€” no raw page bodies / PII.
  • HTML-only: client-rendered shops that inject product JSON via JS will return found:false (no server-side markup to read). Core logic in src/extract.js (pure, unit-tested).

You might also like

Local Business Data Extractor (NAP, hours, geo)

tom2turnt/localbusiness-extractor

Extract normalized local-business data โ€” name, type, phone, email, full address, lat/long, opening hours, price range, rating โ€” from public pages via JSON-LD (LocalBusiness subtypes, Organization), microdata, and OpenGraph. HTML-only, fast, structured ok/error output.

Event Data Extractor (date, venue, tickets, performers)

tom2turnt/newtype-extractor

Extract clean, normalized event data โ€” name, start/end date, venue & address, geo, online/offline mode, performers, ticket price & availability โ€” from public event pages via JSON-LD (schema.org/Event), microdata, and OpenGraph. HTML-only, fast, structured output.

Walmart Product Scraper

yasmany.casanova/walmart-product-scraper

Extract full Walmart US product data by product ID: price, original price, discount, rating, reviews, seller, stock, fulfillment options, images, GTIN and specifications โ€” as clean, structured JSON.

๐Ÿ‘ User avatar

Yasmany Grijalba Casanova

2

Advanced Amazon Product Scraper

scrapeai/advanced-amazon-product-scraper

The scraper collects detailed product information including product title, price, rating, number of reviews, product URL, image URL, brand, availability status, and other key details from the product page, and exports the data in structured JSON format.

Review & Rating Extractor (aggregate + individual)

tom2turnt/review-extractor

Extract the aggregate rating (value, count, best) AND individual reviews (author, rating, date, title, body) from public product, business, and article pages via JSON-LD Review and AggregateRating. HTML-only, fast, structured output with clean ok/error parity.

Competitor Price Tracker - Amazon, Shopify & More

forward_workstation/competitor-price-tracker

Track competitor product prices across Amazon, Shopify, WooCommerce, and any e-commerce site. Extracts price, currency, availability via JSON-LD, OpenGraph, and CSS heuristics.

๐Ÿ‘ User avatar

Forward Workstation

16

4.8

Amazon Product Scraper โ€” Price, Rating, Seller & ASIN Data

jaybird/amazon-product-data-scraper

Scrape Amazon product pages into clean JSON: price, rating, reviews, availability, seller, images, specs. Pay per result โ€” $1 per 1,000 products scraped.

๐Ÿ‘ User avatar

Jaybird Technologies

352