👁 Product Data Extractor (price, stock, rating) avatar

Product Data Extractor (price, stock, rating)

Pricing

Pay per usage

👁 Product Data Extractor (price, stock, rating)

Product Data Extractor (price, stock, rating)

Extract clean, normalized product data — name, price, currency, availability, brand, rating, SKU/GTIN, image — from public product pages via JSON-LD, microdata, and OpenGraph. HTML-only, fast, structured output.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

👁 Tommy G

Tommy G

Maintained by Community

Actor stats

Bookmarked

Total users

Monthly active users

13 days ago

Last modified

Product Data Extractor (Apify Actor)

Give it public product page URLs, get back clean, normalized product data — name, price, currency, availability, in-stock, brand, rating, SKU/GTIN/MPN, image — pulled from JSON-LD, microdata, and OpenGraph. HTML-only (no headless browser) so it's fast and cheap. Ideal for price monitoring, competitor tracking, catalog enrichment, and feed building.

Why it's useful (and money-first)

Price/stock monitoring is one of the most-demanded scraping jobs. This actor turns messy product markup (which comes in dozens of shapes — Offer vs AggregateOffer, price as string vs number, 1.299,00 vs $1,299.00, availability URLs vs text) into one stable, tidy record.

Input

{"startUrls":[{"url":"https://scrapeme.live/shop/Bulbasaur/"}],"maxConcurrency":5,"maxPages":100}

maxPages capped at 200, maxConcurrency at 20 (cost guard).

Output — one STABLE record per URL (ok and error rows share the shape)

{
"status":"ok",
"requested_url":"https://shop.example.com/widget",
"final_url":"https://shop.example.com/widget",
"http_status":200,
"found":true,
"source":"json-ld",
"name":"Acme Widget",
"brand":"Acme",
"price":19.99,
"currency":"USD",
"availability":"InStock",
"in_stock":true,
"rating_value":4.5,
"rating_count":231,
"sku":"AW-1",
"gtin":"0123456789012",
"mpn":null,
"image":"https://cdn.example.com/w.jpg",
"description":"...",
"offers_count":1,
"extracted_at":"2026-05-29T..."
}

source is json-ld | microdata | opengraph | none. found:false means no product data was present in the page markup (e.g. a blog or a JS-rendered shop). Failed fetches return the same keys with status:"error" + error.

Run locally / test

npminstall
npmtest# unit tests on the pure extractor (node:test)

Publish to Apify (account-holder's step)

npminstall-g apify-cli
apify login # free Apify account
apify push # from this directory

Keep it free initially; enable pricing later via the adult account-holder once it shows repeat organic usage and clears a margin gate.

Notes / safety

SSRF-guarded (scheme + private/metadata IP block + redirect re-check), robots-respecting, rate-limited, cost-capped — all via the shared src/lib/actor_runner.js.
Stores only derived product fields — no raw page bodies / PII.
HTML-only: client-rendered shops that inject product JSON via JS will return found:false (no server-side markup to read). Core logic in src/extract.js (pure, unit-tested).

👁 Local Business Data Extractor (NAP, hours, geo) avatar

Local Business Data Extractor (NAP, hours, geo)

tom2turnt/localbusiness-extractor

Extract normalized local-business data — name, type, phone, email, full address, lat/long, opening hours, price range, rating — from public pages via JSON-LD (LocalBusiness subtypes, Organization), microdata, and OpenGraph. HTML-only, fast, structured ok/error output.

👁 User avatar

Tommy G

Structured Data Extractor - JSON-LD, OpenGraph, Meta

piposlab/structured-data-extractor

Extract JSON-LD, OpenGraph, Twitter cards, microdata and meta tags from any URL. For SEO audits, AI dataset building and competitor research. No API key.

👁 User avatar

Alejandro Bufarini

👁 Event Data Extractor (date, venue, tickets, performers) avatar

Event Data Extractor (date, venue, tickets, performers)

tom2turnt/newtype-extractor

Extract clean, normalized event data — name, start/end date, venue & address, geo, online/offline mode, performers, ticket price & availability — from public event pages via JSON-LD (schema.org/Event), microdata, and OpenGraph. HTML-only, fast, structured output.

👁 User avatar

Tommy G

👁 Walmart Product Scraper avatar

Walmart Product Scraper

yasmany.casanova/walmart-product-scraper

Extract full Walmart US product data by product ID: price, original price, discount, rating, reviews, seller, stock, fulfillment options, images, GTIN and specifications — as clean, structured JSON.

👁 User avatar

Yasmany Grijalba Casanova

Structured Data Extractor - JSON-LD, OpenGraph, Microdata

gratifying_graph/structured-data-extractor

Extract every piece of structured data from any URL: JSON-LD blocks by schema.org type, OpenGraph and Twitter Card tags, microdata items, canonical and meta basics. Batch over URL lists or call synchronously from AI agents.

👁 User avatar

Jimmy A

👁 Advanced Amazon Product Scraper avatar

Advanced Amazon Product Scraper

scrapeai/advanced-amazon-product-scraper

The scraper collects detailed product information including product title, price, rating, number of reviews, product URL, image URL, brand, availability status, and other key details from the product page, and exports the data in structured JSON format.

👁 User avatar

ScrapeAI

5.0

👁 Review & Rating Extractor (aggregate + individual) avatar

Review & Rating Extractor (aggregate + individual)

tom2turnt/review-extractor

Extract the aggregate rating (value, count, best) AND individual reviews (author, rating, date, title, body) from public product, business, and article pages via JSON-LD Review and AggregateRating. HTML-only, fast, structured output with clean ok/error parity.

👁 User avatar

Tommy G

Amazon Product Scraper

patel_dev_automation/amazon-product-scraper

Extract structured Amazon product data - title, ASIN, price, brand, rating, reviews, images, description, and attributes - from search, category, or product URLs. Auto-pagination, full detail extraction, and clean JSON output.

👁 User avatar

Dev Patel

5.0

👁 Competitor Price Tracker - Amazon, Shopify & More avatar

Competitor Price Tracker - Amazon, Shopify & More

forward_workstation/competitor-price-tracker

Track competitor product prices across Amazon, Shopify, WooCommerce, and any e-commerce site. Extracts price, currency, availability via JSON-LD, OpenGraph, and CSS heuristics.

👁 User avatar

Forward Workstation

4.8

👁 Amazon Product Scraper — Price, Rating, Seller & ASIN Data avatar

Amazon Product Scraper — Price, Rating, Seller & ASIN Data

jaybird/amazon-product-data-scraper

Scrape Amazon product pages into clean JSON: price, rating, reviews, availability, seller, images, specs. Pay per result — $1 per 1,000 products scraped.

👁 User avatar

Jaybird Technologies

352

URL: https://apify.com/tom2turnt/product-extractor