
URL: https://apify.com/v0iddo/shopify-store-intelligence

Shopify Store Intelligence: Catalog, Pricing, Vendors · Apify



Shopify Store Intelligence: Catalog + Pricing

Pricing

$1.00 / 1,000 Shopify products extracted


Snapshot any Shopify storefront's full public catalog. One row per product with title, vendor, type, tags, all variants (SKU, price, compare_at_price, available), images. Skips non-Shopify domains gracefully. Source: /products.json + /collections.json (public, no auth).


Rating: 0.0 (0)

Developer: vøiddo (maintained by Community)

Actor stats: 0 bookmarked, 2 total users, 1 monthly active user

Last modified: 15 days ago

Shopify Store Intelligence: Catalog + Pricing Snapshot

Pull the full public catalog of any Shopify-powered storefront (all products, variants, prices, SKUs, vendor portfolio, and stock signals) in one run. No login, no HTML scraping, no headless browser. Just the open /products.json endpoint that Shopify exposes for every store by design.

What you get

One row per product (default), or one row per variant (SKU) if you'd rather join in a spreadsheet. Every row carries:

  • Identifiers: productId, handle, url, storeDomain
  • Catalog metadata: title, vendor, productType, tags, imageCount
  • Pricing signal: minPrice, maxPrice, anyOnSale, per-variant price + compareAtPrice
  • Stock signal: variantsInStock / variantsTotal, per-variant available
  • Timestamps: publishedAt, createdAt, updatedAt
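As a rough illustration of how the row fields above could be derived from one raw /products.json entry, here is a minimal sketch. The function name `build_row` is hypothetical, and the rule that a variant counts as "on sale" when its compare_at_price exceeds its price is an assumption, not a documented behavior of this actor.

```python
from decimal import Decimal


def _is_on_sale(variant: dict) -> bool:
    # Assumption: "on sale" means compare_at_price is set and above price.
    price = variant.get("price")
    compare_at = variant.get("compare_at_price")
    if not price or not compare_at:
        return False
    return Decimal(compare_at) > Decimal(price)


def build_row(product: dict, store_domain: str) -> dict:
    """Map one raw product object to a flat row (hypothetical helper)."""
    variants = product.get("variants", [])
    prices = [Decimal(v["price"]) for v in variants if v.get("price")]
    return {
        "productId": product.get("id"),
        "handle": product.get("handle"),
        "url": f"https://{store_domain}/products/{product.get('handle')}",
        "storeDomain": store_domain,
        "title": product.get("title"),
        "vendor": product.get("vendor"),
        "productType": product.get("product_type"),
        "tags": product.get("tags", []),
        "imageCount": len(product.get("images", [])),
        "minPrice": float(min(prices)) if prices else None,
        "maxPrice": float(max(prices)) if prices else None,
        "anyOnSale": any(_is_on_sale(v) for v in variants),
        "variantsInStock": sum(1 for v in variants if v.get("available")),
        "variantsTotal": len(variants),
        "publishedAt": product.get("published_at"),
        "createdAt": product.get("created_at"),
        "updatedAt": product.get("updated_at"),
    }
```

Prices are parsed with Decimal because the feed serves them as strings; the sketch converts to float only for the final min/max fields.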

Plus a per-store summary in OUTPUT with vendor distribution (top 10), product types breakdown, products on sale / in stock, and an optional /collections.json snapshot.
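A per-store summary of this kind can be approximated from the product rows alone. The sketch below is illustrative: `summarize_store` and the summary key names are hypothetical, and only the row fields (vendor, productType, anyOnSale, variantsInStock) come from the list above.

```python
from collections import Counter


def summarize_store(rows: list[dict]) -> dict:
    """Aggregate product rows into a small per-store summary (hypothetical keys)."""
    vendors = Counter(r["vendor"] for r in rows if r.get("vendor"))
    types = Counter(r["productType"] for r in rows if r.get("productType"))
    return {
        "productCount": len(rows),
        # Top 10 vendors as (vendor, product count) pairs, most frequent first.
        "topVendors": vendors.most_common(10),
        "productTypes": dict(types),
        "productsOnSale": sum(1 for r in rows if r.get("anyOnSale")),
        # A product counts as in stock if at least one variant is available.
        "productsInStock": sum(1 for r in rows if r.get("variantsInStock", 0) > 0),
    }
```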

Why this matters

Shopify powers more than 4 million live stores. Each one of them silently publishes its full product catalog at /products.json, paginated at up to 250 products per page. Most growth teams don't know this, and the ones that do are stitching together one-off scripts. This actor turns that into a single, reliable, paginated, billable data source.

Built for:

  • Competitive intelligence: track a competitor's catalog every week to see what's new, what dropped, and what went on sale.
  • DTC / dropshipping research: discover niches by surveying which vendors are inside which collections across a list of stores.
  • Pricing surveillance: compare your prices vs. a curated panel of peers.
  • M&A / investor due diligence: snapshot a target's full SKU count, vendor mix, and pricing distribution from a single run.
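For the weekly tracking use case, two dataset exports can be compared by productId. This is a sketch of one possible approach, not something the actor does for you; `diff_snapshots` and its result keys are hypothetical names.

```python
def diff_snapshots(previous: list[dict], current: list[dict]) -> dict:
    """Compare two lists of product rows keyed by productId."""
    prev = {r["productId"]: r for r in previous}
    curr = {r["productId"]: r for r in current}
    shared = prev.keys() & curr.keys()
    return {
        "added": sorted(curr.keys() - prev.keys()),
        "dropped": sorted(prev.keys() - curr.keys()),
        # Products that were not on sale last time but are now.
        "newlyOnSale": sorted(
            pid for pid in shared
            if curr[pid].get("anyOnSale") and not prev[pid].get("anyOnSale")
        ),
    }
```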

Input

| Field | Required | Default | Description |
| --- | --- | --- | --- |
| stores | yes | - | List of domains or URLs. "gymshark.com", "https://allbirds.com", "shop.example.com" are all valid. |
| maxProductsPerStore | no | 5000 | Hard cap per store. Shopify itself enforces a page index ceiling of ~150 (true max ≈ 37,500 products per store). |
| includeCollections | no | true | Also pull /collections.json into the per-store summary. |
| rowMode | no | per_product | per_product (one row, all variants nested) or per_variant (one row per SKU). |
| includeBodyHtml | no | false | Include the long-form HTML product description. Skip unless you need it; it adds 2-10 KB per row. |
| detectStack | no | false | Fetch the store's homepage and detect installed apps + tech stack. Adds installedApps and techStack to the per-store summary. One extra HTTP request per store, no billing impact. |
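Since the stores field accepts bare domains and full URLs alike, each entry has to be reduced to a hostname at some point. The sketch below shows one way that could work, together with an example input using the documented defaults; `normalize_store` is a hypothetical helper and not necessarily how the actor parses its input.

```python
from urllib.parse import urlparse


def normalize_store(entry: str) -> str:
    """Reduce a domain or URL to a lowercase hostname (hypothetical helper)."""
    entry = entry.strip()
    if "://" not in entry:
        # Bare domains get a scheme so urlparse can locate the host part.
        entry = "https://" + entry
    host = urlparse(entry).hostname
    if not host:
        raise ValueError(f"cannot extract a hostname from {entry!r}")
    return host


# Example input; field names and defaults are taken from the table above.
example_input = {
    "stores": ["gymshark.com", "https://allbirds.com", "shop.example.com"],
    "maxProductsPerStore": 5000,
    "includeCollections": True,
    "rowMode": "per_product",
    "includeBodyHtml": False,
    "detectStack": False,
}
```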

Output

  • Dataset: one row per product (or per variant).
  • Key-value store: OUTPUT with per-store summaries and the billing ledger summary; BILLING_LOG with the full audit trail of every charge call.
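If you export per_product rows and later want one row per SKU, the nested variants can be flattened after the fact. This sketch assumes the nested list lives under a "variants" key and that each variant carries id, sku, price, compare_at_price, and available; those key names are assumptions about the row layout, and `to_variant_rows` is a hypothetical helper.

```python
def to_variant_rows(product_row: dict) -> list[dict]:
    """Expand one per_product row into one row per variant."""
    # Product-level fields are copied onto every variant row.
    shared = {k: v for k, v in product_row.items() if k != "variants"}
    return [
        {
            **shared,
            "variantId": v.get("id"),
            "sku": v.get("sku"),
            "price": v.get("price"),
            "compareAtPrice": v.get("compare_at_price"),
            "available": v.get("available"),
        }
        for v in product_row.get("variants", [])
    ]
```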

Pricing

PAY_PER_EVENT, $0.001 per product extracted (event name: product_extracted). A 5,000-product store costs ~$5. The per_variant mode carries no extra charge: billing is per product, not per variant.
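The arithmetic is simple enough to budget a run in advance. The sketch below assumes that products beyond maxProductsPerStore are neither extracted nor billed, which follows from the cap being a hard limit but is not stated explicitly; `estimate_cost` is a hypothetical helper.

```python
from decimal import Decimal

PRICE_PER_PRODUCT_USD = Decimal("0.001")  # $1.00 per 1,000 products


def estimate_cost(product_counts: list[int], max_products_per_store: int = 5000) -> Decimal:
    """Estimate the run cost in USD for a list of per-store product counts."""
    # Assumption: each store is billed for at most the configured cap.
    billable = sum(min(n, max_products_per_store) for n in product_counts)
    return billable * PRICE_PER_PRODUCT_USD
```

For example, a single store with 5,000 products comes to $5, and a panel of stores with 12,000 and 300 products under the default cap is billed for 5,300 products.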

v0.2: what's new

Stack detection. Set detectStack: true to fetch the store's homepage once and surface the installed apps + tech stack in the per-store summary:

"installedApps": ["klaviyo", "recharge", "loox", "gorgias", "meta_pixel"],
"techStack": {
    "checkout": ["shopify"],
    "subscriptions": ["recharge"],
    "reviews": ["loox"],
    "email": ["klaviyo"],
    "chat": ["gorgias"],
    "pixels": ["meta_pixel", "tiktok_pixel"],
    "analytics": ["ga4"]
}

Tells covered:

  • Subscriptions: Recharge, Ordergroove, Appstle, Loop
  • Reviews: Loox, Judge.me, Yotpo, Stamped, Okendo, Reviews.io
  • Email: Klaviyo, Privy, Justuno, Omnisend, Mailchimp
  • Chat: Gorgias, Intercom, Tidio, Crisp, Zendesk, Drift
  • Loyalty: Smile.io, Refersion
  • Analytics: GA4, Northbeam, TripleWhale
  • Pixels: Meta, TikTok, Pinterest
  • Page builders: Shogun, PageFly, GemPages
  • Upsell: Rebuy

Adds one extra GET request per store. No billing impact: billing still charges per product_extracted.
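Detection of this kind usually comes down to searching the homepage HTML for strings that a given app leaves behind. The sketch below shows the general idea only: the SIGNATURES table and its substrings are illustrative placeholders, not the actor's real fingerprints, and `detect_stack` is a hypothetical name.

```python
# Placeholder fingerprints: app name -> (category, substring to look for).
# These needles are assumptions for illustration, not verified signatures.
SIGNATURES = {
    "klaviyo": ("email", "klaviyo"),
    "recharge": ("subscriptions", "rechargeapps"),
    "loox": ("reviews", "loox"),
    "gorgias": ("chat", "gorgias"),
}


def detect_stack(html: str) -> dict:
    """Naive substring scan of homepage HTML for known app tells."""
    lowered = html.lower()
    installed: list[str] = []
    stack: dict[str, list[str]] = {}
    for app, (category, needle) in SIGNATURES.items():
        if needle in lowered:
            installed.append(app)
            stack.setdefault(category, []).append(app)
    return {"installedApps": installed, "techStack": stack}
```

A plain substring match is prone to false positives (a blog post that merely mentions an app would match), so a production detector would more likely look at script URLs or global variables.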

Notes & limits

  • Non-Shopify URLs are skipped, not failed. If a domain doesn't serve /products.json, the store appears in OUTPUT with a skipped reason (404, non-JSON, network) and no rows or charges are emitted.
  • Shopify hard limit. The public storefront API caps the page parameter around 150, a true ceiling of ~37,500 products. The actor stops cleanly when it hits that.
  • No personal data. This endpoint is public catalog data designed to be served to anyone who lands on the store. No customer info, no order info, no inventory counts beyond available: true/false.
  • Polite by default. Uses a normal browser User-Agent and follows redirects; no Apify Proxy needed because the endpoint is openly served.
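The skip behavior in the first note can be pictured as a small classification step on the first /products.json response. This is a minimal sketch under assumptions: `skip_reason` is a hypothetical helper, a status of None stands in for a failed connection, and the reason strings mirror the three listed above.

```python
import json
from typing import Optional


def skip_reason(status: Optional[int], body: str) -> Optional[str]:
    """Return a skip reason for a /products.json response, or None to proceed."""
    if status is None:
        # No HTTP response at all (DNS failure, timeout, refused connection).
        return "network"
    if status == 404:
        return "404"
    try:
        json.loads(body)
    except ValueError:
        # An HTML page or other non-JSON payload means this is not a product feed.
        return "non-JSON"
    return None
```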

Source

Each Shopify storefront exposes:

  • GET /products.json?limit=250&page=N returns the paginated product feed.
  • GET /collections.json?limit=250 returns collection metadata.

These are public endpoints, served the same way to scrapers, plugins, and your browser's view-source.
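To make the pagination concrete, here is a minimal standard-library sketch of walking /products.json page by page. It is an illustration under assumptions rather than the actor's implementation: `fetch_page` and `iter_products` are hypothetical names, the page ceiling of 150 is the approximate figure quoted in the notes, and the stopping rules (empty page, short page, cap reached) are one reasonable choice.

```python
import json
import urllib.request
from typing import Callable, Iterator

PAGE_SIZE = 250
MAX_PAGE = 150  # approximate ceiling quoted in the notes above


def fetch_page(domain: str, page: int) -> list[dict]:
    """Fetch one page of the public product feed."""
    url = f"https://{domain}/products.json?limit={PAGE_SIZE}&page={page}"
    request = urllib.request.Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response).get("products", [])


def iter_products(
    domain: str,
    max_products: int = 5000,
    fetch: Callable[[str, int], list[dict]] = fetch_page,
) -> Iterator[dict]:
    """Yield products until the feed ends, the cap is hit, or MAX_PAGE is reached."""
    emitted = 0
    for page in range(1, MAX_PAGE + 1):
        if emitted >= max_products:
            return
        products = fetch(domain, page)
        for product in products:
            if emitted >= max_products:
                return
            yield product
            emitted += 1
        if len(products) < PAGE_SIZE:
            # A short or empty page means there is nothing further to fetch.
            return
```

The fetch function is passed in as a parameter so the paging logic can be exercised with a stub instead of live requests.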
