VOOZH about

URL: https://apify.com/zhorex/jd-scraper

⇱ JD.com Scraper 2026 β€” Product & Seller Data, No Login [DEPRECATED] Β· Apify


πŸ‘ [DEPRECATED v0.7] JD.com Scraper οΏ½ Price Endpoint Blocked avatar

[DEPRECATED v0.7] JD.com Scraper οΏ½ Price Endpoint Blocked

Deprecated

Pricing

from $8.00 / 1,000 product detail extracteds

Go to Apify Store

[DEPRECATED v0.7] JD.com Scraper οΏ½ Price Endpoint Blocked

Deprecated

DEPRECATED οΏ½ JD.com pricing endpoints blocked at proxy-infrastructure level on Apify residential pool. Actor returns enrichment fields only (brand, title, category, images, stock) and does NOT charge οΏ½ realtimePrice universally null. Do not subscribe. v1.0 with premium proxy on roadmap.

Pricing

from $8.00 / 1,000 product detail extracteds

Rating

0.0

(0)

Developer

πŸ‘ Sami

Sami

Maintained by Community

Actor stats

0

Bookmarked

4

Total users

2

Monthly active users

a month ago

Last modified

Share

[DEPRECATED v0.7] JD.com Scraper β€” Price Endpoint Blocked

⚠️ DEPRECATED β€” Do not subscribe

JD.com's pricing endpoints (p.3.cn, c0.3.cn, item-soa.jd.com, cd.jd.com/promotion, union-pim.jd.com) are all blocked at the proxy-infrastructure level on Apify residential CN. Two distinct block patterns were verified across five endpoints and four proxy geographies (CN / HK / SG / no-country):

  • *.jd.com price aggregators return HTML error pages instead of JSON (WAF intercept)
  • *.3.cn hosts return curl (56) CONNECT tunnel failed, response 590 (proxy refuses to tunnel)

As a result: realtimePrice is universally null on this Actor today, and the PPE gate (added in v0.6.5) refuses to charge for any record that lacks a price. The Actor returns the enrichment fields (brand, title, category, images, stock, JD-self-run flag, service tags) for diagnostic visibility but never bills.

Subscribing to this Actor today will give you $0 in charges but also $0 in usable price data. If you need JD product pricing data, contact the author about funding a Bright Data / Oxylabs / Soax premium residential pool β€” a v1.0 release with that integration is on the roadmap.

Existing Saved Tasks continue to function (they receive the diagnostic records, not billing).


Extract enrichment fields from JD.com (京东 / Jingdong) product pages. Currently shipped fields (when not blocked): productTitle, brandName (3-layer fallback), categoryPath, images, isJdSelfRun, stockStatus, serviceTags, sellerId. Real-time price extraction is currently non-functional β€” see deprecation warning above.

Part of the Chinese Digital Intelligence Suite by zhorex β€” pairs with the Chinese Brand Monitor (cross-platform brand mention aggregator across Weibo + RedNote + Bilibili + Douban + Xueqiu, $0.045/mention) and the individual Weibo, RedNote, Xueqiu, Douban scrapers for full-stack China data coverage.


What you get per product

For each JD product URL or SKU ID you submit, one record with:

  • Identifiers β€” productId, productUrl, brandName, categoryPath (full breadcrumb)
  • Specs β€” full JD specs panel as a {key: value} dict (商品名称, 商品编号, δΈŠζžΆζ—Άι—΄, dimensions, color options…)
  • Images β€” primaryImageUrl plus full descriptionImages array for catalog ingestion
  • Pricing β€” realtimePrice queried fresh at scrape time (not stale page-load price)
  • Seller signal β€” sellerId, isJdSelfRun flag (true = JD's own warehouse + warranty + return logistics; false = third-party merchant)
  • Stock β€” stockStatus enum (in_stock / low_stock / out_of_stock), best-effort stockCount
  • Service tags β€” JD's protection options decoded to human-readable English: 7-day return, JD Tianhua (warranty), JD Plus exclusive, cash on delivery, etc.
  • Origin β€” shippingCity (defaults to Beijing β€” JD's central warehouse region)
  • Timestamp β€” scrapedAt UTC ISO 8601

Why this Actor, not a generic e-commerce scraper

  • isJdSelfRun flag β€” JD's hybrid model means each SKU is either fulfilled by JD itself (own logistics, warranty, return path) or by a third-party merchant on the marketplace. Generic scrapers don't distinguish; this one surfaces the flag on every record.
  • Service tag decoding β€” JD's protection options arrive as opaque numeric codes; this Actor maps them to human-readable English so downstream pipelines don't have to maintain the code table.
  • Real-time price β€” realtimePrice is fetched fresh at scrape time, not parsed from cached HTML, so it captures flash-discount cycles that move within hours.

Example input

{
"mode":"product_detail",
"productUrls":[
"https://item.jd.com/100037053980.html",
"100066898260"
]
}

The Actor accepts a mix of full URLs and bare SKU IDs. Duplicates are removed automatically.


Example output (truncated)

{
"mode":"product_detail",
"productId":"100037053980",
"productTitle":"εΎ—εŠ›οΌˆdeliοΌ‰ 6018 ε‰ͺεˆ€ε‰ͺ子ε‰ͺ纸裁ε‰ͺ εŠžε…¬ζ–‡ε…·η”¨ε“",
"brandName":"εΎ—εŠ›",
"categoryPath":["ζ–‡ζ•™ζ–‡εŒ–η”¨ε“","εˆ‡ε±‘ζ–‡ε…·","εŠžε…¬/ε­¦η”Ÿε‰ͺεˆ€"],
"specs":{"商品名称":"εΎ—εŠ› 6018","商品编号":"100037053980"},
"realtimePrice":"12.90",
"priceCurrency":"CNY",
"priceSource":"item-soa",
"sellerId":"0",
"isJdSelfRun":true,
"stockStatus":"in_stock",
"serviceTags":["7-day return","JD Tianhua (warranty)"],
"primaryImageUrl":"https://img12.360buyimg.com/n0/jfs/...",
"descriptionImages":["...","..."],
"shippingCity":"εŒ—δΊ¬",
"productUrl":"https://item.jd.com/100037053980.html",
"scrapedAt":"2026-05-16T10:00:00+00:00"
}

Pricing (Pay-Per-Event)

EventPrice
product-detail-scraped$0.008 / product

Realistic workflow costs

WorkflowVolumeCost
Daily price tracking on 200 SKUs (per month)6,000 details/mo~$48 / month
Competitor SKU enrichment (one-time batch)1,000 products$8
Hedge-fund-grade daily refresh, 5,000 SKUs150,000 details/mo~$1,200 / month
Catalog migration / one-time pull50,000 SKUs$400

Proxy

The Actor's input schema defaults to Apify residential proxy with apifyProxyCountry: "CN" β€” leave it on for production workflows. Apify residential proxy is billed separately from the per-event price (typically a few cents per MB transferred β€” see your Apify Billing β†’ Proxy usage).

Most non-CN residential IPs also work because item.jd.com applies lighter rate-limiting than other JD subdomains. The default config is the safest bet but you can experiment.


Status note (v0.6.3)

JD's classic price endpoint p.3.cn/prices/mgets is currently rate-limited at the edge for most shared residential proxy pools (Apify included). v0.6.3 ships with a multi-endpoint price fallback β€” the Actor tries item-soa.jd.com/getWareBusiness first (a modern aggregator that also returns brand info), then c0.3.cn/stock, then the legacy p.3.cn. The first endpoint that returns parseable data wins; price + brand are filled together when possible.

If all three price hosts are blocked AND the page parser can't recover a brand name from the HTML / title, the record is pushed with an error: "missing_price_and_brand" field and no PPE event is charged for it. You always see the diagnostic in the dataset, you never pay for an empty record.

Brand-name extraction has three fallback layers: the canonical #parameter-brand link, the parameter2 ε“η‰Œ field, and a title-pattern heuristic (JD product titles wrap the brand inside the first (...οΌ‰ pair).


What this Actor does NOT do (and why)

Earlier versions (v0.1–v0.5) shipped four modes: product_detail, seller_store, product_search, product_reviews. The latter three were removed in v0.6 after extensive testing because JD's WAF reliably blocks them on Apify's shared residential proxy pools, returning η³»η»ŸηΉεΏ™ or silently redirecting to the JD homepage. We tested:

  • curl_cffi with Chrome TLS impersonation β€” blocked
  • Playwright with full Chromium + JS execution + primed cookies β€” blocked
  • Four proxy geographies (CN / HK / SG / no-country) β€” all blocked
  • Mobile API endpoints β€” return 403 (require JD mobile-app signing scheme)

The block is at the IP-reputation layer, not anything fixable client-side. Rather than ship modes that return zero items and accidentally charge buyers, v0.6 ships only the mode that works reliably.

If you specifically need search, reviews, or seller-store data from JD, contact the author about integrating a premium residential proxy pool (Bright Data / Oxylabs / Soax) with cleaner reputation against JD specifically. That work is parked as a roadmap item; one paying buyer would unlock it.

Saved tasks that still call the removed modes get a clean rejection message β€” no PPE charge.


Use cases

  1. Competitor pricing intelligence β€” realtimePrice enables sub-hour competitor SKU price tracking. Pair with a daily-cron schedule via Apify's Saved Tasks.
  2. Catalog enrichment β€” pull every JD product from your brand's SKU list to refresh title, specs, current price, stock status for downstream BI / dashboards.
  3. JD-self-run vs marketplace audit β€” the isJdSelfRun flag identifies which of your SKUs are sold directly by JD (your authorized channel) vs by third-party merchants (potential gray-market / unauthorized resale).
  4. AI training data β€” JD product titles, specs, and category paths are clean labeled data for product-classification, product-matching, and Chinese commerce NLP tasks.

Part of the Chinese Digital Intelligence Suite

  • πŸ†• Chinese Brand Monitor β€” Cross-platform brand mention aggregator (Weibo + RedNote + Bilibili + Douban + Xueqiu in one normalized feed, sentiment + dedup, $0.045/mention). Pairs perfectly with this Actor: track your competitors' JD product pricing here, then monitor consumer sentiment about those brands across all 5 social platforms in one call.
  • Weibo Scraper β€” public sentiment, hot search, KOL posts
  • RedNote Scraper β€” lifestyle / consumer brand reviews
  • Xueqiu Scraper β€” Chinese stock discussion & cashtag sentiment
  • Douban Scraper β€” film / book / music reviews & ratings

Compliance posture

  • Only public JD data β€” same content any anonymous browser visitor sees on item pages.
  • No login bypass β€” does not attempt authenticated-only content.
  • No personal data harvesting β€” only the seller-identification metadata JD itself displays publicly.

Buyers running this commercially are responsible for downstream compliance with their own jurisdiction's data laws.


Support

Found a bug? Need a field that's not extracted? Open an issue on the Actor page β€” typical turnaround 48 hours.

If this Actor saves you time, a 30-second review is the single biggest thing that helps β€” it brings the tool to other buyers and pays for continued maintenance.

You might also like

Watch Arbitrage Tracker β€” Rolex/Patek/AP Γ— 13 Marketplaces

kazkn/watch-arbitrage-mcp

Cross-platform Patek/Rolex/AP arbitrage. Tracks 13 marketplaces: Chrono24, WatchBox, Bob's, Watchfinder, European Watch, Watches of Switzerland, Watch Club, Spliedt, A Collected Man, Analog:Shift, Bachmann & Scher, Yahoo Japan + Hodinkee. Telegram alerts on cross-country spreads. Pay-Per-Event.

All-in-One Facebook Scraper

get-leads/all-in-one-facebook-scraper

Facebook scraper β€” 12 modes: pages, posts, events, groups, search, reviews, comments, marketplace, reels & ads. HTTP-only, 256MB, fast. Premium residential proxy (~95% success rate). Up to 50% cheaper than alternatives. MCP-ready for AI agents.

91

Bol.com Price Tracker - NL/BE Drops + AI Briefs

harvestlab/bol-com-scraper

Track Bol.com NL/BE product prices, availability, sellers, ratings, delivery notes, and price-drop history. Built for Benelux market monitoring with clear diagnostics when public pages block access.

HomeStars Scraper β€” Business Leads, Reviews & Contacts

scrapersdelight/homestars-scraper

Scrape HomeStars.com home-service businesses into lead lists β€” company, rating, review count, location, categories, and full profiles β€” by city, category, or search URL. No login or API key. From $3 per 1,000 listings, $8 per 1,000 full profiles.

πŸ‘ User avatar

Scrapers Delight

5

5.0

(1)

Pinnacle Odds Scraper β€” h2h, spreads, totals + 5K specials

zhorex/sports-odds-aggregator

Pre-match + live Pinnacle odds. 11 sports, h2h / spreads / totals + 5,000+ specials per sport (futures, yes-no, exact totals, first-to-score, team props). PPE $0.01-0.04 per snapshot. Datacenter proxy. The Odds API + OddsJam alternative for sharp bettors and EV teams.

Website Change Monitor & Diff Tracker

ryanclinton/website-change-monitor

Monitor any website for content changes with automatic diff detection. Track pricing pages, competitor sites, ToS updates, and more. Compares snapshots, reports added/removed text, and supports CSS selector targeting for precise monitoring.

18

Home Services Lead Finder - HVAC, Plumbing, Roofing

seibs.co/home-services-lead-finder

Enriched Google Maps leads for US home services contractors. Email, tech stack (ServiceTitan/Housecall Pro/Jobber/FieldEdge), license numbers, service area, social profiles. Built for sales teams selling INTO contractors and PE firms rolling them up.

Facebook Review Export and Business Reputation Monitor

scrapemint/facebook-review-intelligence

For local businesses, agencies, and franchise operators. Pulls every Facebook recommendation for any business page with reviewer name, text, date, likes, and comment count. Monitor reputation and benchmark competitors without a SaaS subscription.