Pricing
from $4.99 / 1,000 results
Walmart Data Extractor
Walmart Data Extractor pulls product details, pricing, ratings, and availability from Walmart for fast market research. Automate leads, monitor competitors, and track trends with reliable data. Great for B2B insights and analytics.
Extract rich, structured product data from Walmart.com at scale. Feed it
category pages, search pages, or product (/ip/) URLs, or just a keyword, and
get back prices, images, brand, full specifications, ratings, seller info, and
more. Built for reliability with automatic proxy escalation, anti-bot
browser impersonation, retries, and real-time dataset saving.
Why Choose This Actor?
- Bulk URLs: mix category, search, and product URLs in a single run.
- Smart proxy escalation: starts direct, falls back to datacenter, then to residential automatically, and sticks with residential once it has to.
- Anti-bot by design: uses `impit` browser impersonation (real TLS/HTTP fingerprints) instead of heavy headless browsers.
- Live results: products stream into the output table as they're scraped, grouped by source section, so a mid-run stop never loses data.
- Reviews & specs: opt into reviews and get full `idml` specifications.
- Customizable output: reshape every record with your own Python hooks.
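
The proxy escalation described above can be pictured with a small sketch. This is not the Actor's actual code: the `ProxyEscalator` class, the `fetch_with_escalation` helper, and the `fake_fetch` stand-in are hypothetical names, and a "block" is modeled simply as the fetcher returning `None`.

```python
from typing import Callable, Optional

# Tiers in the order the README describes: direct first, residential last.
TIERS = ("direct", "datacenter", "residential")


class ProxyEscalator:
    """Tracks the current proxy tier and only ever moves toward residential."""

    def __init__(self) -> None:
        self.level = 0  # index into TIERS

    @property
    def tier(self) -> str:
        return TIERS[self.level]

    def escalate(self) -> bool:
        """Move up one tier; return False when already on the last tier."""
        if self.level + 1 < len(TIERS):
            self.level += 1
            return True
        return False


def fetch_with_escalation(
    url: str,
    fetch: Callable[[str, str], Optional[str]],
    escalator: ProxyEscalator,
) -> Optional[str]:
    """Try the current tier and escalate each time the fetch is blocked."""
    while True:
        body = fetch(url, escalator.tier)
        if body is not None:
            return body
        if not escalator.escalate():
            return None  # blocked even on the residential tier


# Hypothetical fetcher for illustration: only residential gets through.
def fake_fetch(url: str, tier: str) -> Optional[str]:
    return f"<html>{url}</html>" if tier == "residential" else None


escalator = ProxyEscalator()
first = fetch_with_escalation(
    "https://www.walmart.com/search?q=laptop", fake_fetch, escalator
)
tier_after_first = escalator.tier
```

Because the escalator is shared across requests and never steps back down, every request after the first block starts on the tier that last worked, which is one way to get the "sticky" behavior.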
Key Features
| Feature | Description |
|---|---|
| Category scraping | Auto-paginates browse/category pages |
| Search scraping | Search pages or a raw keyword |
| Product detail | Direct /ip/ URL extraction |
| Reviews | `includeReviews` / `onlyReviews` |
| Limits | `maxItems` (global) and `endPage` |
| Location | Best-effort `zipCode` targeting |
| Proxy | direct → datacenter → residential (sticky) |
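
To make the three supported URL kinds concrete, here is a rough sketch of how a URL could be classified and how a raw keyword could be turned into a search URL. The function names are hypothetical, and the path prefixes are inferred from the example URLs in this README (`/browse/`, `/search`, `/ip/`); the Actor may recognize other forms as well.

```python
from urllib.parse import urlencode, urlparse


def keyword_to_search_url(keyword: str) -> str:
    """Build a search URL of the form used in this README's API example."""
    return "https://www.walmart.com/search?" + urlencode({"q": keyword})


def classify_url(url: str) -> str:
    """Return 'product', 'search', 'category', or 'unknown' based on the path."""
    path = urlparse(url).path
    if path.startswith("/ip/"):
        return "product"
    if path.startswith("/search"):
        return "search"
    if path.startswith("/browse/"):
        return "category"
    return "unknown"


search_url = keyword_to_search_url("laptop")
kind = classify_url(search_url)
```

Here `search_url` is `https://www.walmart.com/search?q=laptop` and `kind` is `"search"`. Keywords with spaces are form-encoded by `urlencode`, so `"gaming laptop"` becomes `q=gaming+laptop`.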
Input
```json
{
  "startUrls": [
    {"url": "https://www.walmart.com/browse/auto-tires/brake-pads/91083_1074765_9038935_4670095_4582920"}
  ],
  "search": "laptop",
  "maxItems": 10,
  "endPage": null,
  "zipCode": "10001",
  "includeReviews": false,
  "onlyReviews": false,
  "proxy": {"useApifyProxy": false}
}
```
| Field | Type | Description |
|---|---|---|
| `startUrls` | array | Walmart category / search / product URLs (bulk). Required. |
| `search` | string | Keyword, converted to a search URL. |
| `maxItems` | integer | Cap on total products. Empty = no limit. |
| `endPage` | integer | Last category/search page to read. |
| `zipCode` | string | US ZIP for localized pricing/availability. |
| `postalCode` | integer | Deprecated: use `zipCode`. |
| `includeReviews` | boolean | Attach reviews to each product. |
| `onlyReviews` | boolean | Keep only reviews + identifiers. |
| `extendOutputFunction` | string | Python `def extendOutputFunction(product)`; the returned dict is merged into the record. |
| `outputFilterFunction` | string | Python `def outputFilterFunction(product)`; reshapes or drops the record. |
| `proxy` | object | Proxy config. Default: no proxy (auto-escalates on block). |
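
As an illustration of the two Python hooks, the sketch below shows one possible pair. Only the two function signatures come from the table above. The added field names (`isInStock`, `priceCents`), the idea that returning `None` drops a record, the order in which the hooks run, and the local `apply_hooks` harness are all assumptions made for this example.

```python
from typing import Optional


def extendOutputFunction(product: dict) -> dict:
    """Return extra fields to be merged into the product record."""
    price = product.get("price")
    return {
        "isInStock": product.get("availabilityStatus") == "IN_STOCK",
        "priceCents": round(price * 100) if isinstance(price, (int, float)) else None,
    }


def outputFilterFunction(product: dict) -> Optional[dict]:
    """Keep a slim record. Returning None to drop a record is an assumption."""
    if product.get("price") is None:
        return None
    keep = ("name", "brand", "price", "usItemId", "productUrl", "isInStock", "priceCents")
    return {key: product.get(key) for key in keep}


def apply_hooks(product: dict) -> Optional[dict]:
    """Hypothetical local harness: merge the extension, then filter."""
    merged = {**product, **extendOutputFunction(product)}
    return outputFilterFunction(merged)


sample = {
    "name": "MAX Advanced Brakes - Brake Kit ...",
    "brand": "Max Advanced Brakes",
    "price": 194.99,
    "availabilityStatus": "IN_STOCK",
    "usItemId": "1902495893",
}
result = apply_hooks(sample)
```

In the Actor input, each function would be passed as a string in its respective field. Testing the hooks locally against a sample record like this is a cheap way to catch mistakes before a paid run.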
Output
Each product is pushed as one dataset row with the full Walmart product object plus convenience columns for the table view:
```json
{
  "name": "MAX Advanced Brakes - Brake Kit ...",
  "brand": "Max Advanced Brakes",
  "priceString": "$194.99",
  "price": 194.99,
  "availabilityStatus": "IN_STOCK",
  "usItemId": "1902495893",
  "productUrl": "https://www.walmart.com/ip/.../1902495893",
  "imageUrl": "https://i5.walmartimages.com/seo/...jpeg",
  "sourceSection": "browse_auto_tires",
  "sourceUrl": "https://www.walmart.com/browse/...",
  "priceInfo": {"currentPrice": {"price": 194.99, "priceString": "$194.99"}},
  "idml": {"specifications": {}, "longDescription": "..."},
  "reviews": null
}
```
A structured, per-section summary (mirroring results_by_url) is also written to
the key-value store as OUTPUT.
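
Once the rows are exported, grouping them by `sourceSection` is a natural first step. The sketch below computes a small per-section price summary from rows shaped like the example above. It is a local aggregate for illustration, not the format of the `OUTPUT` record, and the `search_laptop` section name and the extra rows are made up.

```python
from collections import defaultdict


def summarize_by_section(rows: list) -> dict:
    """Count products and find the min/max price per sourceSection."""
    prices = defaultdict(list)
    for row in rows:
        price = row.get("price")
        # Rows without a numeric price are skipped rather than treated as 0.
        if isinstance(price, (int, float)):
            prices[row.get("sourceSection", "unknown")].append(price)
    return {
        section: {"count": len(values), "min": min(values), "max": max(values)}
        for section, values in prices.items()
    }


rows = [
    {"usItemId": "1902495893", "price": 194.99, "sourceSection": "browse_auto_tires"},
    {"usItemId": "A", "price": 89.5, "sourceSection": "browse_auto_tires"},
    {"usItemId": "B", "price": 499.0, "sourceSection": "search_laptop"},
    {"usItemId": "C", "price": None, "sourceSection": "search_laptop"},
]
summary = summarize_by_section(rows)
```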
How to Use (Apify Console)
- Log in at https://console.apify.com → Actors.
- Open Walmart Data Extractor.
- Paste your Walmart URLs (or a keyword), set `maxItems`, and configure proxy.
- Click Start.
- Watch products stream into the run log and Output tab in real time.
- Export to JSON / CSV / XLSX when done.
Use via API
```bash
curl -X POST "https://api.apify.com/v2/acts/<ACTOR_ID>/run-sync-get-dataset-items?token=$APIFY_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"startUrls":[{"url":"https://www.walmart.com/search?q=laptop"}],"maxItems":10}'
```
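
The same call can be made from Python with only the standard library. This is a minimal sketch of the request shown in the curl command: `build_run_request` and `run_actor` are hypothetical helper names, the timeout value is arbitrary, and the response is assumed to be a JSON array of dataset items.

```python
import json
import urllib.request
from urllib.parse import quote, urlencode

API_BASE = "https://api.apify.com/v2/acts"


def build_run_request(actor_id: str, token: str, run_input: dict) -> urllib.request.Request:
    """Build the POST request for the run-sync-get-dataset-items endpoint."""
    url = (
        f"{API_BASE}/{quote(actor_id, safe='')}"
        f"/run-sync-get-dataset-items?{urlencode({'token': token})}"
    )
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_actor(actor_id: str, token: str, run_input: dict, timeout: float = 300.0) -> list:
    """Send the request and decode the response body as JSON."""
    request = build_run_request(actor_id, token, run_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


run_input = {
    "startUrls": [{"url": "https://www.walmart.com/search?q=laptop"}],
    "maxItems": 10,
}
# Nothing is sent here; call run_actor(...) with a real Actor ID and token to run it.
request = build_run_request("ACTOR_ID", "APIFY_TOKEN_VALUE", run_input)
```

Splitting request construction from sending keeps the URL and payload easy to inspect or log before any billable run starts.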
Best Use Cases
- Price monitoring & repricing
- Catalog & assortment analysis
- Competitor & market research
- Brand / seller tracking
Pricing
This actor uses the pay-per-event model. The primary event is `row_result`,
charged once per product saved to the dataset. Platform startup is covered by
the synthetic `apify-actor-start` event. You only pay for the products you
actually receive.
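
For budgeting, a back-of-the-envelope estimate follows from the listed price of "from $4.99 / 1,000 results". The sketch below assumes that rate applies to every `row_result` event. Your plan's rate may differ, and the cost of the start event is not stated here, so it is a parameter defaulting to zero.

```python
from decimal import Decimal, ROUND_HALF_UP

# The "from" price quoted on the listing; treat it as a lower bound.
PRICE_PER_1000_RESULTS = Decimal("4.99")


def estimate_cost(products_saved: int, start_fee: Decimal = Decimal("0")) -> Decimal:
    """Estimate the run cost in USD, rounded to the nearest cent."""
    if products_saved < 0:
        raise ValueError("products_saved must be non-negative")
    total = PRICE_PER_1000_RESULTS * products_saved / 1000 + start_fee
    return total.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


small_run = estimate_cost(10)
large_run = estimate_cost(2500)
```

Under these assumptions a 10-product run comes to about $0.05 and a 2,500-product run to about $12.48.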
FAQ
Which URLs are supported? Category/browse pages, search pages, and product
(/ip/) pages.
Do I need a proxy? No. The actor runs direct by default and only escalates to datacenter then residential proxies if Walmart blocks the request.
Can I limit the run? Yes. Use `maxItems` for a global cap and `endPage` to
stop pagination early.
Why are some fields null? Walmart does not provide every field for every
product, and reviews are only attached when `includeReviews` or `onlyReviews`
is enabled.
Cautions / Legal
- Data is collected only from publicly available Walmart pages.
- You are responsible for compliance with Walmart's ToS and applicable laws (GDPR, CCPA, etc.). Use reasonable rate limits and scrape responsibly.
Support & Feedback
Open an issue on the Actor's Issues tab with your run ID and input, and we'll take a look.
