Pricing
from $0.001 / result
Fashion Product Page Scraper
Under maintenance
This Actor extracts detailed product information from valid fashion e-commerce URLs, including product name, description, price, image URLs, available sizes, and brand metadata. It verifies whether a link is a genuine product page and determines real-time stock availability. Read more at www.flashi.ai
Actor stats
- Bookmarked: 0
- Total users: 5
- Monthly active users: 0
- Last modified: 6 months ago
Fashion Product Page Scraper
Built by Flash Intelligence
Extract detailed product data, real-time pricing, and stock availability from fashion e-commerce websites.
This Actor is a specialized scraping tool designed for the fashion and apparel industry. Unlike generic scrapers, it understands the unique structure of fashion product pages. It automatically detects product details, validates if a link is a genuine product page (filtering out blogs or collections), and returns structured JSON data ready for analysis.
Key Features
- Real-Time Stock Check: Instantly verifies if an item is currently in stock.
- Size Intelligence: Distinguishes between Available Sizes and Out-of-Stock Sizes.
- Smart Validation: Automatically validates URLs. If you input a non-product page (like a homepage or blog), the scraper flags it immediately rather than returning incorrect data.
- Rich Metadata: Extracts deep details including Brand, Material, Color, Condition, Gender, and Category.
- Image Extraction: Retrieves high-quality product image URLs.
- Price Monitoring: Captures the current selling price.
Use Cases
- Competitor Price Monitoring: Track how competitors are pricing similar fashion items.
- Inventory Tracking: Monitor when specific sizes go out of stock or are replenished.
- Dropshipping & Retail Arbitrage: Find underpriced items and verify availability instantly.
- Trend Analysis: Aggregate data on materials, colors, and descriptions across different brands.
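As a rough illustration of the inventory-tracking use case, the sketch below compares two scrapes of the same product and reports which sizes sold out or came back. The `size_changes` helper is hypothetical and not part of the Actor; it only assumes the `product_sizes_available` field that appears in this README's example output.

```python
def size_changes(previous: dict, current: dict) -> dict:
    """Compare the available sizes of two product_details snapshots.

    Hypothetical helper: both arguments are assumed to be the
    "product_details" object from two separate runs on the same URL.
    """
    before = set(previous.get("product_sizes_available", []))
    after = set(current.get("product_sizes_available", []))
    return {
        # Sizes that were available before and are no longer listed as available.
        "sold_out": sorted(before - after),
        # Sizes that were not available before and are available now.
        "restocked": sorted(after - before),
    }


yesterday = {"product_sizes_available": ["S", "M", "L"]}
today = {"product_sizes_available": ["M", "L", "XL"]}
print(size_changes(yesterday, today))
```

Storing one snapshot per run and diffing consecutive snapshots keeps the comparison independent of how often the Actor is scheduled.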
Input Parameters
The Actor takes a simple input object. You only need to provide the target url.
Example Input
{"url": "https://theminnesotan.com/products/bears-trucker"}
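If you assemble the input in code, a small pre-check can catch malformed links before a run is started. This is a minimal sketch: `build_run_input` is a hypothetical helper, and it assumes the input object carries only the documented `url` field.

```python
from urllib.parse import urlparse


def build_run_input(url: str) -> dict:
    """Return the Actor input object for a single product URL.

    Hypothetical helper: the README documents only the "url" field,
    so nothing else is added to the payload.
    """
    cleaned = url.strip()
    parsed = urlparse(cleaned)
    # Reject anything that is not an absolute http(s) URL.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not an absolute http(s) URL: {url!r}")
    return {"url": cleaned}


run_input = build_run_input("https://theminnesotan.com/products/bears-trucker")
print(run_input)
```

This only checks the URL's shape. Whether the page is a real product page is still decided by the Actor itself.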
Output Data
Results are stored in the default Apify Dataset and can be exported as JSON, CSV, Excel, or HTML.
1. Successful Scrape (Status: 200)
Valid product pages return detailed metadata and stock status.
{
  "input_url": "https://theminnesotan.com/products/bears-trucker",
  "scrape_status": "success",
  "product_in_stock": true,
  "is_valid_product_page": true,
  "product_details": {
    "product_title": "Bears Trucker Hat",
    "product_description": "Classic trucker hat featuring the Bears logo...",
    "product_price": 29.99,
    "product_sizes_available": ["S", "M", "L", "XL"],
    "product_sizes_out_of_stock": ["XXL"],
    "product_images": [
      "https://cdn.shopify.com/s/files/1/image_1.jpg",
      "https://cdn.shopify.com/s/files/1/image_2.jpg"
    ],
    "product_color": "Navy/Orange",
    "product_brand": "The Minnesotan",
    "product_material": "Cotton/Polyester Mesh",
    "product_sex": "Unisex",
    "product_category": "Accessories > Hats",
    "product_condition": "New"
  }
}
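One way to consume a record like this is sketched below. It parses a trimmed copy of the example and summarizes size availability. The `size_summary` helper and the `availability_ratio` key are illustrative assumptions, not fields the Actor returns.

```python
import json

# Trimmed copy of the example record; field names follow the README example.
raw = """
{
  "input_url": "https://theminnesotan.com/products/bears-trucker",
  "scrape_status": "success",
  "product_in_stock": true,
  "is_valid_product_page": true,
  "product_details": {
    "product_title": "Bears Trucker Hat",
    "product_price": 29.99,
    "product_sizes_available": ["S", "M", "L", "XL"],
    "product_sizes_out_of_stock": ["XXL"]
  }
}
"""


def size_summary(details: dict) -> dict:
    """Summarize size availability (hypothetical helper, not Actor output)."""
    available = details.get("product_sizes_available", [])
    sold_out = details.get("product_sizes_out_of_stock", [])
    total = len(available) + len(sold_out)
    return {
        "available": available,
        "sold_out": sold_out,
        # Share of listed sizes that can currently be bought; None if no sizes are listed.
        "availability_ratio": len(available) / total if total else None,
    }


record = json.loads(raw)
summary = size_summary(record["product_details"])
print(record["product_details"]["product_title"], summary)
```

Using `.get` with defaults keeps the helper tolerant of records where a size list is missing, which the README does not rule out.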
Error States & Validation
This Actor is designed to be robust. It distinguishes between a Technical Failure (which stops the run) and a Validation Failure (which logs the error and continues).
1. Invalid Page (Soft Failure)
If the URL provided is reachable but is not a fashion product page (e.g., a blog post, a homepage, or a privacy policy), the Actor will not fail.
Instead, it returns a successful response with an invalid_page status. This allows you to process bulk URLs without stopping the entire run because of one bad link.
{
  "input_url": "https://theminnesotan.com/blogs/news",
  "scrape_status": "invalid_page",
  "error": "The provided URL does not appear to be a product page."
}
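When processing many results, the soft-failure records can be separated from usable product records before analysis. The sketch below assumes only the two `scrape_status` values shown in this README (`success` and `invalid_page`); `split_results` is a hypothetical helper, and any other status is treated as rejected.

```python
def split_results(items: list[dict]) -> tuple[list[dict], list[dict]]:
    """Partition dataset items into usable products and rejected URLs."""
    products, rejected = [], []
    for item in items:
        # Keep a record only when the scrape succeeded and the page was a product page.
        if item.get("scrape_status") == "success" and item.get("is_valid_product_page"):
            products.append(item)
        else:
            rejected.append(item)
    return products, rejected


items = [
    {
        "input_url": "https://theminnesotan.com/products/bears-trucker",
        "scrape_status": "success",
        "is_valid_product_page": True,
    },
    {
        "input_url": "https://theminnesotan.com/blogs/news",
        "scrape_status": "invalid_page",
        "error": "The provided URL does not appear to be a product page.",
    },
]

products, rejected = split_results(items)
for item in rejected:
    print(item["input_url"], "->", item.get("error", "unknown error"))
```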
Included features
- Apify SDK for Python - a toolkit for building Apify Actors and scrapers in Python
- Input schema - define and easily validate a schema for your Actor's input
- Request queue - queues into which you can put the URLs you want to scrape
- Dataset - store structured data where each object stored has the same attributes
How it works
Insert your own code into the async with Actor: block. You can use the Apify SDK together with any other Python library.
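A minimal sketch of that structure is shown below, assuming the Apify SDK for Python is installed. The `to_record` helper and its `not_implemented` status are placeholders invented for illustration; they are not this Actor's real output or logic.

```python
def to_record(actor_input: dict) -> dict:
    """Build a placeholder output record (hypothetical shape, illustration only)."""
    return {"input_url": actor_input.get("url"), "scrape_status": "not_implemented"}


async def main() -> None:
    # Imported here so the helper above can be used without the SDK installed.
    from apify import Actor

    async with Actor:
        # Read the Actor input; fall back to an empty object if none was given.
        actor_input = await Actor.get_input() or {}
        Actor.log.info("Received input: %s", actor_input)

        # Your own scraping logic would go here.

        # Store one result object in the default dataset.
        await Actor.push_data(to_record(actor_input))
```

To execute it, call `asyncio.run(main())` from the project's entry point. Keeping the record-building step in a plain function makes it testable without starting an Actor run.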
Resources
- Python tutorials in Academy
- Video guide on getting data using Apify API
- Integration with Make, GitHub, Zapier, Google Drive, and other apps
- A short guide on how to build web scrapers using code templates
Getting started
For complete information see this article. In short, you will:
- Build the Actor
- Run the Actor
Pull the Actor for local development
If you would like to develop locally, you can pull the existing Actor from Apify Console using the Apify CLI:
- Install apify-cli
  Using Homebrew
  $ brew install apify-cli
  Using NPM
  $ npm -g install apify-cli
- Pull the Actor by its unique <ActorId>, which is one of the following:
  - unique name of the Actor to pull (e.g. "apify/hello-world")
  - or ID of the Actor to pull (e.g. "E2jjCZBezvAZnX8Rb")
  You can find both by clicking on the Actor title at the top of the page, which will open a modal containing both the Actor unique name and the Actor ID.
  This command will copy the Actor into the current directory on your local machine.
  $ apify pull <ActorId>
Documentation reference
To learn more about Apify and Actors, take a look at the following resources:
