VOOZH about

URL: https://apify.com/crawlerbros/website-image-scraper

⇱ Website Image Scraper Β· Apify


Pricing

from $1.00 / 1,000 results

Go to Apify Store

Website Image Scraper

Extract every image URL from a website. Crawls the start page (and optionally internal links up to a configurable depth), parses `<img>` tags, `<picture>`/`<source>`, `srcset` candidates, and CSS `background-image` declarations. HTTP-only, no proxy or browser needed.

Pricing

from $1.00 / 1,000 results

Rating

0.0

(0)

Developer

πŸ‘ Crawler Bros

Crawler Bros

Maintained by Community

Actor stats

1

Bookmarked

35

Total users

16

Monthly active users

2 months ago

Last modified

Share

Extract every image URL from a website. Crawls the start page (and optionally internal links up to a configurable depth), then parses <img> tags, <picture>/<source>, srcset candidates, <link rel="icon">, and CSS background-image declarations. HTTP-only β€” no browser, no proxy, no API key.

What it does

  • Pull every image URL referenced on a page β€” <img src>, lazy-loaded data-src, srcset candidates, picture sources, favicons, inline style="background-image: url(...)".
  • Crawl deeper β€” follow internal links up to maxCrawlDepth (same host only) to grab images from linked pages too.
  • Filter by format β€” restrict to specific extensions (e.g. only SVG, only WebP/AVIF).
  • Bounded β€” maxImagesPerPage and maxTotalImages keep runs cost-predictable on large galleries.

Input

FieldTypeDefaultDescription
startUrlstring (required)https://apify.comPage to start crawling. Must be http:// or https://.
maxCrawlDepthinteger1 (0–5)0 = only the start URL; 1+ = follow internal links one level (same host only).
maxImagesPerPageinteger200 (1–5000)Cap per page β€” keeps pathological galleries bounded.
maxTotalImagesinteger1000 (1–50000)Hard cap on total images emitted across the whole run.
imageExtensionsarray[jpg, jpeg, png, gif, webp, svg, avif, bmp, ico]Only URLs whose path ends in one of these are kept.
includeBackgroundImagesbooleantrueAlso extract from inline style="background-image: url(...)".
userAgentstring(Chrome 131)Optional UA override.

Example input

{
"startUrl":"https://apify.com",
"maxCrawlDepth":1,
"maxImagesPerPage":200,
"maxTotalImages":500,
"imageExtensions":["jpg","png","webp","svg"],
"includeBackgroundImages":true
}

Output

One record per unique image URL. Empty fields are omitted (no nulls).

{
"url":"https://apify.com/static/hero.jpg",
"sourcePage":"https://apify.com/",
"pageTitle":"Apify Β· The full-stack web-scraping & automation platform",
"alt":"Apify hero image",
"hasAltText":true,
"title":"Apify",
"width":1200,
"height":600,
"extension":"jpg",
"discoveredVia":"img-tag",
"mimeTypeHint":"image/jpeg",
"crawlDepth":0,
"scrapedAt":"2024-12-16T14:23:11+00:00"
}

Output fields

  • url β€” absolute URL of the image (data: URIs and javascript: pseudo-URLs are filtered out).
  • sourcePage β€” the page where the image was discovered.
  • pageTitle β€” <title> of the page where the image was found (handy for grouping the dataset by page name).
  • alt β€” alt attribute of the <img> tag (when present).
  • hasAltText β€” derived boolean: true when alt is present and non-empty. Lets you filter accessibility issues without testing for field presence.
  • title β€” title attribute (when present).
  • width / height β€” explicit pixel dimensions from the tag (only emitted when numeric).
  • extension β€” lowercase file extension parsed from the URL path (e.g. "jpg", "svg", "webp"). Useful for format-bucket aggregations.
  • discoveredVia β€” one of img-tag, srcset, picture-source, link-icon, css-background.
  • mimeTypeHint β€” derived from the file extension (e.g. image/png, image/svg+xml).
  • crawlDepth β€” depth at which the page was crawled (0 = startUrl).
  • scrapedAt β€” ISO-8601 timestamp.

Use cases

  • Content audits β€” see every image a website serves up, broken down by source (img tag vs CSS background).
  • Asset inventory β€” pull all logos, hero images, and icons from a competitor or brand site.
  • Format migration β€” find every JPEG/PNG to convert to WebP/AVIF, or every PNG to convert to SVG.
  • SEO / accessibility β€” list images with hasAltText: false to flag accessibility issues at a glance.

FAQ

Does it download the image binaries? No. The actor only collects URLs and metadata. Combine with a separate downloader (or pipe URLs into Apify's standard "URL list" actor) if you need the bytes.

Does it work on JavaScript-rendered pages? Mostly no. This scraper is HTTP-only β€” it sees the server-rendered HTML, not what runs after the page boots. If a site lazy-loads images via React/Vue, you may only see fallback / placeholder images. For SPA-rendered content, use a Playwright-based actor instead.

Can I limit it to a single page? Set maxCrawlDepth: 0. Only the start URL is fetched.

Does it follow external links? No. Internal-link crawling only follows links to the same host as startUrl to keep cost and scope bounded.

What if the site has no images at all? You get a single sentinel record {"type": "website_image_scraper_error", "reason": "no_images_found"} so the dataset is non-empty. The run still completes successfully.

How does it deduplicate? By absolute URL. The same image referenced from multiple pages produces one record (the first-seen page is recorded as sourcePage).

You might also like

Image Scraper

rapidtech1898/image-scraper

Extract image links from any website quickly and easily. Enter a URL and the scraper collects all available image URLs in seconds. Perfect for designers, marketers, and developers who need fast access to image sources without manual searching.

103

1.0

Bulk Image Downloader

trudax/bulk-image-downloader

Download all images from a website with this easy-to-use Bulk Image Downloader. Scrape all images from any website by URL to a zip file with a single click.

Image Scraper - Download All Images From Site

logiover/website-image-media-extractor

Scrape all images from a website without API or login. Bulk image & media URL extractor with alt text; export to CSV/JSON for AI datasets.

Website Image Scraper

gomorrhadev/website-image-scraper

Website Image Scraper is a fast, lightweight tool that crawls websites to extract image URLs (jpg, png, svg) without downloading files or using browsers. It supports recursive crawling, respects robots.txt, and efficiently collects image links for analysis or monitoring or a later download.

πŸ‘ User avatar

Gomorrha UG (haftungsbeschrΓ€nkt)

305

5.0

Web Images Scraper

jupri/web-images-scraper

Scrape Images from a Webpage

Bulk Image Downloader

onescales/bulk-image-downloader

The Bulk Image Downloader is a powerful Apify actor that extracts and downloads images from web pages or processes direct image URLs in bulk. Whether you need to download a single image or thousands of images from multiple websites, this tool handles it all efficiently.

1.1K

5.0

Google Images Scraper

hooli/google-images-scraper

Scrape image details from images.google.com. Add your query and number of images and extract image details such as image URL, image source, description, image dimensions, thumbnail, and more. Export scraped data, run the scraper via API, schedule and monitor runs, or integrate with other tools.

Website Scraper

dz_omar/ai-lead-extractor

Extract information from websites using intelligent AI πŸ€–from contact details to custom data fields, summaries, and creative content 🌐. Automatically crawl contact, about, and team pages to gather emails, phone numbers, job titles, and social links. Batch process hundreds of URLs efficiently.

πŸ‘ User avatar

FlowExtract API

143

3.9

Google Image Downloader

kawsar/google-image-downloader

Google Image Downloader scrapes Google Images for any keyword and returns full-size image URLs, thumbnails, source pages, and dimensions, so you can build datasets, run SEO audits, or collect images without writing a single line of code.

Image Downloader

apify/image-downloader

Related articles

Top 5 Google Image Search APIs to extract web image data
Read more