Sitemap & URL Extractor — Get Every URL of a Website

Pricing

Pay per usage

Get every URL of a website. The actor parses sitemap.xml and sitemap index files (discovered via robots.txt or at the default location) and falls back to a same-site crawl when there is no sitemap. It returns each URL with its lastmod date. No API key is needed.

Rating

0.0

(0)

Developer

Daniel Brenner

Maintained by Community

Last modified

15 days ago

What you get (per URL)

url — the page URL (absolute, deduped)
lastmod — last-modified date from the sitemap, when present (null otherwise)
source — "sitemap" or "crawl" (how the URL was found)
discoveredAt
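
To make "absolute, deduped" concrete, here is a minimal Python sketch of how a list of raw links could be resolved and deduplicated. It is an illustration only, not the actor's code: the function name absolute_deduped is hypothetical, and dropping #fragments before comparing is an assumption rather than documented behaviour.

```python
from urllib.parse import urljoin, urldefrag


def absolute_deduped(base_url, raw_urls):
    """Resolve each URL against base_url and drop duplicates, keeping first-seen order."""
    seen = set()
    result = []
    for raw in raw_urls:
        # urljoin leaves absolute URLs unchanged and resolves relative ones.
        # Assumption: "#fragment" parts are dropped so /b and /b#top count as one page.
        absolute, _fragment = urldefrag(urljoin(base_url, raw.strip()))
        if absolute not in seen:
            seen.add(absolute)
            result.append(absolute)
    return result
```

For example, passing "/a" and "https://example.com/a" with the base "https://example.com/" yields a single entry.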

How to use it

{"startUrls":["https://example.com"],"maxResults":5000}

Pass a site URL (the sitemap is found automatically) or a direct sitemap URL. It handles sitemap-indexes (sites that split their sitemap into many files) by following each child sitemap, and if there's no sitemap at all it falls back to a polite, same-site crawl. It respects robots.txt, identifies itself, and fetches one request at a time.
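
The discovery steps described above (read robots.txt, then tell a sitemap index apart from a plain sitemap) can be sketched roughly as follows. This is a simplified illustration using only the Python standard library, not the actor's implementation; the function names are hypothetical, and fetching, gzip handling, and error handling are left out.

```python
import xml.etree.ElementTree as ET


def sitemaps_from_robots(robots_txt):
    """Return the URLs declared on 'Sitemap:' lines of a robots.txt body."""
    found = []
    for line in robots_txt.splitlines():
        # Split at the first colon only, so the URL's own "://" stays intact.
        key, sep, value = line.partition(":")
        if sep and key.strip().lower() == "sitemap" and value.strip():
            found.append(value.strip())
    return found


def _local(tag):
    # Strip the XML namespace: '{ns}loc' -> 'loc'
    return tag.rsplit("}", 1)[-1]


def parse_sitemap(xml_text):
    """Return (kind, entries); kind is 'index' or 'urlset'.

    Each entry is a (loc, lastmod) pair; lastmod is None when the tag is absent.
    """
    root = ET.fromstring(xml_text)
    kind = "index" if _local(root.tag) == "sitemapindex" else "urlset"
    entries = []
    for node in root:  # <sitemap> or <url> elements
        fields = {_local(child.tag): (child.text or "").strip() for child in node}
        if fields.get("loc"):
            entries.append((fields["loc"], fields.get("lastmod") or None))
    return kind, entries
```

When parse_sitemap reports "index", each loc is a child sitemap to fetch and parse in turn; when it reports "urlset", each loc is a page URL.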

Pair it: discover → extract → audit

This is the discover step of a clean "feed-your-AI" toolkit by dataquarry:

Discover — this actor: every URL of a site.
Extract — dataquarry/website-to-markdown: turn those URLs into clean, LLM-ready Markdown.
Audit — dataquarry/website-seo-metadata-checker: SEO & metadata for each page.

Also see the dataquarry OSM place-data scrapers and free guides at openplacedata.com.

Clean & honest

Reads only public sitemap.xml/robots.txt and (in fallback) public pages; respects robots.txt; sends a descriptive User-Agent; no logins, no PII. Missing values are null, never guessed.

FAQ

Do I need an API key? No — give it a URL and run it. It's free.

What if the site has no sitemap? It crawls the site's own links (same-domain, bounded) so you still get a URL list.
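
As a rough illustration of what a bounded, same-domain link pass might look like, the Python sketch below collects links from one HTML page. It is not the actor's code: the names same_site_links and limit are hypothetical, and "same-domain" is assumed here to mean an exact hostname match.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlsplit


class _LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def same_site_links(page_url, html, limit=100):
    """Return up to `limit` unique http(s) links on the same host as page_url."""
    host = urlsplit(page_url).hostname
    parser = _LinkCollector()
    parser.feed(html)
    links = []
    for href in parser.hrefs:
        absolute = urljoin(page_url, href)
        parts = urlsplit(absolute)
        # Skip mailto:, javascript:, and links that leave the starting host.
        if parts.scheme in ("http", "https") and parts.hostname == host:
            if absolute not in links:
                links.append(absolute)
                if len(links) >= limit:
                    break
    return links
```

A full crawl would repeat this for each newly found page until the result cap is reached, checking robots.txt before every fetch.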

Does it handle huge sitemap-indexes? Yes — it follows child sitemaps up to the maxSitemaps and maxResults caps you set.
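
One way to picture how the two caps interact is the Python sketch below, which walks a sitemap index breadth-first and stops at whichever limit is hit first. It is a hedged illustration, not the actor's implementation: walk_sitemaps and the injected fetch callable are hypothetical, the default of 50 sitemaps is an assumption, and 5000 simply mirrors the example input above.

```python
import xml.etree.ElementTree as ET
from collections import deque


def _local(tag):
    # Strip the XML namespace: '{ns}loc' -> 'loc'
    return tag.rsplit("}", 1)[-1]


def _locs(xml_text):
    """Return (is_index, [loc, ...]) for one sitemap document."""
    root = ET.fromstring(xml_text)
    locs = []
    for node in root:  # <sitemap> or <url> elements
        for child in node:
            if _local(child.tag) == "loc" and child.text:
                locs.append(child.text.strip())
    return _local(root.tag) == "sitemapindex", locs


def walk_sitemaps(start_url, fetch, max_sitemaps=50, max_results=5000):
    """Collect page URLs, fetching at most max_sitemaps files and returning
    at most max_results URLs. `fetch` maps a sitemap URL to its XML text."""
    queue = deque([start_url])
    visited = set()
    urls = []
    while queue and len(visited) < max_sitemaps and len(urls) < max_results:
        sitemap_url = queue.popleft()
        if sitemap_url in visited:
            continue
        visited.add(sitemap_url)
        is_index, locs = _locs(fetch(sitemap_url))
        if is_index:
            queue.extend(locs)  # child sitemaps to follow later
        else:
            urls.extend(locs[: max_results - len(urls)])
    return urls
```

Passing fetch as a parameter keeps the traversal logic separate from HTTP concerns such as the User-Agent, robots.txt checks, and one-request-at-a-time pacing.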

URL: https://apify.com/dataquarry/sitemap-url-extractor
