Sitemap URL Extractor - List All URLs in a Sitemap
Pricing
$0.50 / 1,000 url extracteds
Sitemap URL Extractor - List All URLs in a Sitemap
Extract every URL from any XML sitemap, with lastmod, changefreq and priority. Resolves sitemap indexes recursively. Pass a sitemap.xml or just a site root to auto-discover its sitemaps. Pure HTTP, no browser β fast and cheap.
Pricing
$0.50 / 1,000 url extracteds
Rating
0.0
(0)
Developer
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
12 days ago
Last modified
Categories
Share
Sitemap URL Extractor β List All URLs in a Sitemap
β Bookmark this actor so pulling every URL from a sitemap is one click away on your next crawl or audit.
Extract every URL from any XML sitemap β along with its lastmod, changefreq and priority. The actor resolves sitemap indexes recursively, so one nested sitemap_index.xml expands into all its child sitemaps automatically. Pass a direct sitemap.xml URL, or just a site root and it auto-discovers sitemaps from robots.txt.
Pure HTTP + XML parsing β no browser, fast and cheap.
What it does
- Reads a sitemap.xml, sitemap index, or site root (auto-discovery)
- Walks nested sitemap indexes recursively (depth-limited, dedup-safe)
- Returns one record per URL with loc, lastmod, changefreq, priority
- Tracks which source sitemap each URL came from
Input
| Field | Type | Description |
|---|---|---|
url | string | A sitemap.xml / sitemap index URL, or a site root to auto-discover sitemaps |
maxUrls | integer | Max URLs to extract (1β50000, default 10000) |
Example input
{"url":"https://www.theverge.com/sitemaps","maxUrls":5000}
You can also pass a site root like https://apify.com β the actor finds the sitemaps listed in robots.txt (and common locations) and extracts from them.
Output
One record per URL:
{"loc":"https://www.example.com/blog/post-1","lastmod":"2026-05-20","changefreq":"weekly","priority":"0.8","source_sitemap":"https://www.example.com/sitemap-posts.xml"}
Use cases
- SEO crawls β get the canonical URL list for a site before auditing it.
- Content inventory β export every page with its last-modified date.
- Migration QA β diff old vs new sitemaps to catch dropped URLs.
- Feed a scraper β pipe the URL list into another actor for full-page extraction.
Pricing
Pay-per-event: $0.0005 per URL extracted ($0.50 per 1,000 URLs). Up to 50,000 URLs per run.
Next step: make those URLs AI-discoverable
Have the full URL list? Turn it into a standards-compliant llms.txt so ChatGPT, Claude and Perplexity understand your site β the AI-era complement to your sitemap.
Built by dltik. Pure HTTP β no browser, fast and cheap.
