URL: https://apify.com/accurate_pouch/sitemap-analyzer

Sitemap Analyzer - Parse, Validate & Check URLs

Pricing

$4.00 / 1,000 sitemaps analyzed

Parse XML sitemaps, extract all URLs, validate structure (priority, changefreq, lastmod), optionally check HTTP status of every URL. Supports sitemap indexes.

Rating: 0.0 (0)

Developer: Manchitt Sanan (Maintained by Community)

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: a month ago

Parse XML sitemaps, extract all URLs with their metadata (lastmod, changefreq, priority), validate structure against standards, and optionally check HTTP status of every URL. Supports sitemap indexes. $0.003 per sitemap.


What it does

  1. Parse - extracts all <url> entries with loc, lastmod, changefreq, priority
  2. Validate - checks for invalid priority values, bad changefreq, missing lastmod, >50K URL limit
  3. Status check (optional) - HEAD request to every URL, reports broken links within the sitemap
  4. Sitemap index - detects <sitemapindex> and lists all child sitemaps
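
The parse, validate, and index-detection steps above can be sketched locally in Python with the standard library. This is only an illustration of the technique, not the Actor's actual code: the function name `analyze_sitemap`, the returned keys, and the issue messages are assumptions modelled on the Output example, and the allowed changefreq values and the 50,000 URL limit come from the sitemaps.org protocol.

```python
import xml.etree.ElementTree as ET

NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"
VALID_CHANGEFREQ = {"always", "hourly", "daily", "weekly", "monthly", "yearly", "never"}
MAX_URLS = 50_000  # per-file limit in the sitemaps.org protocol


def analyze_sitemap(xml_text: str) -> dict:
    """Parse sitemap XML and collect entries plus structural issues (hypothetical helper)."""
    root = ET.fromstring(xml_text)

    # A <sitemapindex> root lists child sitemaps instead of page URLs.
    if root.tag == NS + "sitemapindex":
        children = [loc.text.strip() for loc in root.iter(NS + "loc") if loc.text]
        return {"type": "sitemapindex", "sitemaps": children, "issues": []}

    entries, issues = [], []
    for url in root.findall(NS + "url"):
        # Missing or empty child elements are stored as None.
        entry = {
            field: (url.findtext(NS + field) or "").strip() or None
            for field in ("loc", "lastmod", "changefreq", "priority")
        }
        entries.append(entry)

        if entry["changefreq"] and entry["changefreq"] not in VALID_CHANGEFREQ:
            issues.append(f"Invalid changefreq '{entry['changefreq']}' for {entry['loc']}")
        if entry["priority"] is not None:
            try:
                in_range = 0.0 <= float(entry["priority"]) <= 1.0
            except ValueError:
                in_range = False
            if not in_range:
                issues.append(f"Invalid priority '{entry['priority']}' for {entry['loc']}")

    with_lastmod = sum(1 for e in entries if e["lastmod"])
    if with_lastmod < len(entries):
        issues.append(f"Only {with_lastmod}/{len(entries)} URLs have lastmod dates")
    if len(entries) > MAX_URLS:
        issues.append(f"{len(entries)} URLs exceeds the {MAX_URLS} URL limit")

    return {"type": "sitemap", "urlCount": len(entries), "entries": entries, "issues": issues}
```

Values are kept as strings, matching the Output example where priority appears as "0.8".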

Quick start

{
  "urls": ["https://www.sitemaps.org/sitemap.xml"],
  "checkStatus": false
}
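
One way to submit this input programmatically is the `apify-client` Python package. The sketch below is a hedged example rather than official usage for this Actor: the Actor ID `accurate_pouch/sitemap-analyzer` is taken from the page URL, `build_run_input` and `run_actor` are hypothetical helper names, the `APIFY_TOKEN` environment variable is an assumption, and it is assumed that results land in the run's default dataset.

```python
import os


def build_run_input(sitemap_urls, check_status=False):
    """Build the Quick start input object for one or more sitemap URLs."""
    if not sitemap_urls:
        raise ValueError("urls is required and must not be empty")
    return {"urls": list(sitemap_urls), "checkStatus": bool(check_status)}


def run_actor(run_input, token):
    """Start the Actor and return its dataset items (needs network access and a token)."""
    # Imported lazily so build_run_input works without the client installed.
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor("accurate_pouch/sitemap-analyzer").call(run_input=run_input)
    # Assumption: results are pushed to the run's default dataset.
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


if __name__ == "__main__":
    payload = build_run_input(["https://www.sitemaps.org/sitemap.xml"])
    token = os.environ.get("APIFY_TOKEN")
    if token:
        for item in run_actor(payload, token):
            print(item.get("sitemapUrl"), item.get("urlCount"), item.get("status"))
```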

Input

Field        Type     Default     Description
urls         array    (required)  Sitemap URLs to analyze
checkStatus  boolean  false       Check HTTP status of every URL in sitemap
timeout      integer  10000       Request timeout for status checks
dryRun       boolean  false       Analyze without charges
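
To make the checkStatus and timeout fields concrete, here is a minimal sketch of a HEAD-based status check using only the Python standard library. It is not the Actor's implementation. It assumes the timeout value is in milliseconds (the table does not state a unit) and that a 2xx or 3xx response counts as OK; `head_status` and `check_entries` are hypothetical names.

```python
import urllib.error
import urllib.request
from typing import Optional


def head_status(url: str, timeout_ms: int = 10000) -> Optional[int]:
    """Send a HEAD request and return the HTTP status code, or None on network failure."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        # Assumption: the timeout input is in milliseconds; urlopen expects seconds.
        with urllib.request.urlopen(request, timeout=timeout_ms / 1000) as response:
            return response.status
    except urllib.error.HTTPError as err:
        return err.code  # 4xx/5xx responses still carry a status code
    except OSError:
        return None  # DNS failure, refused connection, timeout, and similar


def check_entries(entries, timeout_ms=10000, fetch=head_status):
    """Annotate each entry with statusCode and statusOk, mirroring the Output example."""
    checked = []
    for entry in entries:
        code = fetch(entry["loc"], timeout_ms)
        # Assumption: any 2xx or 3xx status is treated as healthy.
        ok = code is not None and 200 <= code < 400
        checked.append({**entry, "statusCode": code, "statusOk": ok})
    return checked
```

The `fetch` parameter exists so the check can be exercised with a stub instead of real network calls. Note that `urlopen` follows redirects, so the code reported here is the final one in a redirect chain.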

Output

{
  "sitemapUrl": "https://example.com/sitemap.xml",
  "type": "sitemap",
  "urlCount": 142,
  "entries": [
    {
      "loc": "https://example.com/page-1",
      "lastmod": "2026-04-01",
      "changefreq": "weekly",
      "priority": "0.8",
      "statusCode": 200,
      "statusOk": true
    }
  ],
  "issues": [
    "Only 100/142 URLs have lastmod dates"
  ],
  "status": "success"
}
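
A record shaped like the one above can be condensed for a quick audit. The following sketch assumes only the field names shown in the example; `summarize_result` is a hypothetical helper, and it treats a missing statusOk (for runs without checkStatus) as "not checked" rather than broken.

```python
def summarize_result(result: dict) -> dict:
    """Condense one output record into counts useful for a quick audit."""
    entries = result.get("entries", [])
    # statusOk is assumed to be present only when checkStatus was enabled.
    broken = [e["loc"] for e in entries if e.get("statusOk") is False]
    return {
        "sitemapUrl": result.get("sitemapUrl"),
        "listedEntries": len(entries),
        "withLastmod": sum(1 for e in entries if e.get("lastmod")),
        "brokenUrls": broken,
        "issueCount": len(result.get("issues", [])),
    }
```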

Pricing

$0.003 per sitemap analyzed (pay-per-event pricing).

  • Errors and dry runs are never charged.
  • 10 sitemaps = $0.03
  • 100 sitemaps = $0.30
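
The arithmetic behind these figures is a flat per-sitemap rate. As a small sketch, using the $0.003 rate quoted above and a hypothetical `estimate_cost` helper (only successfully analyzed sitemaps should be counted, since errors and dry runs are not charged):

```python
from decimal import Decimal

PRICE_PER_SITEMAP = Decimal("0.003")  # rate quoted in this section


def estimate_cost(analyzed_sitemaps: int) -> Decimal:
    """Return the estimated charge in USD for a number of analyzed sitemaps."""
    if analyzed_sitemaps < 0:
        raise ValueError("analyzed_sitemaps must not be negative")
    return PRICE_PER_SITEMAP * analyzed_sitemaps


if __name__ == "__main__":
    for count in (10, 100):
        print(f"{count} sitemaps = ${estimate_cost(count):.2f}")
```

Decimal is used instead of float so that small per-unit prices add up without binary rounding noise.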

Related actors in this suite

Other tools by accurate_pouch for site crawling + structure analysis:

  • Broken Link Checker - Recursive crawl, sitemap + robots.txt parsing, webhook, Sheets export. $0.005/page.
  • Website Change Monitor - Text, hash, or selector-based change detection; diff + webhook. $0.005/page.
  • TheCrawler - Web scraper + LLM-powered structured extraction (handles hreflang, pagination, redirect chains). AGPL-3.0, also on npm (thecrawler@0.1.1). $0.005/page.
  • Lighthouse Auditor - PageSpeed Insights API, Core Web Vitals, deltas, competitor comparison. $0.005/audit.
  • Tech Stack Detector - 7,517 signatures across 105 categories. $0.02/URL.

Run on Apify

๐Ÿ‘ Run on Apify

No setup needed. Click above to run in the cloud. $0.003 per operation.

You might also like

Sitemap URL Extractor - List All URLs in a Sitemap

dltik/sitemap-url-extractor

Extract every URL from any XML sitemap, with lastmod, changefreq and priority. Resolves sitemap indexes recursively. Pass a sitemap.xml or just a site root to auto-discover its sitemaps. Pure HTTP, no browser - fast and cheap.

Sitemap Analyzer - Recursive Parse, Health Check, AI Tags

and_krm/sitemap-analyzer

Parse any sitemap.xml recursively, extract all URLs with metadata, check HTTP health status, and optionally cluster URLs by topic using Claude AI. Perfect for SEO audits and site migration.

Sitemap URL Extractor

seemuapps/sitemap-extractor

Extract every URL from a website's sitemap.xml. Recursively walks nested sitemap indexes and returns loc, lastmod, changefreq, and priority for each page.

Sitemap URL Extractor

mikolabs/sitemap-url-extractor

Extract every URL and its metadata from any sitemap.xml in seconds. Paste one or more sitemap URLs, run the Actor, and get a clean, structured dataset with url, lastmod, changefreq, priority, and more - ready to export as CSV, JSON, or Excel.

Sitemap URL Extractor

crawlerbros/sitemap-url-extractor

Extract every URL from any site's sitemap.xml. Handles sitemap index files (nested sitemaps), gzipped sitemaps, and robots.txt discovery. Returns URL, lastmod, changefreq, priority, and optional image/video/alternate-language fields. No proxy, no cookies, no login.

Sitemap Extractor: Every URL, Recursive, Reliable

thoob/sitemap-extractor

Reads sitemap.xml, sitemap index files, .gz compressed sitemaps, and robots.txt Sitemap directives, and returns one clean row per URL with lastmod, changefreq, and priority. Billed only per delivered URL.
