VOOZH about

URL: https://apify.com/automationagents/web-sitemap

⇱ Sitemap Extractor Β· Apify


Pricing

from $20.00 / 1,000 successful api calls

Go to Apify Store

Extract all URLs from a website's sitemap (XML, robots.txt, or crawl discovery).

Pricing

from $20.00 / 1,000 successful api calls

Rating

0.0

(0)

Developer

πŸ‘ Alex Jordan

Alex Jordan

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

9 days ago

Last modified

Categories

Share

What does Sitemap Extractor do?

Sitemap Extractor discovers and extracts all URLs from a website's sitemap β€” supporting XML sitemaps, sitemap index files, robots.txt discovery, and automatic crawl-based detection β€” returning a clean list of page URLs ready for bulk scraping or SEO analysis.

Built on the Apify platform, results are returned in seconds and integrate with Apify's scheduling, webhooks, and 1,500+ tools.

Why use Sitemap Extractor?

  • Bulk scraping setup β€” Get the full list of URLs on a site to feed into other scraping workflows
  • SEO auditing β€” Audit all indexed pages across a domain for on-page SEO issues
  • Content monitoring β€” Track new pages added to competitor sites over time with scheduled runs
  • Website migration β€” Extract all source URLs before a site migration to verify redirects
  • Competitor analysis β€” Understand the full content architecture of any competitor website

How to use Sitemap Extractor

  1. Click Try for free on this Actor's page
  2. Enter the website root URL or a direct sitemap URL (e.g. https://stripe.com or https://stripe.com/sitemap.xml)
  3. Set max_urls to limit results if needed (default 1000)
  4. Click Start and wait a few seconds
  5. Download your results from the Output tab in JSON, CSV, or Excel

Input

FieldTypeRequiredDescription
urlstringβœ…Website root URL or direct sitemap URL
max_urlsinteger❌Maximum number of URLs to return (default 1000)
cacheboolean❌Use cached result if available (default true)

Example input:

{
"url":"https://stripe.com",
"max_urls":500
}

Output

Example output:

{
"urls":[
"https://stripe.com/",
"https://stripe.com/payments",
"https://stripe.com/billing",
"https://stripe.com/docs"
],
"total_urls":487,
"sitemap_source":"https://stripe.com/sitemap.xml",
"meta":{"cache_hit":false,"execution_time_ms":1100}
}

You can download the dataset in various formats such as JSON, HTML, CSV, or Excel.

Data fields

FieldTypeDescription
urlsarrayList of all discovered page URLs
total_urlsintegerTotal number of URLs found
sitemap_sourcestringThe sitemap URL that was ultimately used

Pricing / Cost estimation

$0.02 per successful API call on Apify.

  • 1,000 successful Apify runs = $20.00

FAQ & Support

Is this legal? Sitemaps are intentionally published by website owners to help search engines index their pages β€” reading them is fully legitimate.

Known limitations: Password-protected sitemaps or those requiring authentication cannot be accessed. Very large sitemaps (100k+ URLs) will be truncated to max_urls.

Need help? Open an issue in the Issues tab or contact the support team for custom solutions.

You might also like

Sitemap Scraper

pvillalva/sitemap-scraper

The Sitemap Scraper extracts and outputs all URLs from a given sitemap.

πŸ‘ User avatar

Percival Villalva

268

Robots.txt & Sitemap Analyzer

automation-lab/robots-sitemap-analyzer

This actor fetches and parses robots.txt and sitemap.xml files for any list of websites. It extracts crawl directives (user-agent rules, allowed/disallowed paths, crawl-delay), discovers sitemap URLs, and counts the number of pages listed in each sitemap. Use it for SEO audits, competitive...

πŸ‘ User avatar

Stas Persiianenko

16

Sitemap URL Extractor

onescales/sitemap-url-extractor

Provide a website link to a sitemap.xml and the app will extract and list all URLs in the sitemap as well as additional data in the sitemap (i.e. https://onescales.com/sitemap.xml).

570

5.0

Sitemap URL Extractor

getdataforu/sitemap-url-extractor

Provide a website link to a sitemap.xml and the app will extract and list all URLs in the sitemap as well as additional data in the sitemap (i.e. https://onescales.com/sitemap.xml).

2

5.0

Find Sitemap from url

eesti/find-sitemap-from-url

A powerful [Apify Actor] that finds sitemap URLs for any website. This Actor helps you discover XML sitemaps by checking common locations, robots.txt files, and analyzing HTML content for sitemap links.