Pricing
from $3.00 / 1,000 results
Sitemap Keyword Extractor
Extract all pages from XML sitemaps and detect keywords from URL structures for SEO analysis and content planning. Fast, efficient, and production-ready.
What It Extracts
- Sitemap Data: All URLs, last modification dates, change frequencies, priorities
- Keyword Detection: Automatically extracts keywords from URL paths
- Page Metadata: Complete sitemap information for each page
- Statistics: Total page count and keyword analysis
Key Features
| Feature | Description |
|---|---|
| Sitemap Parsing | Supports standard XML sitemap format |
| Keyword Detection | Automatically extracts keywords from URL paths |
| Structured Output | Clean JSON format with page and keyword data |
| Fast Processing | Efficient parsing of large sitemaps |
| Error Handling | Gracefully handles malformed sitemaps |
| SEO Insights | Provides keyword counts and patterns |
Input
Required
- sitemapUrl (string): The URL of the sitemap.xml file
  - Example: "https://example.com/sitemap.xml"
  - Supports standard XML sitemap format
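
As an illustration of the input contract above, the following is a minimal sketch of how a caller might check the input before starting a run. The function name `validate_input` and the exact rules (absolute http or https URL) are assumptions for illustration; the Actor's own validation is not documented here.

```python
from urllib.parse import urlparse


def validate_input(actor_input: dict) -> str:
    """Return the cleaned sitemap URL, or raise ValueError (hypothetical helper)."""
    url = actor_input.get("sitemapUrl")
    if not isinstance(url, str) or not url.strip():
        raise ValueError("sitemapUrl is required and must be a non-empty string")

    url = url.strip()
    parsed = urlparse(url)
    # Assumption: only absolute http(s) URLs are meaningful sitemap locations.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"sitemapUrl must be an absolute http(s) URL, got {url!r}")
    return url
```

Checking the input on the caller's side avoids paying for a run that would fail immediately on a missing or relative URL.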
Output
Returns structured sitemap data:
    {
      "sitemapUrl": "https://example.com/sitemap.xml",
      "totalPages": 150,
      "pages": [
        {
          "url": "https://example.com/products/widget",
          "lastmod": "2024-01-15",
          "changefreq": "weekly",
          "priority": "0.8",
          "detectedKeywords": ["products", "widget"],
          "keywordCount": 2
        }
      ],
      "_metadata": {
        "runId": "abc123",
        "processedAt": "2024-01-15T12:00:00.000Z",
        "processingTimeMs": 2500
      }
    }
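
One way to work with this output is to aggregate the per-page keywords into site-wide counts. The sketch below assumes the result has already been loaded into a Python dict with the field names shown above; `keyword_frequencies` is a hypothetical helper, not part of the Actor.

```python
from collections import Counter


def keyword_frequencies(result: dict) -> Counter:
    """Count how often each detected keyword appears across all pages."""
    counts = Counter()
    for page in result.get("pages", []):
        # Pages without the field simply contribute nothing.
        counts.update(page.get("detectedKeywords", []))
    return counts


# Trimmed copy of the sample output, used only for illustration.
sample = {
    "sitemapUrl": "https://example.com/sitemap.xml",
    "pages": [
        {
            "url": "https://example.com/products/widget",
            "detectedKeywords": ["products", "widget"],
            "keywordCount": 2,
        }
    ],
}

top_keywords = keyword_frequencies(sample).most_common(10)
```

`most_common` returns keyword and count pairs sorted by frequency, which is a convenient starting point for spotting dominant URL themes.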
Use Cases
- SEO Audits: Analyze site structure and keyword distribution
- Content Planning: Identify content gaps and opportunities
- Competitor Analysis: Study competitor site structures
- Site Mapping: Generate comprehensive site maps
- Keyword Research: Extract keywords from URL patterns
- Content Strategy: Plan content based on existing structure
Technical Details
- Parser: Uses Cheerio for efficient XML parsing
- Keyword Detection: Extracts meaningful keywords from URL path segments
- Error Handling: Validates sitemap format and handles errors gracefully
- Performance: Optimized for processing large sitemaps efficiently
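
To make the parsing and error-handling steps concrete, here is a rough sketch of reading a standard XML sitemap. Note the substitution: the Actor uses Cheerio, while this example uses Python's standard-library `xml.etree.ElementTree` so it runs without dependencies. The function name `parse_sitemap` and the returned dict shape are assumptions modeled on the output fields, not the Actor's actual code.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol for <urlset> documents.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def parse_sitemap(xml_text: str) -> list:
    """Return one dict per <url> entry; raise ValueError on malformed XML."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError as exc:
        raise ValueError(f"Malformed sitemap XML: {exc}") from exc

    pages = []
    for url_el in root.findall("sm:url", SITEMAP_NS):
        loc = url_el.findtext("sm:loc", default="", namespaces=SITEMAP_NS).strip()
        if not loc:
            continue  # an entry without <loc> cannot be used
        pages.append({
            "url": loc,
            # Optional fields are None when the element is absent.
            "lastmod": url_el.findtext("sm:lastmod", namespaces=SITEMAP_NS),
            "changefreq": url_el.findtext("sm:changefreq", namespaces=SITEMAP_NS),
            "priority": url_el.findtext("sm:priority", namespaces=SITEMAP_NS),
        })
    return pages
```

This sketch covers `<urlset>` documents only; sitemap index files (`<sitemapindex>`) would need a separate pass, and whether the Actor follows them is not stated in the source.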
Example Usage
Basic Extraction
{"sitemapUrl":"https://example.com/sitemap.xml"}
Large Sitemaps
{"sitemapUrl":"https://large-site.com/sitemap.xml"}
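
The inputs above can also be submitted programmatically. The sketch below uses the `apify-client` Python package; the Actor ID is a hypothetical placeholder (the real ID is shown on the Actor's store page), and reading results from the run's default dataset is an assumption about where this Actor stores its output.

```python
# Hypothetical placeholder: replace with the Actor ID from the store page.
ACTOR_ID = "<username>/sitemap-keyword-extractor"


def build_run_input(sitemap_url: str) -> dict:
    """Build the input object in the shape shown in the examples above."""
    return {"sitemapUrl": sitemap_url}


def run_extractor(sitemap_url: str, token: str) -> list:
    """Start a run, wait for it to finish, and return the dataset items."""
    # Imported lazily so the module loads without the package installed
    # (pip install apify-client).
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(ACTOR_ID).call(run_input=build_run_input(sitemap_url))
    if run is None:
        raise RuntimeError("Actor run did not complete")
    # Assumption: results are written to the run's default dataset.
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

A call such as `run_extractor("https://example.com/sitemap.xml", token)` needs a valid Apify API token, typically read from an environment variable rather than hard-coded.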
Important Notes
- Supports standard XML sitemap format
- Automatically filters out non-meaningful URL segments for keyword detection
- Handles sitemaps with thousands of URLs efficiently
- Keywords are extracted from URL path segments (e.g., /products/widget → ["products", "widget"])
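
The keyword rule in the notes above can be sketched as follows. The filter list, the handling of numeric segments and file extensions, and the name `detect_keywords` are all assumptions chosen for illustration; the Actor's actual filtering rules are not documented here.

```python
import re
from urllib.parse import unquote, urlparse

# Assumption: segments treated as non-meaningful; the real list is unknown.
IGNORED_SEGMENTS = {"index", "www", "html", "php", "page"}


def detect_keywords(url: str) -> list:
    """Return the meaningful path segments of a URL, in order."""
    path = unquote(urlparse(url).path)
    keywords = []
    for segment in path.split("/"):
        # Drop a trailing file extension such as ".html", then normalize case.
        segment = re.sub(r"\.[A-Za-z0-9]+$", "", segment).lower()
        if len(segment) < 2 or segment.isdigit() or segment in IGNORED_SEGMENTS:
            continue
        keywords.append(segment)
    return keywords


print(detect_keywords("https://example.com/products/widget"))
# ['products', 'widget']
```

Hyphenated segments such as `blue-widget` are kept whole in this sketch; splitting them into separate keywords would be an equally plausible design.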
