👁 Robots.txt Validator - Crawl Rules Analyzer avatar

Robots.txt Validator - Crawl Rules Analyzer

Pricing

from $1.00 / 1,000 results

👁 Robots.txt Validator - Crawl Rules Analyzer

Robots.txt Validator - Crawl Rules Analyzer

Analyze robots.txt files for any domain. Extract crawl rules, sitemaps, blocked paths, and crawl-delay settings. Validate configuration and identify SEO issues in bulk.

Pricing

from $1.00 / 1,000 results

Rating

0.0

(0)

Developer

👁 Ava Torres

Ava Torres

Maintained by Community

Actor stats

Bookmarked

Total users

Monthly active users

2 months ago

Last modified

robots.txt Validator & Analyzer

Fetch, parse, and analyze robots.txt files for any domain in bulk. Built for SEO professionals, developers, and crawler operators who need to audit site access rules at scale.

What It Does

For each domain you supply, the actor:

Fetches /robots.txt from the domain root over HTTPS (falls back gracefully on 404 or network errors)
Parses all User-agent, Allow, Disallow, Crawl-delay, and Sitemap directives
Reports structured rules grouped by user-agent
Optionally checks whether specific paths are allowed or blocked for your chosen user-agent

Input

Field	Type	Required	Description
`urls`	string[]	Yes	Domains or full URLs (e.g. `google.com`, `https://openai.com/blog`)
`userAgent`	string	No	User-agent to evaluate rules for. Defaults to `*`
`checkPaths`	string[]	No	Specific paths to test for allow/disallow (e.g. `/admin`, `/api/`)
`maxResults`	integer	No	Cap on domains to process. Defaults to 100

Output

One record per domain:

Field	Description
`domain`	Domain name
`robotsTxtUrl`	Full URL of the fetched robots.txt
`robotsTxtFound`	`true` if HTTP 200 was returned
`robotsTxtContent`	Raw robots.txt text
`userAgentRules`	Parsed rule blocks, each with `userAgent` and `rules` array of `{directive, path}`
`sitemapUrls`	All Sitemap URLs declared in the file
`crawlDelay`	Crawl-delay in seconds for the requested user-agent (null if not set)
`analyzedPaths`	Per-path results: `{path, allowed}` for each path in `checkPaths`
`fetchError`	Error message if the file could not be fetched

Example Use Cases

SEO audit: Check which bots can access which parts of your site
Crawler compliance: Verify your spider respects Disallow rules before running at scale
Competitive research: Understand what paths competitors block from indexing
Security review: Identify paths hidden from crawlers (admin panels, staging URLs)
Sitemap discovery: Extract all declared sitemap URLs without manual inspection

Pricing

$0.10 per 1,000 domains checked. Typical run of 100 domains costs less than $0.02.

👁 Robots.txt Generator avatar

Robots.txt Generator

automation-lab/robots-txt-generator

Generate valid robots.txt files from structured rules. Apply presets (block AI bots, SEO-friendly), add custom per-bot rules, sitemaps, and crawl-delay. Zero-proxy, instant output.

👁 User avatar

Stas Persiianenko

👁 Robots Txt Analyzer avatar

Robots Txt Analyzer

zerobreak/robots-txt-analyzer

Robots txt analyzer that fetches and parses crawl rules from any website in bulk, so SEO teams and developers can audit blocked paths, user agents, and sitemap locations across hundreds of domains without manual work.

👁 User avatar

ZeroBreak

Robots.txt Validator - Check Rules, Sitemaps & Crawl Directives

scrappy_garden/robots-txt-validator

Validate robots.txt for one or more websites: fetches /robots.txt per host, parses directive groups (User-agent/Allow/Disallow/Crawl-delay/Sitemap), reports common errors and warnings, and can test URLs against the chosen User-Agent.

👁 User avatar

Bikram Adhikari

robots.txt Parser & URL Tester

scrapeworks/robots-txt

Fetch and parse robots.txt for any site: user-agent rules, crawl-delay, and declared sitemaps. Optionally test whether specific URLs are allowed for a given user-agent, using correct longest-match rules.

👁 User avatar

Nicolas van Arkens

👁 Robots.txt & Sitemap Analyzer avatar

Robots.txt & Sitemap Analyzer

automation-lab/robots-sitemap-analyzer

This actor fetches and parses robots.txt and sitemap.xml files for any list of websites. It extracts crawl directives (user-agent rules, allowed/disallowed paths, crawl-delay), discovers sitemap URLs, and counts the number of pages listed in each sitemap. Use it for SEO audits, competitive...

👁 User avatar

Stas Persiianenko

Robots.txt Validator

predictable_function/my-actor-3

List of website base URLs whose robots.txt files will be validated

👁 User avatar

riya rawat

5.0

Robots.txt Auditor & Sitemap Finder

andok/robotstxt-auditor

Scan robots.txt files in bulk to extract sitemap URLs and verify crawler directives for technical SEO compliance.

👁 User avatar

Andok

👁 Robots.txt Generator avatar

Robots.txt Generator

maximedupre/robots-txt-generator

Generate deployable robots.txt files from presets, custom bot rules, sitemap URLs, and host directives. Create one file or batch files for multiple sites, then export raw text plus validation data.

👁 User avatar

Maxime Dupré

👁 robots.txt Parser & AI Crawler Block Checker avatar

robots.txt Parser & AI Crawler Block Checker

taroyamada/robotstxt-ai-checker

robots.txt parser that audits AI crawler block rules (GPTBot, ClaudeBot, anthropic-ai, PerplexityBot) across thousands of websites in one run. Returns per-bot allow/disallow disposition and crawl-delay.

👁 User avatar

naoki anzai

Sitemap Extractor

automationagents/web-sitemap

Extract all URLs from a website's sitemap (XML, robots.txt, or crawl discovery).

👁 User avatar

Alex Jordan

URL: https://apify.com/pink_comic/robots-txt-validator

⇱ Robots.txt Validator - Crawl Rules Analyzer · Apify

Robots.txt Validator - Crawl Rules Analyzer

robots.txt Validator & Analyzer

What It Does

Input

Output

Example Use Cases

Pricing

You might also like

Robots.txt Generator

Robots Txt Analyzer

Robots.txt Validator - Check Rules, Sitemaps & Crawl Directives

robots.txt Parser & URL Tester

Robots.txt & Sitemap Analyzer

Robots.txt Validator

Robots.txt Auditor & Sitemap Finder

Robots.txt Generator

robots.txt Parser & AI Crawler Block Checker

Sitemap Extractor