URL: https://apify.com/gentle_cloud/html-validity-report-generator

HTML Validity Report Generator · Apify


Pricing

Pay per usage

HTML Validity Report Generator

Validate web pages against W3C HTML standards. Get detailed error, warning, and info reports using the official W3C Nu HTML Checker API.

Rating: 0.0 (0)

Developer: Monkey Coder

Maintained by Community

Actor stats

  • Bookmarked: 1
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 3 months ago

🔍 Validate web pages against W3C HTML standards and get detailed quality reports.

✨ What It Does

This Actor validates web pages using the official W3C Nu HTML Checker API. It checks your pages for HTML errors, warnings, and informational messages, helping you improve code quality, SEO, and accessibility compliance.

🔧 How It Works

  1. Takes a list of URLs as input
  2. Sends each URL to the W3C Nu HTML Checker API
  3. Collects validation results (errors, warnings, info messages)
  4. Outputs structured data in summary and/or detailed format
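The steps above can be sketched roughly as follows. This is a minimal illustration, not the Actor's actual source: it assumes the public Nu HTML Checker endpoint at `https://validator.w3.org/nu/` with the `doc` and `out=json` query parameters, and the helper names (`build_checker_url`, `fetch_report`, `classify`) and the User-Agent string are hypothetical. Treating `non-document-error` messages as errors is also an assumption.

```python
import json
import urllib.parse
import urllib.request

# Public Nu HTML Checker endpoint (assumed to be the one the Actor calls).
NU_CHECKER_ENDPOINT = "https://validator.w3.org/nu/"


def build_checker_url(page_url: str) -> str:
    """Return the checker URL that requests a JSON report for page_url."""
    query = urllib.parse.urlencode({"doc": page_url, "out": "json"})
    return f"{NU_CHECKER_ENDPOINT}?{query}"


def fetch_report(page_url: str, timeout: float = 30.0) -> dict:
    """Fetch and decode the JSON report. Performs a network request."""
    request = urllib.request.Request(
        build_checker_url(page_url),
        headers={"User-Agent": "html-validity-sketch/0.1"},  # hypothetical
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


def classify(message: dict) -> str:
    """Map one checker message to 'error', 'warning', or 'info'."""
    msg_type = message.get("type")
    # Assumption: fetch/processing failures are counted as errors.
    if msg_type in ("error", "non-document-error"):
        return "error"
    # The checker reports warnings as type "info" with subType "warning".
    if msg_type == "info" and message.get("subType") == "warning":
        return "warning"
    return "info"
```

A run would call `fetch_report(url)["messages"]` for each input URL and pass every message through `classify` before building output rows.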

📖 How to Use

  1. Add URLs: Enter one or more web page URLs to validate
  2. Choose Output Mode:
    • Summary: One row per URL with error/warning/info counts and validity status
    • Detailed: One row per validation message with line numbers, code extracts, and descriptions
    • Both: Get both summary and detailed rows (default)
  3. Configure Filters: Optionally include or exclude warnings and info messages
  4. Run: The Actor validates each URL and pushes results to the dataset
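One way to picture how the output mode and filters combine is the sketch below. The parameter names (`output_mode`, `include_warnings`, `include_info`) are hypothetical and may not match the Actor's real input fields. The sketch also assumes the filters apply only to detailed rows and leave the summary counts untouched, which the listing does not state.

```python
def select_rows(summary, details, output_mode="both",
                include_warnings=True, include_info=True):
    """Return the rows to push to the dataset for one URL.

    summary: one summary dict for the URL.
    details: list of detailed dicts, each with a "message_type" key.
    """
    if output_mode not in ("summary", "detailed", "both"):
        raise ValueError(f"unknown output mode: {output_mode!r}")

    rows = []
    if output_mode in ("summary", "both"):
        rows.append(summary)
    if output_mode in ("detailed", "both"):
        for row in details:
            kind = row["message_type"]
            # Errors are always kept; warnings and info are optional.
            if kind == "warning" and not include_warnings:
                continue
            if kind == "info" and not include_info:
                continue
            rows.append(row)
    return rows
```

With the defaults, one URL yields its summary row followed by every detailed row, which matches the "Both" mode described above.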

📊 Output Example

Summary Row

{
  "url": "https://www.example.com",
  "is_valid": true,
  "total_errors": 0,
  "total_warnings": 2,
  "total_info": 1,
  "total_messages": 3,
  "checked_at": "2025-01-15T10:30:00.000000"
}
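A summary row like the one above could be derived from the classified messages roughly as shown here. This is a sketch under assumptions: `is_valid` is taken to mean "zero errors" (consistent with the example, where two warnings still give `true`), the timestamp is assumed to be a naive UTC value, and the function name `summarize` is hypothetical.

```python
from collections import Counter
from datetime import datetime, timezone


def summarize(url, kinds, checked_at=None):
    """Build a summary row from message kinds ('error'/'warning'/'info')."""
    kinds = list(kinds)
    counts = Counter(kinds)
    if checked_at is None:
        # Assumption: timestamps are UTC without an explicit offset.
        checked_at = datetime.now(timezone.utc).replace(tzinfo=None)
    return {
        "url": url,
        "is_valid": counts["error"] == 0,  # assumed definition of validity
        "total_errors": counts["error"],
        "total_warnings": counts["warning"],
        "total_info": counts["info"],
        "total_messages": len(kinds),
        "checked_at": checked_at.isoformat(timespec="microseconds"),
    }
```

Calling `summarize("https://www.example.com", ["warning", "warning", "info"], datetime(2025, 1, 15, 10, 30))` reproduces the counts and timestamp format of the example row.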

Detailed Row

{
  "url": "https://www.example.com",
  "message_type": "error",
  "message": "Duplicate attribute \"class\".",
  "line": 42,
  "column": 18,
  "first_line": null,
  "first_column": null,
  "extract": "<div class=\"main\" class=\"container\">",
  "hilite_start": 10,
  "hilite_length": 15,
  "checked_at": "2025-01-15T10:30:00.000000"
}
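The detailed row appears to mirror the fields of a Nu HTML Checker JSON message with snake_case names. The mapping below is a guess based on that resemblance (for example `lastLine` to `line` and `hiliteStart` to `hilite_start`), not the Actor's confirmed logic, and `to_detailed_row` is a hypothetical name. The checker can omit `firstLine` and `firstColumn`, which would explain the `null` values in the example.

```python
def to_detailed_row(url, message, checked_at):
    """Convert one checker message dict into a detailed output row."""
    # Warnings arrive as type "info" with subType "warning".
    if message.get("subType") == "warning":
        message_type = "warning"
    else:
        message_type = message.get("type", "info")
    return {
        "url": url,
        "message_type": message_type,
        "message": message.get("message"),
        "line": message.get("lastLine"),
        "column": message.get("lastColumn"),
        "first_line": message.get("firstLine"),      # None when omitted
        "first_column": message.get("firstColumn"),  # None when omitted
        "extract": message.get("extract"),
        "hilite_start": message.get("hiliteStart"),
        "hilite_length": message.get("hiliteLength"),
        "checked_at": checked_at,
    }
```

Using `dict.get` throughout means a message that lacks optional position fields still produces a complete row, with the missing values set to `None`.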

📝 Notes

  • Rate Limiting: The W3C validator is a free public service. To avoid overloading it, the Actor waits 1 second between URL validations, so large batches run at no more than about 1 URL per second.
  • Memory: 256 MB is sufficient for most runs. Increase it for very large pages (100+ validation messages each).
  • Timeout: The default is 30 seconds per URL. Increase it for slow-loading pages.
  • No API Key Required: Uses the public W3C Nu HTML Checker, which is free and needs no registration.
  • Validation Types: Checks HTML5 compliance, including proper tag nesting, required attributes, deprecated elements, accessibility issues, and more.
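To get a feel for the rate-limiting note, the sketch below shows a simple throttled loop and a rough run-time estimate. Both function names are hypothetical, the placement of the delay (between validations, not before the first one) is an assumption, and the average request time is a value you would have to measure yourself.

```python
import time


def validate_all(urls, validate, delay_seconds=1.0, sleep=time.sleep):
    """Call validate(url) for each URL, pausing between consecutive calls."""
    results = []
    for index, url in enumerate(urls):
        if index > 0:
            sleep(delay_seconds)  # be polite to the public validator
        results.append(validate(url))
    return results


def estimate_seconds(url_count, delay_seconds=1.0, avg_request_seconds=0.0):
    """Rough total run time: request time plus the pauses in between."""
    if url_count <= 0:
        return 0.0
    return url_count * avg_request_seconds + (url_count - 1) * delay_seconds
```

For instance, 100 URLs at an assumed 0.5 seconds per request come to about 149 seconds, which is why the effective throughput stays below 1 URL per second.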
