W3C Html Reporter

Pricing

from $2.00 / 1,000 results

Try for free

Go to Apify Store

👁 W3C Html Reporter

W3C Html Reporter

Try for free

Get HTML validity reports from various web pages using W3C HTML validator.

Pricing

from $2.00 / 1,000 results

Rating

0.0

(0)

Developer

👁 Alexandre Paradis

Alexandre Paradis

Maintained by Community

Actor stats

Bookmarked

Total users

Monthly active users

4 months ago

Last modified

W3C HTML Validity Reporter

The W3C HTML Validity Reporter is an Apify actor that generates reports on the validity of given webpages HTML according to the W3C HTML Validator. The actor takes webpages URL as input and produces reports with detailed information on the validity of the webpages HTML.

Input

The actor takes the following input:

startUrls (required): The URL of the webpages to validate.
proxy (Object): Proxy configuration. You can edit this to use Apify proxy, or provide your own proxy servers. Default value is { "useApifyProxy": false }.
debug (Boolean): See detailed logs when activated. Default value is false.

Output

The actor generates a JSON report on the validity of the webpages HTML. The report includes:

A list of messages given by the validator

Usage

To use the actor, you'll need an Apify account. If you don't have one, sign up for free on the Apify website.

Once you have an account, you can run the actor by creating a new task with the following configuration:

{
"startUrls":[{
"url":"https://example.com"
}
],
"proxy":{
"useApifyProxy":false
},
"debug":false
}

Replace "https://example.com" with the URL of the webpage you want to validate.

Please note that w3c validator use Cloudflare to protect their website against bot. You may need to use Apify proxy in order to use this crawler.

Results example

The output from scraping W3C validator is stored in the dataset. Each messsage is stored as an item inside the dataset. After the run is finished, you can download the scraped data onto your computer or export to any web app in various data formats (JSON, CSV, XML, RSS, HTML Table). Here's a few examples of the outputs you can get:

{
"url":"https://apify.com",
"language":"en",
"severity":"info",
"lastLine":10,
"firstColumn":301,
"lastColumn":357,
"message":"Trailing slash on void elements has no effect and interacts badly with unquoted attribute values.",
"markup":"rowser.\"/><meta name=\"twitter:card\" content=\"summary_large_image\"/><meta ",
"highlightIndex":10,
"highlightLength":57
}

{
"url":"https://apify.com",
"language":"en",
"severity":"warning",
"firstLine":614,
"lastLine":614,
"firstColumn":5684,
"lastColumn":5721,
"message":"Section lacks heading. Consider using “h2”-“h6” elements to add identifying headings to all sections, or else use a “div” element instead for any cases where no heading is needed.",
"markup":"-0 wwExY\"><section class=\"sc-1913faef-1 jYOdxN\"><div c",
"highlightIndex":10,
"highlightLength":38
}

{
"url":"https://apify.com",
"language":"en",
"severity":"error",
"lastLine":10,
"firstColumn":1210,
"lastColumn":1272,
"message":"A “meta” element with an “http-equiv” attribute whose value is “X-UA-Compatible” must have a “content” attribute with the value “IE=edge”.",
"markup":"ent=\"24\"/><meta http-equiv=\"X-UA-Compatible\" content=\"IE=edge,chrome=1\"/><meta ",
"highlightIndex":10,
"highlightLength":63
}

👁 HTML Validity Report Generator avatar

HTML Validity Report Generator

gentle_cloud/html-validity-report-generator

Validate web pages against W3C HTML standards. Get detailed error, warning, and info reports using the official W3C Nu HTML Checker API.

👁 User avatar

Monkey Coder

👁 HTML Validity Report Generator avatar

HTML Validity Report Generator

tempting_district/html-validity-report-generator

Generate deterministic HTML validity reports with standards-based findings and exact element-level source locations.

👁 User avatar

Lone

👁 HTML Scraper avatar

HTML Scraper

making-data-meaningful/html-scraper

Access and extract full HTML source code from any webpage instantly. The HTML Scraper API lets you retrieve clean, accurate page HTML for SEO analysis, web scraping, and content monitoring - all without being blocked.

👁 User avatar

Scrape Hub

👁 W3C Standards Catalog Scraper avatar

W3C Standards Catalog Scraper

parseforge/w3c-standards-catalog-scraper

Scrape W3C standards catalog: title, status, type, date, editors, abstract, shortname, group, deliverer, errata, and specification URL. Covers Recommendations, Working Drafts, Notes, and Candidate Recommendations. Export web standards to JSON, CSV, or Excel for developer tooling.

👁 User avatar

ParseForge

👁 HTML Scraper pro avatar

HTML Scraper pro

scrapingxpert/html-scraper-pro

The HTML Scraper Pro is a powerful tool designed to extract the HTML source code and metadata from websites. It uses advanced web scraping techniques to retrieve the full HTML content of web pages,page title and HTTP status code.This tool is ideal for data extraction, website analysis, and archiving

👁 User avatar

scrapingxpert

321

5.0

👁 HTML to JSON Smart Parser avatar

HTML to JSON Smart Parser

parseforge/html-to-json-smart-parser

Convert HTML to structured JSON using AI! Uses OpenAI to extract and structure data from HTML into clean JSON format. Perfect for developers and data analysts who need to transform HTML into structured data without manual parsing.

👁 User avatar

ParseForge

5.0

Html Renderer

jakubbalada/html-renderer

Generate image for your HTML using a headless browser

👁 User avatar

Jakub Balada

My Actor

david15999/my-actor

HTML scraper

👁 User avatar

David Emanuel Moreira

👁 🌐 Download HTML from URLs avatar

🌐 Download HTML from URLs

scrapio/download-html-from-urls

🌐📥 Download HTML from any URL with download-html-from-urls. Extract and save raw page source for analysis, scraping, or automation—fast, reliable, and easy to use. Perfect for developers and data teams. 🚀✨

👁 User avatar

Scrapio

👁 Download HTML from URLs avatar

Download HTML from URLs

datapilot/download-html-from-urls

This script with an Apify Actor to fetch the complete HTML source of any website. The user provides a URL, the page is loaded with JavaScript execution, the full HTML is printed in the terminal, saved to an HTML file,

👁 User avatar

Data Pilot

👁 Blog article image

How to parse HTML in JavaScript

URL: https://apify.com/service-paradis/w3c-html-reporter