Headless Browser HTML Scraper

Pricing

from $1.00 / 1,000 results

Headless Browser HTML Scraper

Render any URL in a real headless browser and return the fully-rendered HTML, the page text, or a selected area by CSS selector. Scroll for lazy content, wait for elements, and capture screenshots. A browserless-style HTML API on Apify.

Pricing

from $1.00 / 1,000 results

Rating

0.0

(0)

Developer

👁 Dev Patel

Dev Patel

Maintained by Community

Actor stats

Bookmarked

Total users

Monthly active users

12 days ago

Last modified

What it does

🌐 Renders any URL with a real browser (JavaScript executed)
🧩 Selected area — pass a CSS selector and get every matching element's HTML, text, attributes, and position
📜 Scroll to bottom — trigger infinite-scroll / lazy-loaded content with real wheel events
⏳ Wait for a selector, a load event, or a fixed delay
🖼️ Optional full-page screenshot
🚫 Block images/media/fonts/CSS to speed up and cut bandwidth
🔌 Use it synchronously as an API (run-sync-get-dataset-items)

Input

Field	Type	Description
`urls`	array	Required. URLs to render and scrape.
`selector`	string	Optional CSS selector for the "selected area". Returns each match's HTML/text/attributes/position. Empty = full page only.
`scrollToBottom`	boolean	Scroll down to load lazy content. Default `false`.
`maxScrolls`	integer	Max scroll rounds when scrolling. Default `15`.
`waitForSelector`	string	Wait until this selector appears (≤30s).
`waitUntil`	enum	`domcontentloaded` (default) · `load` · `networkidle`.
`waitMs`	integer	Extra fixed wait after load (ms).
`htmlMode`	enum	`full` (entire DOM, default) or `visible` — just the above-the-fold content shown on open (no scroll), scripts/styles stripped for a short, clean HTML.
`blockResources`	array	Resource types to block. Default `["media","font"]`.
`returnFullHtml`	boolean	Include the rendered HTML (full or visible per `htmlMode`). Default `true`.
`returnText`	boolean	Include page visible text. Default `true`.
`includeScreenshot`	boolean	Capture a full-page screenshot and return its URL. Default `false`.
`proxyConfiguration`	object	Apify Proxy (datacenter) by default; use Residential for bot-protected sites.

Example: full HTML of a JS-rendered page

{"urls":[{"url":"https://www.example.com"}],"waitUntil":"networkidle"}

Example: extract a selected area, after scrolling

{
"urls":[{"url":"https://news.ycombinator.com"}],
"selector":"span.titleline a",
"scrollToBottom":true
}

Output

One record per URL:

{
"url":"https://www.example.com",
"loadedUrl":"https://www.example.com/",
"statusCode":200,
"title":"Example Domain",
"html":"<!DOCTYPE html><html>...</html>",
"text":"Example Domain\nThis domain is for use in...",
"selectedCount":30,
"selectedElements":[
{
"text":"Some headline",
"html":"<a href=\"...\">Some headline</a>",
"attributes":[{"name":"href","value":"https://..."}],
"width":320,"height":18,"top":140,"left":24
}
],
"screenshotUrl":"https://api.apify.com/v2/key-value-stores/.../records/screenshot-1",
"scrapedAt":"2026-06-13T08:00:00.000Z"
}

Use as an API

curl-X POST "https://api.apify.com/v2/acts/USERNAME~browserless-html-scraper/run-sync-get-dataset-items?token=TOKEN"\
-H"Content-Type: application/json"\
-d'{"urls":[{"url":"https://www.example.com"}],"selector":"h1"}'

Notes

For bot-protected sites, switch proxyConfiguration to Residential.
Blocking image/stylesheet speeds things up but can break layout-dependent lazy scrolling on some sites — keep them enabled (don't block) when using scrollToBottom on such pages.

Html Renderer

jakubbalada/html-renderer

Generate image for your HTML using a headless browser

👁 User avatar

Jakub Balada

👁 Screenshot & HTML file from Url avatar

Screenshot & HTML file from Url

leadsbrary/screenshot-html-file-from-url

From 1$/1000 results. Capture website screenshots &/or full-page HTML in one run, from $1/1000 URLs. PNG, JPEG & PDF — full-page, custom viewport, lazy-load scroll, cookie-banner hiding, batch mode. HTML files open correctly in any browser. REST API ready. No watermark.

👁 User avatar

Alexandre Manguis

5.0

👁 Screenshots from HTML avatar

Screenshots from HTML

vojtam/screenshots-from-html

Actor creates screenshots from a saved HTML structure.

👁 User avatar

Vojtěch Mašláň

👁 HTML to PDF Converter avatar

HTML to PDF Converter

rainminer/html-to-pdf-converter

Convert raw HTML or web page URLs into downloadable PDF files using a real browser. Render CSS, images, tables, invoices, reports, and dynamic layouts, then save the generated PDF to the Apify Key-Value Store with dataset metadata.

👁 User avatar

rainminer

👁 HTML Scraper avatar

HTML Scraper

making-data-meaningful/html-scraper

Access and extract full HTML source code from any webpage instantly. The HTML Scraper API lets you retrieve clean, accurate page HTML for SEO analysis, web scraping, and content monitoring - all without being blocked.

👁 User avatar

Scrape Hub

👁 Download HTML from URLs avatar

Download HTML from URLs

datapilot/download-html-from-urls

This script with an Apify Actor to fetch the complete HTML source of any website. The user provides a URL, the page is loaded with JavaScript execution, the full HTML is printed in the terminal, saved to an HTML file,

👁 User avatar

Data Pilot

👁 Generic Html Scraper avatar

Generic Html Scraper

daddyapi/generic-html-scraper

A lightweight, robust, and simple actor to fetch the raw HTML content of any URL

👁 User avatar

DaddyAPI

👁 HTML Scraper pro avatar

HTML Scraper pro

scrapingxpert/html-scraper-pro

The HTML Scraper Pro is a powerful tool designed to extract the HTML source code and metadata from websites. It uses advanced web scraping techniques to retrieve the full HTML content of web pages,page title and HTTP status code.This tool is ideal for data extraction, website analysis, and archiving

👁 User avatar

scrapingxpert

321

5.0

👁 🌐 Download HTML from URLs avatar

🌐 Download HTML from URLs

scrapio/download-html-from-urls

🌐📥 Download HTML from any URL with download-html-from-urls. Extract and save raw page source for analysis, scraping, or automation—fast, reliable, and easy to use. Perfect for developers and data teams. 🚀✨

👁 User avatar

Scrapio

👁 Smart Page Fetcher — HTML, Markdown & Text avatar

Smart Page Fetcher — HTML, Markdown & Text

shelvick/smart-page-fetcher

Fetch a batch of URLs and get the page as HTML, Markdown, or clean text. Tries plain HTTP first, renders JavaScript in a real browser when needed, and escalates to stealth + residential proxy for Cloudflare-protected, bot-defended pages, per URL. Pay only for the difficulty each URL needed.

👁 User avatar

Scott Helvick

👁 Blog article image

How to parse HTML in JavaScript

👁 Blog article image

How to scrape dynamic websites with Python

👁 Blog article image

How to use Playwright selectors

URL: https://apify.com/patel_dev_automation/browserless-html-scraper