
URL: https://apify.com/patel_dev_automation/browserless-html-scraper

Headless Browser HTML Scraper - Render & Extract API · Apify


Pricing

from $1.00 / 1,000 results


Headless Browser HTML Scraper

Render any URL in a real headless browser and return the fully-rendered HTML, the page text, or a selected area by CSS selector. Scroll for lazy content, wait for elements, and capture screenshots. A browserless-style HTML API on Apify.


Rating: 0.0 (0)

Developer: Dev Patel (maintained by Community)

Actor stats

  • Bookmarked: 1
  • Total users: 6
  • Monthly active users: 4
  • Last modified: 12 days ago


A generic, browserless-style HTML API. Give it any URL and it opens a real headless Chromium browser, fully renders the JavaScript, optionally scrolls and waits, then returns the full rendered HTML, or just a selected area by CSS selector.

Think of it as a self-hosted browserless.io /content + /scrape on Apify.

What it does

  • 🌐 Renders any URL with a real browser (JavaScript executed)
  • 🧩 Selected area: pass a CSS selector and get every matching element's HTML, text, attributes, and position
  • 📜 Scroll to bottom: trigger infinite-scroll / lazy-loaded content with real wheel events
  • ⏳ Wait for a selector, a load event, or a fixed delay
  • 🖼️ Optional full-page screenshot
  • 🚫 Block images/media/fonts/CSS to speed up rendering and cut bandwidth
  • 🔌 Use it synchronously as an API (run-sync-get-dataset-items)

Input

| Field | Type | Description |
|---|---|---|
| urls | array | Required. URLs to render and scrape. |
| selector | string | Optional CSS selector for the "selected area". Returns each match's HTML/text/attributes/position. Empty = full page only. |
| scrollToBottom | boolean | Scroll down to load lazy content. Default false. |
| maxScrolls | integer | Max scroll rounds when scrolling. Default 15. |
| waitForSelector | string | Wait until this selector appears (≤30s). |
| waitUntil | enum | domcontentloaded (default) · load · networkidle. |
| waitMs | integer | Extra fixed wait after load (ms). |
| htmlMode | enum | full (entire DOM, default) or visible: just the above-the-fold content shown on open (no scroll), with scripts/styles stripped for a short, clean HTML. |
| blockResources | array | Resource types to block. Default ["media","font"]. |
| returnFullHtml | boolean | Include the rendered HTML (full or visible per htmlMode). Default true. |
| returnText | boolean | Include page visible text. Default true. |
| includeScreenshot | boolean | Capture a full-page screenshot and return its URL. Default false. |
| proxyConfiguration | object | Apify Proxy (datacenter) by default; use Residential for bot-protected sites. |
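
The fields above can also be assembled in code. Below is a minimal Python sketch, assuming the defaults listed in the table. `build_input` is a hypothetical helper, not part of the actor, and the `{"url": ...}` object shape for `urls` is taken from the examples on this page.

```python
import json

# Allowed values for waitUntil, as listed in the input table.
WAIT_UNTIL_VALUES = {"domcontentloaded", "load", "networkidle"}


def build_input(urls, selector="", scroll_to_bottom=False, max_scrolls=15,
                wait_until="domcontentloaded", block_resources=("media", "font")):
    """Build an actor input dict (hypothetical helper; defaults mirror the table)."""
    if not urls:
        raise ValueError("urls is required and must not be empty")
    if wait_until not in WAIT_UNTIL_VALUES:
        raise ValueError(f"waitUntil must be one of {sorted(WAIT_UNTIL_VALUES)}")
    payload = {
        "urls": [{"url": u} for u in urls],
        "scrollToBottom": scroll_to_bottom,
        "maxScrolls": max_scrolls,
        "waitUntil": wait_until,
        "blockResources": list(block_resources),
    }
    # An empty selector means "full page only", so the key is simply omitted.
    if selector:
        payload["selector"] = selector
    return payload


print(json.dumps(build_input(["https://news.ycombinator.com"],
                             selector="span.titleline a",
                             scroll_to_bottom=True), indent=2))
```

Validating `waitUntil` locally is only a convenience; the actor's own input schema remains the source of truth.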

Example: full HTML of a JS-rendered page

{"urls":[{"url":"https://www.example.com"}],"waitUntil":"networkidle"}

Example: extract a selected area, after scrolling

{
  "urls": [{"url": "https://news.ycombinator.com"}],
  "selector": "span.titleline a",
  "scrollToBottom": true
}

Output

One record per URL:

{
  "url": "https://www.example.com",
  "loadedUrl": "https://www.example.com/",
  "statusCode": 200,
  "title": "Example Domain",
  "html": "<!DOCTYPE html><html>...</html>",
  "text": "Example Domain\nThis domain is for use in...",
  "selectedCount": 30,
  "selectedElements": [
    {
      "text": "Some headline",
      "html": "<a href=\"...\">Some headline</a>",
      "attributes": [{"name": "href", "value": "https://..."}],
      "width": 320, "height": 18, "top": 140, "left": 24
    }
  ],
  "screenshotUrl": "https://api.apify.com/v2/key-value-stores/.../records/screenshot-1",
  "scrapedAt": "2026-06-13T08:00:00.000Z"
}
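
Because `attributes` is returned as a list of name/value pairs rather than a plain object, a small conversion step is handy when post-processing records. The sketch below assumes the record shape shown above. `extract_links` is a hypothetical helper, and the `href` value in the sample is made up for illustration.

```python
def extract_links(record):
    """Return (text, href) pairs from a record's selectedElements."""
    links = []
    for element in record.get("selectedElements", []):
        # attributes arrives as a list of {"name": ..., "value": ...} objects
        attrs = {a["name"]: a["value"] for a in element.get("attributes", [])}
        if "href" in attrs:
            links.append((element.get("text", ""), attrs["href"]))
    return links


# Illustrative record following the output shape; values are invented.
sample = {
    "url": "https://www.example.com",
    "selectedCount": 1,
    "selectedElements": [
        {
            "text": "Some headline",
            "html": "<a href=\"https://example.com/a\">Some headline</a>",
            "attributes": [{"name": "href", "value": "https://example.com/a"}],
            "width": 320, "height": 18, "top": 140, "left": 24,
        }
    ],
}

print(extract_links(sample))
```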

Use as an API

curl -X POST "https://api.apify.com/v2/acts/USERNAME~browserless-html-scraper/run-sync-get-dataset-items?token=TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"urls":[{"url":"https://www.example.com"}],"selector":"h1"}'
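
The same call can be sketched in Python with only the standard library. `USERNAME` and `TOKEN` are placeholders, as in the curl command. `build_request` and `run_sync` are hypothetical helper names, and the 300-second timeout is an assumption rather than a documented value for this actor.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2/acts"


def build_request(actor_id, token, payload):
    """Prepare (but do not send) the run-sync-get-dataset-items POST request."""
    query = urllib.parse.urlencode({"token": token})
    url = f"{API_BASE}/{actor_id}/run-sync-get-dataset-items?{query}"
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_sync(actor_id, token, payload, timeout=300):
    """Send the request and return the dataset items as a list of dicts."""
    request = build_request(actor_id, token, payload)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


# Building the request makes no network call; run_sync() is what sends it.
request = build_request(
    "USERNAME~browserless-html-scraper",
    "TOKEN",
    {"urls": [{"url": "https://www.example.com"}], "selector": "h1"},
)
print(request.get_method(), request.full_url)
```

Calling `run_sync(...)` with a real actor ID and token should return one record per URL, in the shape shown under Output.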

Notes

  • For bot-protected sites, switch proxyConfiguration to Residential.
  • Blocking images and stylesheets speeds things up but can break layout-dependent lazy scrolling on some sites. Leave them unblocked when using scrollToBottom on such pages.
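
Both notes can be applied to an input object mechanically. This is a rough sketch under two assumptions: that the actor accepts Apify's usual proxy input shape (`useApifyProxy` plus `apifyProxyGroups`), and that the blockable resource types are named `image` and `stylesheet`, which is inferred from the wording of the note rather than confirmed. `apply_notes` is a hypothetical helper.

```python
def apply_notes(payload, bot_protected=False):
    """Return a copy of an input dict adjusted per the two notes above."""
    adjusted = dict(payload)
    if bot_protected:
        # Apify's usual proxy input shape; assumed to be accepted by this actor.
        adjusted["proxyConfiguration"] = {
            "useApifyProxy": True,
            "apifyProxyGroups": ["RESIDENTIAL"],
        }
    if adjusted.get("scrollToBottom"):
        # Stop blocking images and stylesheets so lazy-scroll layouts still work.
        # The names "image" and "stylesheet" are assumed, not confirmed.
        blocked = adjusted.get("blockResources", ["media", "font"])
        adjusted["blockResources"] = [
            r for r in blocked if r not in ("image", "stylesheet")
        ]
    return adjusted


example = apply_notes(
    {
        "urls": [{"url": "https://www.example.com"}],
        "scrollToBottom": True,
        "blockResources": ["image", "media", "stylesheet"],
    },
    bot_protected=True,
)
print(example["blockResources"], example["proxyConfiguration"])
```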

You might also like

Screenshot & HTML file from Url

leadsbrary/screenshot-html-file-from-url

Capture website screenshots and/or full-page HTML in one run, from $1 per 1,000 URLs. PNG, JPEG, and PDF: full-page, custom viewport, lazy-load scroll, cookie-banner hiding, batch mode. HTML files open correctly in any browser. REST API ready. No watermark.

πŸ‘ User avatar

Alexandre Manguis

24

5.0

Screenshots from HTML

vojtam/screenshots-from-html

Actor creates screenshots from a saved HTML structure.

πŸ‘ User avatar

Vojtěch Mašláň

76

HTML to PDF Converter

rainminer/html-to-pdf-converter

Convert raw HTML or web page URLs into downloadable PDF files using a real browser. Render CSS, images, tables, invoices, reports, and dynamic layouts, then save the generated PDF to the Apify Key-Value Store with dataset metadata.

HTML Scraper

making-data-meaningful/html-scraper

Access and extract full HTML source code from any webpage instantly. The HTML Scraper API lets you retrieve clean, accurate page HTML for SEO analysis, web scraping, and content monitoring - all without being blocked.

Download HTML from URLs

datapilot/download-html-from-urls

This script with an Apify Actor to fetch the complete HTML source of any website. The user provides a URL, the page is loaded with JavaScript execution, the full HTML is printed in the terminal, saved to an HTML file,

Generic Html Scraper

daddyapi/generic-html-scraper

A lightweight, robust, and simple actor to fetch the raw HTML content of any URL

HTML Scraper pro

scrapingxpert/html-scraper-pro

HTML Scraper Pro is a tool designed to extract the HTML source code and metadata from websites. It uses advanced web scraping techniques to retrieve the full HTML content of web pages, the page title, and the HTTP status code. This tool is ideal for data extraction, website analysis, and archiving.

321

5.0

🌐 Download HTML from URLs

scrapio/download-html-from-urls

🌐📥 Download HTML from any URL with download-html-from-urls. Extract and save raw page source for analysis, scraping, or automation: fast, reliable, and easy to use. Perfect for developers and data teams. 🚀✨

Smart Page Fetcher - HTML, Markdown & Text

shelvick/smart-page-fetcher

Fetch a batch of URLs and get the page as HTML, Markdown, or clean text. Tries plain HTTP first, renders JavaScript in a real browser when needed, and escalates to stealth + residential proxy for Cloudflare-protected, bot-defended pages, per URL. Pay only for the difficulty each URL needed.

4

Related articles

  • How to parse HTML in JavaScript
  • How to scrape dynamic websites with Python
  • How to use Playwright selectors