VOOZH about

URL: https://apify.com/scrapeai/html-downloader

โ‡ฑ Download HTML from URLs ยท Apify


Pricing

$5.00/month + usage

Go to Apify Store

Download HTML from URLs

This actor takes a list of URLs and downloads HTML of each page.

Pricing

$5.00/month + usage

Rating

5.0

(3)

Developer

๐Ÿ‘ ScrapeAI

ScrapeAI

Maintained by Community

Actor stats

0

Bookmarked

3

Total users

3

Monthly active users

7 months ago

Last modified

Categories

Share

๐Ÿง  HTML Downloader

This Apify actor takes a list of URLs and downloads the full HTML content of each page. It simply scrapes the complete HTML code for all given URLs. You can define proxy settings and optional selector waiting.

โœ… Use Cases

๐Ÿ“„ Download HTML content from multiple websites

๐Ÿ•ท๏ธ Archive web pages for offline analysis

๐Ÿ“Š Extract raw HTML for custom parsing

๐Ÿ” Monitor website changes over time

๐Ÿ“ฅ Input Configuration

You can customize the actor using the following input fields:

{
"requestListSources":[
{
"url":"https://apify.com"
}
],
"proxyConfiguration":{
"useApifyProxy":true
},
"handlePageTimeoutSecs":60,
"maxRequestRetries":1,
"useChrome":false
}

๐Ÿงพ Fields Explained Field Type Description requestListSources array Required. Array of URLs to download. Each item can have optional userData with waitForSelector proxyConfiguration object Proxy settings - choose no proxy, Apify Proxy, or custom proxy URLs handlePageTimeoutSecs integer Optional. Maximum time to spend processing one page (default: 60) maxRequestRetries integer Optional. How many retries before giving up (default: 1) useChrome boolean Optional. Use real Chrome browser instead of Chromium (default: false)

๐Ÿ“ค Output

The actor returns a dataset containing HTML content for each URL. Each record includes the original URL, final URL (after redirects), page title, and full HTML content.

๐Ÿงฉ Sample Output

[
{
"url":"https://apify.com",
"loadedUrl":"https://apify.com/",
"title":"Apify - Web Scraping & Data Extraction | Apify",
"html":"<!DOCTYPE html>\n<html lang=\"en\">\n<head>\n<meta charset=\"utf-8\">\n..."
}
]

๐Ÿ”’ Proxy Configuration

This actor supports flexible proxy configuration:

No proxy (default)

Apify Proxy for residential IPs

Custom proxy URLs

Default proxy settings:

{
"useApifyProxy":true
}

๐Ÿš€ How to Use

Open the actor in Apify Console

Click "Try actor" or create a new task

Add URLs to the requestListSources array

Configure proxy settings if needed

Run the actor

Download HTML content in JSON, CSV, or XML format

โš™๏ธ Advanced Input Example

{
"requestListSources":[
{
"url":"https://example.com",
"userData":{
"waitForSelector":".content-loaded"
}
},
{
"url":"https://another-site.com"
}
],
"proxyConfiguration":{
"useApifyProxy":true,
"apifyProxyGroups":["RESIDENTIAL"]
},
"handlePageTimeoutSecs":120,
"maxRequestRetries":3,
"useChrome":true
}

๐Ÿ› ๏ธ Tech Stack

๐Ÿงฉ Apify SDK โ€” for actor and data handling

๐Ÿ•ท๏ธ Crawlee โ€” for robust crawling and scraping

๐ŸŒ Puppeteer โ€” for browser automation and rendering dynamic content

โš™๏ธ Node.js โ€” fast, scalable backend environment

You might also like

Download HTML from URLs

mtrunkat/url-list-download-html

This actor takes a list of URLs and downloads HTML of each page.

๐Ÿ‘ User avatar

Marek Trunkรกt

9.3K

Download HTML from URLs

datapilot/download-html-from-urls

This script with an Apify Actor to fetch the complete HTML source of any website. The user provides a URL, the page is loaded with JavaScript execution, the full HTML is printed in the terminal, saved to an HTML file,

Full page screenshot

practicaltools/apify-screenshot

This Apify actor takes full-page screenshots of web pages with support for lazy loading content and provides temporary download URLs.

๐Ÿ‘ User avatar

Practical Tools

51

1.0

Generic Html Scraper

daddyapi/generic-html-scraper

A lightweight, robust, and simple actor to fetch the raw HTML content of any URL

HTML Scraper pro

scrapingxpert/html-scraper-pro

The HTML Scraper Pro is a powerful tool designed to extract the HTML source code and metadata from websites. It uses advanced web scraping techniques to retrieve the full HTML content of web pages,page title and HTTP status code.This tool is ideal for data extraction, website analysis, and archiving

309

5.0

HTML Scraper

making-data-meaningful/html-scraper

Access and extract full HTML source code from any webpage instantly. The HTML Scraper API lets you retrieve clean, accurate page HTML for SEO analysis, web scraping, and content monitoring - all without being blocked.

Related articles

How to parse HTML in JavaScript
Read more