This Apify actor takes a list of URLs and downloads the full HTML of each page. You can configure proxy settings and, optionally, have the actor wait for a selector to appear on a page before its HTML is captured.

✅ Use Cases

📄 Download HTML content from multiple websites

🕷️ Archive web pages for offline analysis

📊 Extract raw HTML for custom parsing

🔍 Monitor website changes over time

📥 Input Configuration

You can customize the actor using the following input fields:

    {
      "requestListSources": [
        {
          "url": "https://apify.com"
        }
      ],
      "proxyConfiguration": {
        "useApifyProxy": true
      },
      "handlePageTimeoutSecs": 60,
      "maxRequestRetries": 1,
      "useChrome": false
    }

🧾 Fields Explained

| Field | Type | Description |
| --- | --- | --- |
| requestListSources | array | Required. Array of URLs to download. Each item can have optional userData with waitForSelector. |
| proxyConfiguration | object | Proxy settings: choose no proxy, Apify Proxy, or custom proxy URLs. |
| handlePageTimeoutSecs | integer | Optional. Maximum time, in seconds, to spend processing one page (default: 60). |
| maxRequestRetries | integer | Optional. How many retries before giving up (default: 1). |
| useChrome | boolean | Optional. Use real Chrome browser instead of Chromium (default: false). |
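As a rough illustration of how the fields above fit together, the sketch below assembles an input object from a plain list of URLs. The helper name `buildInput` is hypothetical and not part of the actor. The defaults are the ones documented above, and the proxy setting mirrors the sample input.

```javascript
// Hypothetical helper (not part of the actor): builds an input object
// using the documented field names and defaults.
function buildInput(urls, overrides = {}) {
  if (!Array.isArray(urls) || urls.length === 0) {
    throw new Error('requestListSources is required: pass at least one URL');
  }
  const requestListSources = urls.map((url) => {
    new URL(url); // throws a TypeError if the string is not a valid absolute URL
    return { url };
  });
  return {
    requestListSources,
    proxyConfiguration: { useApifyProxy: true }, // same as the sample input
    handlePageTimeoutSecs: 60, // documented default
    maxRequestRetries: 1, // documented default
    useChrome: false, // documented default
    ...overrides, // caller-supplied values win over the defaults
  };
}

console.log(JSON.stringify(buildInput(['https://apify.com']), null, 2));
```

Validating each URL up front is a design choice of this sketch, so a typo fails before a run is started rather than during it.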

📤 Output

The actor returns a dataset containing HTML content for each URL. Each record includes the original URL, final URL (after redirects), page title, and full HTML content.

🧩 Sample Output

    [
      {
        "url": "https://apify.com",
        "loadedUrl": "https://apify.com/",
        "title": "Apify - Web Scraping & Data Extraction | Apify",
        "html": "<!DOCTYPE html>\n<html lang=\"en\">\n<head>\n<meta charset=\"utf-8\">\n..."
      }
    ]
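For custom parsing or change monitoring, records shaped like the sample above can be reduced to a small summary. This is a minimal sketch that assumes each record carries `url`, `loadedUrl`, `title`, and `html` as shown. The function name `summarizeRecords` and the derived fields `htmlLength` and `redirected` are illustrative, not part of the actor's output.

```javascript
// Illustrative post-processing of dataset records (url, loadedUrl, title, html).
function summarizeRecords(records) {
  return records.map(({ url, loadedUrl, title, html }) => ({
    url,
    loadedUrl,
    title,
    htmlLength: html.length,
    // Compare normalized URLs so that "https://apify.com" and
    // "https://apify.com/" are not reported as a redirect.
    redirected: new URL(url).href !== new URL(loadedUrl).href,
  }));
}

const sample = [
  {
    url: 'https://apify.com',
    loadedUrl: 'https://apify.com/',
    title: 'Apify - Web Scraping & Data Extraction | Apify',
    html: '<!DOCTYPE html>\n<html lang="en">\n<head>\n<meta charset="utf-8">\n...',
  },
];

console.log(summarizeRecords(sample));
```

For the sample record, `redirected` is `false`, because the two URLs differ only by the trailing slash that URL normalization adds.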

🔒 Proxy Configuration

This actor supports flexible proxy configuration:

No proxy

Apify Proxy, optionally with residential IPs

Custom proxy URLs

Default proxy settings:

    {
      "useApifyProxy": true
    }
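The three proxy modes listed above might map to `proxyConfiguration` objects roughly as follows. This is a sketch, not the actor's code: `buildProxyConfiguration` and its mode names are hypothetical. `useApifyProxy` and `apifyProxyGroups` appear in this README's examples, while `proxyUrls` is an assumption based on the field Apify's standard proxy input usually uses for custom proxies.

```javascript
// Hypothetical helper mapping a mode name to a proxyConfiguration object.
// ASSUMPTION: `proxyUrls` is the field for custom proxies; this README
// does not name it, so check the actor's input schema before relying on it.
function buildProxyConfiguration(mode, options = {}) {
  switch (mode) {
    case 'none':
      return { useApifyProxy: false };
    case 'apify': {
      const config = { useApifyProxy: true };
      if (options.groups && options.groups.length > 0) {
        config.apifyProxyGroups = options.groups; // e.g. ['RESIDENTIAL']
      }
      return config;
    }
    case 'custom':
      if (!options.proxyUrls || options.proxyUrls.length === 0) {
        throw new Error('custom mode needs at least one proxy URL');
      }
      return { useApifyProxy: false, proxyUrls: options.proxyUrls };
    default:
      throw new Error(`unknown proxy mode: ${mode}`);
  }
}

console.log(buildProxyConfiguration('apify', { groups: ['RESIDENTIAL'] }));
```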

🚀 How to Use

Open the actor in Apify Console

Click "Try actor" or create a new task

Add URLs to the requestListSources array

Configure proxy settings if needed

Run the actor

Download HTML content in JSON, CSV, or XML format
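The final download step can also be scripted. The sketch below only builds the download URL for a finished run's dataset. It assumes the Apify API v2 dataset items endpoint with its `format` query parameter. `YOUR_DATASET_ID` is a placeholder, and `datasetItemsUrl` is a hypothetical helper name. A private dataset would also need an API token, which is omitted here.

```javascript
// Formats named in the steps above.
const EXPORT_FORMATS = ['json', 'csv', 'xml'];

// Hypothetical helper: builds a dataset download URL.
// ASSUMPTION: endpoint shape follows Apify API v2 (/v2/datasets/{id}/items).
function datasetItemsUrl(datasetId, format = 'json') {
  if (!EXPORT_FORMATS.includes(format)) {
    throw new Error(`unsupported format: ${format}`);
  }
  const url = new URL(
    `https://api.apify.com/v2/datasets/${encodeURIComponent(datasetId)}/items`,
  );
  url.searchParams.set('format', format);
  return url.href;
}

console.log(datasetItemsUrl('YOUR_DATASET_ID', 'csv'));
```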

⚙️ Advanced Input Example

    {
      "requestListSources": [
        {
          "url": "https://example.com",
          "userData": {
            "waitForSelector": ".content-loaded"
          }
        },
        {
          "url": "https://another-site.com"
        }
      ],
      "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyGroups": ["RESIDENTIAL"]
      },
      "handlePageTimeoutSecs": 120,
      "maxRequestRetries": 3,
      "useChrome": true
    }
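To make the role of `userData.waitForSelector` concrete, here is a minimal sketch of how a page handler could honor it. The actor's actual source is not shown in this README, so `collectHtml` is a hypothetical function. It assumes a `page` object that behaves like a Puppeteer Page (`waitForSelector`, `url`, `title`, `content`), and a stand-in object is used so the sketch runs without a browser.

```javascript
// Sketch only: not the actor's real handler.
async function collectHtml({ request, page }, timeoutSecs = 60) {
  const selector = request.userData && request.userData.waitForSelector;
  if (selector) {
    // Puppeteer expects the timeout in milliseconds.
    await page.waitForSelector(selector, { timeout: timeoutSecs * 1000 });
  }
  return {
    url: request.url, // original URL
    loadedUrl: page.url(), // final URL after redirects
    title: await page.title(),
    html: await page.content(),
  };
}

// Stand-in for a Puppeteer page; records which selectors were awaited.
const fakePage = {
  waited: [],
  async waitForSelector(selector) { this.waited.push(selector); },
  url: () => 'https://example.com/',
  title: async () => 'Example Domain',
  content: async () => '<html><body class="content-loaded"></body></html>',
};

collectHtml(
  {
    request: { url: 'https://example.com', userData: { waitForSelector: '.content-loaded' } },
    page: fakePage,
  },
  120,
).then((record) => console.log(record.title, fakePage.waited));
```

URLs without `userData`, such as the second entry in the input above, skip the wait and are captured as soon as the page has loaded.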

🛠️ Tech Stack

🧩 Apify SDK — for actor and data handling

🕷️ Crawlee — for robust crawling and scraping

🌐 Puppeteer — for browser automation and rendering dynamic content

⚙️ Node.js — fast, scalable backend environment

URL: https://apify.com/scrapeai/html-downloader
