🌐 Download HTML from URLs

Pricing

from $5.99 / 1,000 results

🌐 The Download HTML from URLs tool fetches page source for analysis, scraping, and SEO audits. ✅ It handles multiple links in one run and preserves the original markup. 🚀 Suited to developers, marketers, and data teams.

Developer

Scrapier

Maintained by Community

Last modified: 3 days ago

✨ Why Choose Us?

🚀 Fast by default — direct connections and parallel downloads, no wasted proxy traffic.
🛡️ Self-healing on blocks — automatically falls back to datacenter and then residential proxies, and keeps the stronger proxy for the rest of the run.
🎭 Browser rendering fallback — pages that fail plain HTTP get rendered in a real headless Chromium browser.
💾 Live results — every page is saved to the dataset the moment it finishes, so even interrupted runs keep their data.
🔗 Flexible input — URLs with or without https://, bulk paste, file upload, or Google Sheets.
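
The self-healing behaviour described above can be pictured with a small sketch. This is not the actor's implementation: the tier labels, the `TierEscalator` class, and the `fetch_with_tier` callback are all hypothetical names chosen for illustration, and the only behaviour taken from the description is "try direct first, escalate on blocks, keep the stronger tier afterwards".

```python
from typing import Callable, Optional, Tuple

# Illustrative tier labels (assumption), ordered from weakest to strongest.
TIERS = ["direct", "datacenter", "residential"]


class TierEscalator:
    """Remembers the weakest tier that is still worth trying during a run."""

    def __init__(self) -> None:
        self.start_index = 0  # every run begins with direct connections

    def fetch(
        self,
        url: str,
        fetch_with_tier: Callable[[str, str], Optional[str]],
    ) -> Tuple[Optional[str], Optional[str]]:
        """Try tiers from the current starting tier upward.

        fetch_with_tier(url, tier) is a caller-supplied function (hypothetical)
        that returns HTML on success or None when the request is blocked.
        Returns (html, tier_used), or (None, None) if every tier failed.
        """
        for index in range(self.start_index, len(TIERS)):
            html = fetch_with_tier(url, TIERS[index])
            if html is not None:
                # Keep the stronger tier for the rest of the run.
                self.start_index = index
                return html, TIERS[index]
        return None, None
```

Once one URL has forced an escalation, later URLs skip the weaker tiers, which matches the "keeps the stronger proxy for the rest of the run" claim.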

🔑 Key Features

Bulk URL input (requestListSources editor — paste lists, upload files, link sheets)
Full page HTML (fullHtml) and extracted <body> HTML (html) per page
Automatic retries with exponential backoff (configurable)
Configurable concurrency, page timeout, and polite request delays
Detailed per-URL debug info (#debug) — status code, retries, error messages, proxy tier
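
To make "retries with exponential backoff" concrete, here is a minimal sketch of how such a delay schedule is commonly computed. The base delay, cap, and jitter values are assumptions; the page does not document the actor's actual schedule, and `backoff_delays` is a hypothetical helper.

```python
import random
from typing import List, Optional


def backoff_delays(
    max_retries: int,
    base_secs: float = 1.0,   # assumed starting delay
    cap_secs: float = 30.0,   # assumed upper bound per delay
    jitter: float = 0.0,      # assumed max random extra seconds per delay
    rng: Optional[random.Random] = None,
) -> List[float]:
    """Return one delay per retry, doubling each time up to cap_secs."""
    rng = rng or random.Random()
    delays = []
    for attempt in range(max_retries):
        delay = min(cap_secs, base_secs * (2 ** attempt))
        delay += rng.uniform(0, jitter)  # adds nothing when jitter is 0
        delays.append(delay)
    return delays
```

With `maxRetries` set to 3 and these assumed defaults, the waits before the three retries would be 1, 2, and 4 seconds.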

📥 Input

{
  "startUrls": [
    {"url": "https://apify.com"},
    {"url": "example.com"}
  ],
  "proxyConfiguration": {"useApifyProxy": false},
  "pageTimeoutSecs": 60,
  "maxRetries": 3,
  "maxConcurrency": 5,
  "requestDelaySecs": 0
}

Field	Type	Default	Description
`startUrls`	array	—	Required. List of URLs to download. Missing schemes default to `https://`.
`proxyConfiguration`	object	no proxy	Proxy settings. By default requests go direct; on blocks the actor escalates to datacenter → residential automatically.
`pageTimeoutSecs`	integer	`60`	Max seconds to spend downloading one page.
`maxRetries`	integer	`3`	Extra attempts for a failing URL.
`maxConcurrency`	integer	`5`	Pages downloaded in parallel.
`requestDelaySecs`	number	`0`	Optional polite delay before each request (jitter added).
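
If you generate the input programmatically, a small helper along these lines may be convenient. It is a sketch only: `normalize_url` and `build_input` are hypothetical names, and the client-side scheme handling simply mirrors the documented "missing schemes default to `https://`" rule (the actor applies that rule itself, so this step is optional).

```python
import json
from typing import Dict, Iterable


def normalize_url(raw: str) -> str:
    """Prefix https:// when the URL has no scheme, as the field table describes."""
    url = raw.strip()
    if "://" not in url:
        url = "https://" + url
    return url


def build_input(
    urls: Iterable[str],
    page_timeout_secs: int = 60,
    max_retries: int = 3,
    max_concurrency: int = 5,
    request_delay_secs: float = 0,
) -> Dict:
    """Assemble an input object using the documented field names and defaults."""
    return {
        # Blank lines from a bulk paste are skipped.
        "startUrls": [{"url": normalize_url(u)} for u in urls if u.strip()],
        "proxyConfiguration": {"useApifyProxy": False},
        "pageTimeoutSecs": page_timeout_secs,
        "maxRetries": max_retries,
        "maxConcurrency": max_concurrency,
        "requestDelaySecs": request_delay_secs,
    }


# Example: serialize the input for pasting into the Console or sending via the API.
payload = json.dumps(build_input(["https://apify.com", "example.com"]), indent=2)
```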

📤 Output

One dataset record per URL:

{
  "url": "https://apify.com",
  "finishedAt": "2026-06-10T10:14:29.693Z",
  "fullHtml": "<!DOCTYPE html><html>...</html>",
  "html": "<body>...</body>"
}

Field	Description
`url`	The downloaded URL.
`finishedAt`	UTC timestamp when the page finished downloading.
`fullHtml`	Complete HTML source of the page.
`html`	Just the `<body>...</body>` portion.
`#debug`	Hidden field with status code, retry count, error messages, and the proxy tier used.
`#error`	Hidden boolean — `true` if no HTML could be retrieved.
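
When post-processing the dataset, you will probably want to separate pages that downloaded from those that did not. The sketch below assumes records shaped like the example above; `split_results` is a hypothetical helper, and because `#error` is a hidden field that may not appear in every export, it also treats an empty `fullHtml` as a failure.

```python
from typing import Dict, Iterable, List, Tuple


def split_results(records: Iterable[Dict]) -> Tuple[List[Dict], List[Dict]]:
    """Partition dataset records into (succeeded, failed)."""
    succeeded, failed = [], []
    for record in records:
        # "#error" may be missing from exports, so fall back to an empty-HTML check.
        if record.get("#error") or not record.get("fullHtml"):
            failed.append(record)
        else:
            succeeded.append(record)
    return succeeded, failed
```

The failed list can then be fed back into a new run as `startUrls`, or inspected through `#debug` where that field is available.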

🚀 How to Use (Apify Console)

Log in at console.apify.com → Actors.
Open Download HTML from URLs.
Paste your URLs into 🔗 Website URLs (bulk paste works!).
Optionally tweak proxy, timeout, retries, and concurrency.
Click Start and watch the live progress logs.
Open the Output tab when the run completes.
Export to JSON / CSV / XLSX with one click.

🤖 Use via API

curl -X POST "https://api.apify.com/v2/acts/<ACTOR_ID>/run-sync-get-dataset-items?token=$APIFY_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"startUrls":[{"url":"https://apify.com"}]}'
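
The same call can be made from Python with only the standard library. This is a sketch under assumptions: `build_run_request` and `run_actor` are hypothetical helper names, the timeout is an arbitrary choice, and the endpoint is the one shown in the curl command above with `<ACTOR_ID>` left for you to supply.

```python
import json
import urllib.parse
import urllib.request
from typing import Dict, List

API_BASE = "https://api.apify.com/v2/acts"


def build_run_request(actor_id: str, token: str, urls: List[str]) -> urllib.request.Request:
    """Build the POST request for the run-sync-get-dataset-items endpoint."""
    query = urllib.parse.urlencode({"token": token})
    endpoint = f"{API_BASE}/{actor_id}/run-sync-get-dataset-items?{query}"
    body = json.dumps({"startUrls": [{"url": u} for u in urls]}).encode("utf-8")
    return urllib.request.Request(
        endpoint,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_actor(actor_id: str, token: str, urls: List[str], timeout_secs: int = 300) -> List[Dict]:
    """Send the request and return the dataset items parsed from the JSON response."""
    request = build_run_request(actor_id, token, urls)
    with urllib.request.urlopen(request, timeout=timeout_secs) as response:
        return json.loads(response.read().decode("utf-8"))
```

A typical call would read the token from an environment variable, for example `run_actor(actor_id, os.environ["APIFY_TOKEN"], ["https://apify.com"])`, so the secret stays out of source control.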

💼 Best Use Cases

📰 Archiving article or product pages
🔎 Feeding HTML into your own parsers / LLM pipelines
🧪 Monitoring page content and structure changes
🗂️ Bulk snapshotting of competitor or partner sites
🤖 Pre-fetching pages for downstream AI extraction Actors

💰 Pricing

This actor uses the pay-per-event model with one simple event:

Event	Charged when
`page-downloaded`	A page's HTML is successfully downloaded and saved to the dataset.

Failed URLs are never charged. When your spending limit is reached, the run stops gracefully and keeps everything collected so far.
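
For budgeting, the charge can be estimated from the number of pages you expect to succeed. The sketch below assumes the listed starting rate of $5.99 per 1,000 results applies to your plan (the listing says "from", so your actual rate may differ); `estimate_cost` is a hypothetical helper.

```python
def estimate_cost(total_urls: int, failed_urls: int = 0, price_per_1000: float = 5.99) -> float:
    """Estimate the run charge in USD; failed URLs are excluded because they are not billed."""
    if failed_urls < 0 or failed_urls > total_urls:
        raise ValueError("failed_urls must be between 0 and total_urls")
    charged_pages = total_urls - failed_urls
    return round(charged_pages * price_per_1000 / 1000, 2)
```

For instance, a run of 250 URLs in which 50 fail would bill 200 `page-downloaded` events, which is about $1.20 at the assumed rate.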

❓ Frequently Asked Questions

Do I need a proxy? Usually not — the actor starts with direct connections. If a site blocks the request, it escalates to datacenter and then residential proxies automatically.

Does it render JavaScript? Yes, when needed. Pages that fail the fast HTTP download are automatically rendered in a real headless browser.

Can I paste URLs without https://? Yes — example.com is automatically converted to https://example.com.

What happens to URLs that fail completely? They're still saved to the dataset with an empty fullHtml and full error details in #debug, so you always know exactly what happened — and you're not charged for them.

⚖️ Legal

This actor downloads only publicly available web pages. You are responsible for complying with the target websites' terms of service and applicable laws (GDPR, CCPA, etc.) when using the downloaded data.

💬 Support and Feedback

Found a bug or need a feature? Open an issue on the actor's Issues tab in Apify Console — we respond quickly!

URL: https://apify.com/scrapier/download-html-from-urls
