👁 Web Page to Single-Page PDF & HTML (Automation-Ready) avatar

Web Page to Single-Page PDF & HTML (Automation-Ready)

Pricing

$29.99/month + usage

👁 Web Page to Single-Page PDF & HTML (Automation-Ready)

Web Page to Single-Page PDF & HTML (Automation-Ready)

Convert webpages to single-page PDFs and extract raw HTML via API. Captures full scroll height (no A4 splits). Built for automation with n8n, Make, and Zapier. Ideal for archiving, AI workflows, compliance, and bulk processing.

Pricing

$29.99/month + usage

Rating

0.0

(0)

Developer

👁 Gavin Campbell

Gavin Campbell

Maintained by Community

Actor stats

Bookmarked

Total users

Monthly active users

3 months ago

Last modified

Web Page to Single-Page PDF Converter (Automation Ready)

Capture full-length webpages as single-page PDFs and extract raw HTML source code via API.

Designed for seamless integration with automation platforms like n8n, Make.com, and Zapier, this Apify Actor allows you to programmatically archive web content, generate visual reports, and feed clean data into your AI workflows.

Unlike standard converters that cut pages into A4 sheets, this tool captures the entire scrollable area of a webpage into one continuous PDF file, ensuring no data is cut off at page breaks.

🚀 Key Features

Single-Page "Long" PDFs: Captures the full height of the webpage in a single continuous document. Perfect for newsletters, landing pages, and social media feeds.
HTML Source Extraction: Option to save the exact view-source: HTML code alongside the visual PDF.
Bulk Processing: Handle thousands of URLs in a single run.
Anti-Blocking: Built-in support for Apify Proxy and stealth mode to bypass bot detection.
Smart Waiting: Configurable waitUntil strategies (e.g., networkidle0) ensure dynamic JavaScript content loads completely before capture.

💡 Use Cases

Compliance & Archiving: Automatically screenshot and save the HTML source of your legal pages, T&Cs, or partner sites for compliance auditing.
Marketing Swipe Files: Build a visual database of competitor landing pages, emails, and ad creatives.
AI Knowledge Base: Feed the raw HTML output into LLMs (like ChatGPT or Claude) via n8n to analyze page structure or content without parsing complex DOMs yourself.
Invoicing & Receipts: Convert web-based invoice views into portable PDF files for accounting systems.
Design QA: Automate visual regression testing by capturing full-page renders of your staging environment.

⚙️ Input Configuration

Field	Type	Default	Description
`startUrls`	Array	`[]`	A list of URLs you want to convert. Supports direct URLs or object format.
`saveHtml`	Boolean	`true`	If enabled, saves the raw HTML source code (`.html`) to the Key-Value store.
`proxyConfiguration`	Object	`Apify Proxy`	Recommended to keep enabled to avoid IP bans.
`waitUntil`	String	`networkidle0`	When to take the snapshot. Use `networkidle0` for strict loading or `domcontentloaded` for speed.

🔌 Automation Integrations

This Actor is built to be a backend microservice. Here is how to connect it to your favorite workflow automation tools.

1. n8n Integration

Goal: Trigger the actor from a workflow and download the resulting PDF.

Add the "Apify" Node: In your n8n workflow, add the Apify node.
Select Action: Choose Run Actor.
Actor ID: Search for web-to-pdf-converter (or use the Actor ID from the Apify console).

Input: switch to JSON mode and map your URL:

{
"startUrls":[{"url":"{{$json.your_url_field}}"}],
"saveHtml":true
}

Wait for Finish: Ensure the "Synchronous" option is checked (or use a separate "Wait" node and "Get Dataset Items" node for long runs).
Retrieve Files: The output will contain a pdfUrl. Use an HTTP Request node to GET that URL and save the binary data.

2. Make.com (Integromat) Integration

Goal: Save a webpage to Google Drive every time a new row is added to Google Sheets.

Trigger: Google Sheets (Watch Rows).
Action: Add the Apify module -> Run Actor.

Settings:

Actor: Select this actor.

Body:

{
"startUrls":[{"url":"{{1.url}}"}],
"saveHtml":true
}

Action: Add Apify module -> Get Dataset Items.
- Dataset ID: Map the defaultDatasetId from the previous step.
Action: Add HTTP module -> Get a file.
- URL: Map the pdfUrl from the dataset items.
Action: Google Drive -> Upload a File.

3. Zapier Integration

Goal: Email a PDF version of a webpage when a specific event occurs.

Trigger: Any Zapier trigger (e.g., "New Trello Card").
Action: Search for Apify.
Event: Select Run Actor.

Configure:

Actor: Paste the Actor ID.

Input Body:

{
"startUrls":[{"url":"https://example.com"}]
}

Action: Select Apify -> Get Dataset Items (to get the PDF link).
Action: Gmail -> Send Email. Use the pdfUrl in the attachment field or body.

📦 Output Format

The actor stores results in two locations:

Key-Value Store: The physical files.
- Page_Title_hash.pdf (The visual render)
- Page_Title_hash_source.html (The source code)
Dataset: The JSON metadata used for linking.

Sample Dataset JSON:

{
"url":"https://apify.com",
"title":"Apify: The Web Scraping and Automation Platform",
"pdfUrl":"https://api.apify.com/v2/key-value-stores/mYStoReId/records/Apify_hash.pdf",
"htmlUrl":"https://api.apify.com/v2/key-value-stores/mYStoReId/records/Apify_hash_source.html",
"timestamp":"2023-10-27T14:30:00.000Z"
}

🛠 Troubleshooting

PDF is blank/white: Try changing waitUntil to networkidle0. This forces the crawler to wait until all network activity (images, scripts) has settled.
Cookie Consent Popups: The actor attempts to hide scrollbars, but popups may obscure content. For complex sites, you may need an actor with custom "click" logic or use a pre-navigation hook (advanced usage).
Access Denied: Ensure you are using the proxyConfiguration set to useApifyProxy: true to avoid 403 errors.

Built with ❤️ using the Apify SDK and Puppeteer.

👁 Html To Pdf Api avatar

Html To Pdf Api

simplifysme/html-to-pdf-api

📄 Convert any HTML page or URL to high-quality PDF documents via API. Perfect for reports, invoices, documentation, web page archiving, and automated document generation.

👁 User avatar

SimplifySME Toolbox

👁 HTML To PDF for N8N avatar

HTML To PDF for N8N

exciting_perfume/HTML-to-PDF-Apify-Actor

Generate accurate PDFs from HTML or URLs using Chromium. Supports CSS, fonts, and backgrounds. Automation-ready and perfect for n8n workflows, reports, invoices, and contracts.

👁 User avatar

Gavin Campbell

👁 HTML to PDF Converter avatar

HTML to PDF Converter

automation-lab/html-to-pdf-converter

Convert HTML content or web pages to PDF documents. Supports raw HTML strings, single URLs, and bulk URL lists. Full control over page size, margins, orientation, headers, and footers.

👁 User avatar

Stas Persiianenko

👁 n8n Workflow Automation Templates Scraper avatar

n8n Workflow Automation Templates Scraper

scraped/n8n-workflow-automation-templates-scraper

A tool that automatically scrapes and collects n8n workflow automation templates from the n8n for easy access and use.

👁 User avatar

scraped

326

HTML to PDF converter

apify/html-to-pdf-converter

Convert HTML string to A4 PDF.

👁 User avatar

Apify

200

4.3

👁 n8n-mcp avatar

n8n-mcp

nourishing_courier/web-data-for-ai

n8n-mcp

👁 User avatar

Ani Björkström

👁 n8n Workflows Scraper avatar

n8n Workflows Scraper

dadhalfdev/n8n-workflows-scraper

This scraper extracts pre-built, free workflow templates directly from the n8n template library. Pick a category and sort order, and the scraper will navigate n8n's library to extract not only the metadata of each workflow but the full, raw JSON configuration. Get up to 150 workflows per run.

👁 User avatar

Marco Rodrigues

👁 Reddit Scraper Pro avatar

Reddit Scraper Pro

webdatalabs/reddit-scraper-pro

High-performance Reddit scraper (99%+ success rate) for automation workflows. Monitor subreddits, track keywords with sentiment analysis, scrape comments, and integrate with n8n/Zapier for powerful automation.

👁 User avatar

WebDataLabs

132

5.0

👁 n8n Documentation MCP Server avatar

n8n Documentation MCP Server

agentify/n8n-mcp-server

n8n MCP Server provides AI assistants with structured access to n8n node documentation, properties, and validation tools for building and verifying workflows efficiently.

👁 User avatar

agentify

👁 Reddit Scraper - Markdown for AI & n8n avatar

Reddit Scraper - Markdown for AI & n8n

clearpath/reddit-to-llm-api

Extract Reddit posts and comments as LLM-ready Markdown. No API key needed. Direct n8n/Make integration—connect output to AI nodes instantly. 20x faster than browser scrapers. Perfect for lead gen, product validation, and market research workflows.

👁 User avatar

ClearPath

👁 Blog article image

How to publish your Apify Actor as an n8n node

URL: https://apify.com/exciting_perfume/web-page-to-single-page-pdf-and-html