URL: https://apify.com/antonio_espresso/html-to-markdown-converter

Html to Markdown Converter · Apify


Pricing

$10.00 / 1,000 conversions

Html to Markdown Converter

Crawl a target URL and convert its HTML content into clean, structured Markdown with optional heading-based chunking.

Rating

0.0 (0)

Developer

πŸ‘ Antonio Blago

Antonio Blago

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 39
  • Monthly active users: 1
  • Last modified: 4 months ago

Html to Markdown Converter

Convert HTML pages to clean, structured Markdown with optional heading-level chunking.

This Apify actor crawls one or more URLs, extracts the HTML content, and converts it into readable Markdown. It supports both full-document conversion and heading-based section chunking (h1–h6).
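
To make heading-based chunking concrete, here is a minimal sketch. It is not the actor's actual implementation (the Tech Stack section names Advertools for Markdown generation); the helper name `chunk_by_heading` is hypothetical. It assumes ATX-style headings (`#` through `######`) and starts a new chunk at each one, skipping `#` lines inside fenced code.

```python
import re

# ATX heading: one to six '#' characters, whitespace, then the title text.
HEADING = re.compile(r"^#{1,6}\s+\S")


def chunk_by_heading(markdown: str) -> list[str]:
    """Split Markdown into chunks, starting a new chunk at every h1-h6 heading."""
    chunks: list[list[str]] = []
    in_fence = False
    for line in markdown.splitlines():
        if line.lstrip().startswith("```"):
            in_fence = not in_fence  # '#' lines inside code fences are not headings
        if HEADING.match(line) and not in_fence:
            chunks.append([line])  # a heading opens a new chunk
        elif chunks:
            chunks[-1].append(line)  # body text belongs to the current chunk
        elif line.strip():
            chunks.append([line])  # text before the first heading gets its own chunk
    return ["\n".join(chunk).strip() for chunk in chunks]
```

Calling `chunk_by_heading(record["markdown"])` on a converted page would give a list comparable to the `chunks` field, although the actor's exact splitting rules may differ.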


🚀 Features

  • 🕷️ Crawl any public website
  • 🪄 Convert HTML to GitHub-style Markdown
  • 🧩 Automatically chunk content by heading
  • 🧼 Cleans up temporary crawl artifacts
  • 🐞 Logs crawl/parsing issues to Apify logs

📥 Input Example

{
  "urls": ["https://apify.com"],
  "output_format": "json",
  "debug": false
}

Field          Type     Description
urls           array    List of URLs to crawl
output_format  string   "json" or "df" (DataFrame-like)
debug          boolean  Enables verbose logging
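
The field table above can be checked before a run is started. The following is a minimal sketch, not part of the actor: the function name `validate_input` is hypothetical, and the defaults for `output_format` and `debug` are assumptions (the CLI example passes only `urls`, which suggests the other two fields are optional).

```python
from typing import Any

ALLOWED_FORMATS = {"json", "df"}


def validate_input(raw: dict[str, Any]) -> dict[str, Any]:
    """Return a normalized copy of the actor input, or raise ValueError."""
    urls = raw.get("urls")
    if not isinstance(urls, list) or not urls:
        raise ValueError("'urls' must be a non-empty list")
    for url in urls:
        if not isinstance(url, str) or not url.startswith(("http://", "https://")):
            raise ValueError(f"invalid URL: {url!r}")
    output_format = raw.get("output_format", "json")  # assumed default
    if output_format not in ALLOWED_FORMATS:
        raise ValueError("'output_format' must be 'json' or 'df'")
    debug = raw.get("debug", False)  # assumed default
    if not isinstance(debug, bool):
        raise ValueError("'debug' must be a boolean")
    return {"urls": urls, "output_format": output_format, "debug": debug}
```

Validating locally catches a malformed input before it is sent to the platform.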

📤 Output Example

Each record includes:

  • url – Source URL
  • markdown – Full converted content
  • chunks – Markdown split into sections by heading

{
  "url": "https://apify.com",
  "markdown": "# Welcome to Apify\n...",
  "chunks": [
    "# Welcome to Apify",
    "## What is Apify?",
    "## Get Started"
  ]
}
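
As an illustration of how a record like the one above might be consumed, the sketch below turns the `chunks` list into an outline of heading levels and titles. The helper name `outline` is hypothetical, and it assumes each chunk begins with its heading line, as in the example record.

```python
import re

# Example record copied from the output example.
record = {
    "url": "https://apify.com",
    "markdown": "# Welcome to Apify\n...",
    "chunks": ["# Welcome to Apify", "## What is Apify?", "## Get Started"],
}


def outline(chunks: list[str]) -> list[tuple[int, str]]:
    """Return (heading level, title) for each chunk that starts with a heading."""
    result = []
    for chunk in chunks:
        lines = chunk.splitlines()
        first_line = lines[0] if lines else ""
        match = re.match(r"^(#{1,6})\s+(.*)", first_line)
        if match:
            result.append((len(match.group(1)), match.group(2).strip()))
    return result


# Print the titles indented by heading level.
for level, title in outline(record["chunks"]):
    print("  " * (level - 1) + title)
```

Chunks that do not start with a heading are skipped, so the helper also tolerates leading text that precedes the first heading.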

▶️ Usage

You can run this actor from the Apify CLI or from Python:

🟣 Apify CLI

$ apify call antonio_espresso/html-to-markdown-converter --input '{ "urls": ["https://apify.com"] }'

🐍 Python

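import requests

# The API addresses an actor as "username~actor-name" (tilde, not slash).
ACTOR_ID = "antonio_espresso~html-to-markdown-converter"
TOKEN = "APIFY_API_TOKEN"  # Replace with your token

# Start an actor run; the JSON body is passed to the actor as its input.
response = requests.post(
    f"https://api.apify.com/v2/acts/{ACTOR_ID}/runs",
    headers={"Authorization": f"Bearer {TOKEN}"},
    json={"urls": ["https://apify.com"], "output_format": "json"},
)
response.raise_for_status()
print(response.json())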
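
If you want the converted records back in the same HTTP call, the Apify API also offers a synchronous endpoint, `run-sync-get-dataset-items`. The sketch below only builds the request so it can be inspected without a token. The helper name `build_sync_request` is hypothetical, and it is an assumption that the actor writes its records to the default dataset in the shape shown in the output example.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_sync_request(actor: str, token: str, urls: list[str]) -> urllib.request.Request:
    """Prepare (but do not send) a POST that runs the actor and returns its dataset items."""
    actor_id = actor.replace("/", "~")  # the API expects username~actor-name
    body = json.dumps({"urls": urls, "output_format": "json"}).encode("utf-8")
    return urllib.request.Request(
        f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items",
        data=body,
        headers={"Authorization": f"Bearer {token}", "Content-Type": "application/json"},
        method="POST",
    )


req = build_sync_request(
    "antonio_espresso/html-to-markdown-converter",
    "APIFY_API_TOKEN",  # placeholder, replace with your token
    ["https://apify.com"],
)
print(req.get_method(), req.full_url)

# To send it with a real token:
# with urllib.request.urlopen(req) as resp:
#     records = json.load(resp)
```

Separating request construction from sending keeps the token handling and URL format easy to check before any billed conversion runs.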

πŸ‘¨β€πŸ’» Author

Made with ❀️ by SEO Antonio Blago

πŸ”— My Other Apify Actors:


βš™οΈ Tech Stack

  • Python 3.10+
  • Apify SDK for Python
  • Advertools for Markdown generation

🧪 Local Dev

pip install -r requirements.txt
python src/markdown_convert.py

Push to Apify:

$ apify push

🧼 License

MIT
