Pricing
$0.01 / actor start
Pandoc Document Converter
Convert documents between formats (HTML, Markdown, DOCX, EPUB, PDF, LaTeX, RST, ODT, PPTX) using Pandoc. Accepts raw text or URL input.
Pricing
$0.01 / actor start
Rating
0.0
(0)
Developer
Actor stats
1
Bookmarked
16
Total users
6
Monthly active users
21 days ago
Last modified
Categories
Share
π Pandoc Document Converter
Convert documents between multiple formats using the powerful Pandoc document conversion engine. Supports HTML, Markdown, DOCX, EPUB, PDF, LaTeX, RST, ODT, PPTX, and more.
β¨ Features
- 20+ format support β Convert between HTML, Markdown, GFM, CommonMark, LaTeX, RST, DOCX, EPUB, ODT, PPTX, PDF, plain text, AsciiDoc, MediaWiki, Org-mode, and more
- URL input β Fetch content directly from a URL and convert it
- Raw text input β Paste HTML, Markdown, or any supported format directly
- Binary output β DOCX, EPUB, ODT, PPTX, and PDF files are saved to the key-value store for easy download
- PDF generation β Powered by WeasyPrint (no heavy LaTeX installation needed)
- Standalone mode β Produce complete documents with proper headers and footers
π§ How It Works
- You provide content (raw text or a URL to fetch from)
- You specify the input format and desired output format
- The Actor runs Pandoc CLI to perform the conversion
- Text output (HTML, Markdown, etc.) is returned in the dataset
- Binary output (DOCX, EPUB, PDF, etc.) is saved to the key-value store and base64-encoded in the dataset
π How to Use
- Set input β Either paste content in the "Content" field or enter a URL in "Source URL"
- Choose formats β Set "Input Format" (e.g.,
html) and "Output Format" (e.g.,markdown) - Run the Actor
- Get results β Check the dataset for text output, or download binary files from the key-value store
Common Conversions
| From | To | Use Case |
|---|---|---|
html | markdown | Convert web pages to Markdown |
markdown | html | Render Markdown as HTML |
html | docx | Save web content as Word document |
markdown | docx | Create Word documents from Markdown |
html | epub | Convert articles to e-book format |
markdown | pdf | Generate PDF from Markdown |
html | plain | Strip HTML tags, extract plain text |
latex | html | Convert LaTeX papers to web format |
html | rst | Convert to reStructuredText |
π Sample Output (text conversion)
{"from_format":"html","to_format":"markdown","input_size_bytes":245,"output_size_bytes":128,"output_type":"text","output":"# Hello World\n\nThis is a **sample HTML** document for conversion.\n\n- Item 1\n- Item 2\n- Item 3\n","converted_at":"2026-03-20T08:30:00.000000"}
π Sample Output (binary conversion)
{"from_format":"html","to_format":"docx","input_size_bytes":245,"output_size_bytes":8432,"output_type":"binary","output_base64":"UEsDBBQAAAAI...","download_key":"output.docx","converted_at":"2026-03-20T08:30:00.000000"}
Binary files (DOCX, EPUB, ODT, PPTX, PDF) are also saved to the key-value store with the key output.<format> for direct download.
π Input Formats
html, markdown, gfm (GitHub Flavored Markdown), commonmark, latex, rst, textile, org, mediawiki, json (Pandoc AST)
π€ Output Formats
html, markdown, gfm, commonmark, latex, rst, plain, docx, epub, odt, pptx, asciidoc, mediawiki, org, pdf
β οΈ Notes
- Input size limit: 10 MB maximum
- PDF output: Uses WeasyPrint engine (supports CSS styling, no LaTeX needed)
- Binary output: Files are base64-encoded in the dataset AND saved to the key-value store for direct download
- URL fetching: Basic HTTP GET with browser-like User-Agent. Sites with advanced anti-bot protection may not work.
- Memory: Recommended 1 GB for large documents or PDF generation
