VOOZH about

URL: https://apify.com/shahidirfan/file-to-markdown

โ‡ฑ File to Markdown ยท Apify


Pricing

Pay per usage

Go to Apify Store

Transform files into clean, readable Markdown instantly. Convert PDFs, documents, images, and more to structured Markdown format. Perfect for automating documentation workflows, content migration, and building knowledge bases. Ideal for developers, writers, and content teams.

Pricing

Pay per usage

Rating

5.0

(1)

Developer

๐Ÿ‘ Shahid Irfan

Shahid Irfan

Maintained by Community

Actor stats

0

Bookmarked

5

Total users

1

Monthly active users

4 months ago

Last modified

Share

File to Markdown Converter

Convert documents, spreadsheets, images, and text files into clean, AI-ready Markdown. Provide a public file URL or upload a file directly in the Actor input.

Features

  • URL conversion โ€” Convert any publicly accessible file URL
  • File upload conversion โ€” Upload a local file via the input UI
  • Metadata included โ€” Get title, tokens, and processing timing (when available)
  • Single-result output โ€” Writes exactly one result item per run

Use Cases

Knowledge Base Preparation

Turn PDFs and docs into Markdown for internal documentation and search.

AI / RAG Pipelines

Convert files into Markdown to feed LLM workflows, embeddings, and agent tools.

Reporting and Analysis

Extract readable text from spreadsheets and reports for downstream processing.

Input Parameters

Provide either url or file. If both are provided, the Actor uses file.

ParameterTypeRequiredDefaultDescription
urlStringNoExample URLPublic file URL to convert
fileStringNoโ€”Upload a file using the input UI
outputFormatStringNojsonjson (recommended) or markdown (URL only)
timeoutSecsIntegerNo30Timeout for conversion and download steps

Output Data

The Actor stores the result in:

  • Default dataset (one item)
  • Key-value store record OUTPUT

Each result contains:

FieldTypeDescription
successBooleanWhether conversion succeeded
sourceString"url" or "upload"
titleStringBest-effort title (when available)
contentStringConverted Markdown content
urlStringOriginal URL (URL conversions)
filenameStringFilename (uploads)
file_typeStringFile extension/type (uploads)
tokensNumberToken count (when available)
duration_msNumberProcessing time (URL conversions, when available)
processing_time_msNumberProcessing time (uploads, when available)

Usage Examples

Convert by URL (recommended)

{
"url":"https://www.rfc-editor.org/rfc/rfc1149.txt",
"outputFormat":"json",
"timeoutSecs":30
}

Convert by URL (Markdown output)

{
"url":"https://www.rfc-editor.org/rfc/rfc1149.txt",
"outputFormat":"markdown"
}

Convert by Upload

{
"file":"(uploaded via UI)",
"outputFormat":"json",
"timeoutSecs":30
}

Sample Output

{
"success":true,
"source":"url",
"url":"https://www.rfc-editor.org/rfc/rfc1149.txt",
"title":"rfc1149.txt",
"content":"# ...",
"method":"...",
"duration_ms":1200,
"tokens":850
}

Tips for Best Results

Use Public URLs

If the file URL requires authentication, upload it via the file input instead.

Start With JSON Output

Use outputFormat: "json" to keep metadata such as token counts when available.

Mind Rate Limits

If the upstream service throttles requests, reduce run frequency or space out runs.

Frequently Asked Questions

Which file types are supported?

Common document, spreadsheet, image, and text formats are supported by the upstream converter.

Where is the result stored?

In the default dataset (one item) and also as the OUTPUT key-value store record.

Can I upload private files?

Yes โ€” use the file input to upload files without needing a public URL.

Integrations

  • Apify API โ€” Fetch results from the dataset or OUTPUT record
  • Make / Zapier โ€” Trigger conversions and forward Markdown downstream
  • Google Sheets / BigQuery โ€” Store and analyze converted outputs

Legal Notice

You are responsible for ensuring you have the right to process the files and URLs you provide.

You might also like

Website To Markdown

swarmgarden/website-to-markdown

Convert any webpage to clean, readable Markdown format. Perfect for content extraction and readability.

70

Markdownify MCP Server

crawlerbros/markdownify-mcp-server

Convert any webpage to clean, formatted Markdown perfect for AI consumption. Ideal for building knowledge bases, documentation scrapers, and content migration tools.

18

5.0

HTML to Markdown

web.harvester/html-to-markdown

Convert HTML to clean Markdown. Supports GFM tables, code blocks, and custom rules. Perfect for content migration and documentation.

3

Markdown to PDF MCP Server

parseforge/markdown-to-pdf-mcp

Convert Markdown content to PDF format using Model Context Protocol (MCP). Perfect for developers, content creators, and businesses who need to programmatically convert Markdown documents to professional PDFs with custom styling, page sizes, margins, and orientations.

8

5.0

Website Content to Markdown for LLM Training

easyapi/website-content-to-markdown-for-llm-training

๐Ÿš€ Transform web content into clean, LLM-ready Markdown! ๐Ÿ“˜ Scrape multiple pages, extract main content, and convert to Markdown format. Perfect for AI researchers, data scientists, and LLM developers. Fast, efficient, and customizable. Supercharge your AI training data today! ๐ŸŒ๐Ÿ“๐Ÿง 

Convert To Markdown

datavault/convert-to-markdown

Convert to Markdown, converts documents, spreadsheets, images (OCR), audio (transcription), and web/data files into clean Markdown. It runs fully locally, requires no API keys, and is ideal for LLMs, docs, and archiving.

Doc To Markdown MCP Server

abotapi/doc-to-markdown-mcp

An MCP server that converts documents to clean Markdown. Convert PDFs, Word docs, Excel spreadsheets, PowerPoints, HTML, images, and more to AI-friendly Markdown format.