URL: https://glama.ai/mcp/servers/search/tools-for-ocr-and-generating-json-or-markdown

Tools for OCR and generating JSON or Markdown | Glama


Search for: Tools for OCR and generating JSON or Markdown

  • Why this server?

    This server extracts text content from local PDF files, supporting OCR capabilities, which is directly relevant to the user's request.

    License: F · Quality: C · Maintenance: D
    An MCP server that provides a tool to extract text content from local PDF files, supporting both standard PDF reading and OCR capabilities with optional page selection.
    Last updated
    1
    30
  • Why this server?

    This server extracts webpage content and transforms it into clean, LLM-optimized Markdown, fulfilling the 'generate markdown' part of the user's query after potentially using OCR.

    License: A · Quality: - · Maintenance: D
    A Python implementation of an MCP server that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown.
    Last updated
    4
    MIT
  • Why this server?

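    Like the previous entry, this server extracts webpage content and transforms it into clean, LLM-optimized Markdown, which covers the 'generate markdown' part of the user's query.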

    License: A · Quality: A · Maintenance: D
    Extracts and transforms webpage content into clean, LLM-optimized Markdown. Returns article title, main content, excerpt, byline and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers and non-essential elements while preserving the core content structure.
    Last updated
    1
    36
    17
    MIT
  • Why this server?

    Generates a JSON file tree from a specified directory's 'src' folder, which aligns with the 'generate json' part of the query.

    License: A · Quality: B · Maintenance: D
    Generates a JSON file tree from a specified directory's 'src' folder, adhering to '.gitignore' rules, allowing for quick project structure review in Claude.
    Last updated
    1
    MIT
  • Why this server?

    Analyzes web content and adds it to your knowledge base, storing content as Markdown files, which fits the use case of extracting information and formatting it.

    License: A · Quality: A · Maintenance: F
    A tool for Model Context Protocol (MCP) that allows you to analyze web content and add it to your knowledge base, storing content as Markdown files for easy viewing with tools like Obsidian.
    Last updated
    3
    10
    MIT
  • Why this server?

    This server is capable of collecting code from files and directories into a single markdown document, a useful function when processing information.

    License: A · Quality: B · Maintenance: -
    MCP server for collecting code from files and directories into a single markdown document.
    Last updated
    2
    8
    MIT
  • Why this server?

    A document conversion server that transforms various file formats to Markdown format.

    License: A · Quality: A · Maintenance: F
    A document conversion server that transforms various file formats (PDFs, documents, images, audio, web content) to Markdown with improved multilingual and UTF-8 support.
    Last updated
    10
    2
    15
    MIT
  • Why this server?

    Provides comprehensive document processing, including reading, converting, and manipulating various document formats with advanced text and HTML processing capabilities, which could be combined with OCR to convert and format the extracted text.

    License: A · Quality: B · Maintenance: D
    Provides comprehensive document processing, including reading, converting, and manipulating various document formats with advanced text and HTML processing capabilities.
    Last updated
    16
    47
    17
    MIT
  • Why this server?

    A powerful Model Context Protocol framework that extends Cursor IDE with tools for web content retrieval, PDF processing, and Word document parsing. This also supports OCR use cases.

    License: A · Quality: C · Maintenance: C
    A powerful Model Context Protocol framework that extends Cursor IDE with tools for web content retrieval, PDF processing, and Word document parsing.
    Last updated
    8
    17
    MIT