VOOZH about

URL: https://glama.ai/mcp/servers/search/methods-to-scrape-a-website-and-fetch-markdown-files-from-github

⇱ Methods to Scrape a Website and Fetch Markdown Files from GitHub | Glama


Search for:

Methods to Scrape a Website and Fetch Markdown Files from GitHub

View all MCP Servers

  • Why this server?

    This server can retrieve web page content and convert HTML to markdown, which aligns with the user's need to '抓取整个文档网站'.

  • Why this server?

    This server allows access to GitHub repositories, which can be used to retrieve markdown files as requested by the user.

    A
    license
    -
    quality
    D
    maintenance
    A server that allows AI assistants to browse and read files from specified GitHub repositories, providing access to repository contents via the Model Context Protocol.
    Last updated
    6
    MIT
  • Why this server?

    This provides GitHub data analysis, useful if the user wants to grab markdown from a github repo, it may give some useful context for analysis

    A
    license
    A
    quality
    D
    maintenance
    Provides GitHub data analysis for repositories, developers, and organizations, enabling insights into open source ecosystems through API calls and natural language queries.
    Last updated
    5
    14
    MIT
  • Why this server?

    This is a scraper tool which would allow for web scraping of documentation sites. It allows flexible options for parsing and rendering.

    A
    license
    A
    quality
    C
    maintenance
    A scraper tool that leverages the Oxylabs Web Scraper API to fetch and process web content with flexible options for parsing and rendering pages, enabling efficient content extraction from complex websites.
    Last updated
    4
    95
    MIT
  • Why this server?

    This provides RAG capabilities which are needed to retrieve documents, and provides the semantic document search that is helpful.

    A
    license
    -
    quality
    -
    maintenance
    Provides RAG capabilities for semantic document search using Qdrant vector database and Ollama/OpenAI embeddings, allowing users to add, search, list, and delete documentation with metadata support.
    Last updated
    10
    16
  • Why this server?

    This will be useful to fetch web content. The MCP provides content in multiple formats (HTML, JSON, Markdown, text) with automatic format detection.

    F
    license
    B
    quality
    D
    maintenance
    A Model Context Protocol server that enables LLMs to fetch and process web content in multiple formats (HTML, JSON, Markdown, text) with automatic format detection.
    Last updated
    5
    5
  • Why this server?

    For more complete and in depth Git Repository operations, this server provides read, search, and manipulation Git repositories.

  • Why this server?

    This will be able to access repository information and manage workflows.

    A
    license
    C
    quality
    F
    maintenance
    An MCP server that enables integration with GitHub Enterprise API, allowing users to access repository information, manage issues, pull requests, workflows, and other GitHub features through Cursor.
    Last updated
    28
    11
    28
    ISC
  • Why this server?

    If the documentation site is on Google drive, this server enables listing, reading, and searching over files

    A
    license
    -
    quality
    F
    maintenance
    Enables integration with Google Drive for listing, reading, and searching over files, supporting various file types with automatic export for Google Workspace files.
    Last updated
    7,666
    69
    MIT