URL: https://glama.ai/mcp/servers/search/how-to-scrape-websites-and-retrieve-information

How to scrape websites and retrieve information | Glama


Search for: How to scrape websites and retrieve information

  • Why this server?

    Enables fetching web content using the Node.js undici library, supporting various HTTP methods and content formats.

    License: A | Quality: B | Maintenance: D
    An MCP server that enables fetching web content using the Node.js undici library, supporting various HTTP methods, content formats, and request configurations.
    Last updated
    3
    1,184
    11
    MIT
  • Why this server?

    Provides unified access to multiple search engines, AI tools, and content processing services.

    License: A | Quality: B | Maintenance: B
    🔍 A Model Context Protocol (MCP) server providing unified access to multiple search engines (Tavily, Brave, Kagi), AI tools (Perplexity, FastGPT), and content processing services (Jina AI, Kagi). Combines search, AI responses, content processing, and enhancement features through a single interface.
    Last updated
    3
    600
    313
  • Why this server?

    Enables LLMs to search, retrieve, and manage documents through Rememberizer's knowledge management API.

    License: A | Quality: - | Maintenance: D
    A Model Context Protocol server enabling LLMs to search, retrieve, and manage documents through Rememberizer's knowledge management API.
    Last updated
    35
    Apache 2.0
  • Why this server?

    Provides functionality to fetch web content in various formats, including HTML, JSON, plain text, and Markdown.

    License: A | Quality: A | Maintenance: D
    Last updated
    4
    4
    5,461
    786
    MIT
  • Why this server?

    Provides AgentQL's data extraction capabilities, enabling AI agents to get structured data from the unstructured web.

    License: A | Quality: A | Maintenance: C
    A server that provides AgentQL's data extraction capabilities, enabling AI agents to get structured data from the unstructured web.
    Last updated
    1
    184
    174
    MIT
  • Why this server?

    Enables retrieval and processing of web page content for LLMs by converting HTML to markdown.

    License: A | Quality: B | Maintenance: D
    Enables retrieval and processing of web page content for LLMs by converting HTML to markdown, with support for content truncation and pagination.
    Last updated
    1
    3
    MIT
  • Why this server?

    Enables AI assistants to interact with the Unstructured API, providing tools to list, create, update, and manage sources, destinations, and workflows.

  • Why this server?

    Enables semantic search, image search, and cross-modal search functionalities through integration with Jina AI's neural search capabilities.

    License: A | Quality: - | Maintenance: D
    Last updated
    5
    MIT
  • Why this server?

    Implementation of an MCP server for the RAG Web Browser Actor. This Actor serves as a web browser for large language models (LLMs) and RAG pipelines, similar to a web search in ChatGPT.