
URL: https://glama.ai/mcp/servers/search/a-tool-for-extracting-data-from-websites

A tool for extracting data from websites | Glama


Search for:

A tool for extracting data from websites


  • Why this server?

    This server is a dedicated scraper tool leveraging the Oxylabs Web Scraper API, designed for fetching and processing web content, making it a direct match for 'web scraper'.

    License: A, Quality: A, Maintenance: C
    A scraper tool that leverages the Oxylabs Web Scraper API to fetch and process web content with flexible options for parsing and rendering pages, enabling efficient content extraction from complex websites.
    Last updated
    4
    95
    MIT
  • Why this server?

    This server explicitly enables 'web scraping and crawling capabilities' for various needs, supporting different engines and output formats, directly addressing the user's request.

    License: A, Quality: -, Maintenance: C
    Enables web scraping and crawling capabilities for LLM clients, supporting single-page scraping, multi-page website crawling, and web search with multiple engines (Playwright, Cheerio, Puppeteer) and flexible output formats including markdown, HTML, text, and screenshots.
    Last updated
    11
    6
    MIT
  • Why this server?

    This server focuses on 'web scraping and data extraction from websites' with advanced features like geographic flexibility and anti-detection, perfectly fitting the user's search.

    License: A, Quality: B, Maintenance: B
    Enables web scraping and data extraction from websites with geographic flexibility, privacy features, and anti-detection capabilities. Supports scraping general websites, Google Search, Amazon Search, and Reddit with customizable parameters for rendering, geolocation, and locale.
    Last updated
    30
    78
    30
    ISC
  • Why this server?

    This server enables 'intelligent web scraping through a browser automation tool,' which directly provides the functionality the user is looking for.

    License: F, Quality: -, Maintenance: D
    Enables intelligent web scraping through a browser automation tool that can search Google, navigate to webpages, and extract content from various websites including GitHub, Stack Overflow, and documentation sites.
    Last updated
    1
  • Why this server?

    This is a 'TypeScript-based web scraping server' offering multiple export formats and support for dynamic websites, making it a very strong match.

    License: A, Quality: A, Maintenance: C
    A TypeScript-based web scraping server built on the Model Context Protocol that offers multiple export formats, content extraction rules, and support for both static and dynamic (SPA) websites.
    Last updated
    7
    12
    1
    MIT
  • Why this server?

    This server is described as a 'comprehensive web scraping server' that transforms web content, indicating it's precisely what the user is looking for.

    License: A, Quality: B, Maintenance: D
    A comprehensive web scraping server that transforms web content into clean, agent-ready Markdown with automatic citations and efficient caching. It features a robust suite of tools for metadata extraction, sentiment analysis, SEO auditing, and security scanning while strictly adhering to robots.txt policies.
    Last updated
    48
    16
    18
    MIT
  • Why this server?

    This server is a 'lightweight web scraping server' designed to extract various types of data from websites, aligning directly with the user's need for a web scraper.

    License: F, Quality: -, Maintenance: D
    A lightweight web scraping server that allows Claude Desktop users to extract various types of data from websites, including text, links, images, tables, headlines, and metadata using CSS selectors.
    Last updated
    4
  • Why this server?

    This server 'provides advanced web scraping' capabilities, including smart content extraction and browser automation, making it a highly relevant option.

    License: A, Quality: -, Maintenance: D
    Provides advanced web scraping with HTTP client, smart content extraction to Markdown, browser automation via Playwright, screenshot/PDF generation, and Docker sandbox execution environments.
    Last updated
    1
    MIT
  • Why this server?

    Described as a 'headless web scraping server that extracts main content from web pages,' this server is a direct match for the user's request for a web scraper.

    License: A, Quality: -, Maintenance: A
    A headless web scraping server that extracts main content from web pages into Markdown, text, or HTML for AI and automation integration. It features per-domain rate limiting and robust error handling using Playwright and BeautifulSoup.
    Last updated
    MIT