VOOZH about

URL: https://glama.ai/mcp/servers/integrations/apache-spark

⇱ Apache Spark | Glama


  • Why this server?

    Provides tools for searching and retrieving Apache Spark documentation, enabling full-text keyword searches with section filtering and access to the full content of documentation pages.

    A
    license
    A
    quality
    A
    maintenance
    Provides full-text search and retrieval tools for Apache Spark documentation using SQLite FTS5 with BM25 ranking. It enables AI assistants to efficiently search, filter by section, and read specific Spark documentation pages.
    Last updated
    2
    MIT
  • Why this server?

    Provides read-only access to Apache Spark data through SQL models, allowing for querying live data via natural language questions without requiring SQL knowledge. Tools include listing available tables, retrieving column information, and executing SQL SELECT queries against Spark.

  • Why this server?

    Offers information about Duyet's expertise with Apache Spark through CV resources and tools, enabling discussions about data engineering projects.

    F
    license
    B
    quality
    B
    maintenance
    An experimental Model Context Protocol server that enables AI assistants to access information about Duyet, including his CV, blog posts, and GitHub activity through natural language queries.
    Last updated
    8
    2
  • Why this server?

    Utilizes Apache Spark for writing Parquet/ORC file formats to MinIO storage as part of data processing pipelines.

    A
    license
    -
    quality
    C
    maintenance
    MCP server with 32 tools for ETL ingestion, AI-generated data quality rules, AI transformations, vector search, and natural-language SQL. Works across Postgres, MongoDB, Kafka, S3/MinIO, HashiCorp Vault, and five vector stores (Qdrant, Weaviate, Milvus, Chroma, pgvector).
    Last updated
    10
  • Why this server?

    Provides searchable documentation for Apache Spark as part of the data engineering knowledge base.

    A
    license
    -
    quality
    D
    maintenance
    Provides AI assistants with searchable access to documentation from 170+ curated repositories and 1000+ popular GitHub projects across 20+ categories including trading, AI/ML, DevOps, and web development.
    Last updated
    3
    MIT
  • Why this server?

    Integrates with Spark History Server to provide detailed job analysis, performance diagnostics, and workload-specific configuration recommendations for Spark applications.

    A
    license
    -
    quality
    D
    maintenance
    Provides intelligent guidance for EMR cluster management, configuration recommendations, and monitoring capabilities
    Last updated
    1
    MIT
  • Why this server?

    Connects to Apache Spark History Server to query and analyze Spark applications, jobs, stages, executors, SQL queries, and more, enabling AI agents to investigate performance, failures, and bottlenecks.

    A
    license
    -
    quality
    B
    maintenance
    Exposes Spark History Server data as tools for AI agents, enabling natural language querying of Spark applications, jobs, stages, and performance metrics.
    Last updated
    177
    Apache 2.0
  • Why this server?

    Provides query optimization and data discovery capabilities for Apache Spark by exposing logical and physical query plans, catalog and table information to AI systems.

    A
    license
    -
    quality
    C
    maintenance
    A server implementation of MCP for Apache Spark that provides query plans and catalog information to AI systems for query optimization and data discovery.
    Last updated
    18
    Apache 2.0
  • Why this server?

    Provides tools to optimize Apache Spark code, including automatic optimization of PySpark code and performance analysis with execution metrics.

    F
    license
    -
    quality
    D
    maintenance
    An MCP server that optimizes Apache Spark code using Claude AI, providing intelligent code optimization suggestions and performance analysis.
    Last updated
    29