VOOZH about

URL: https://apify.com/bronze_quarterback/rag-pipeline-manager-mcp

โ‡ฑ Rag Pipeline Manager Mcp ยท Apify


Pricing

$49.00/month + usage

Go to Apify Store

Pricing

$49.00/month + usage

Rating

0.0

(0)

Developer

๐Ÿ‘ Segun Zubair

Segun Zubair

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

4 months ago

Last modified

Share

RAG Pipeline Manager MCP Server

Manage your RAG (Retrieval-Augmented Generation) pipelines through Claude or any MCP-compatible client. Ingest documents, run semantic searches, and manage Qdrant vector database collections โ€” all through natural language.

Quick Start

Claude Desktop

Add to your Claude Desktop MCP config (claude_desktop_config.json):

{
"mcpServers":{
"rag-pipeline-manager":{
"url":"https://your-apify-actor-url/runs/standby"
}
}
}

Local Development

git clone <repo>
cd rag-pipeline-manager
pip install-r requirements.txt
EMBEDDING_API_KEY=your-key python src/server.py

Configuration

ParameterDescriptionRequiredDefault
QDRANT_URLQdrant server URLNohttp://localhost:6333
QDRANT_API_KEYQdrant API keyNoโ€”
EMBEDDING_BASE_URLOpenAI-compatible embedding API base URLNohttps://api.openai.com/v1
EMBEDDING_API_KEYAPI key for embedding serviceYesโ€”
EMBEDDING_MODELEmbedding model nameNotext-embedding-3-small
EMBEDDING_DIMENSIONVector dimension sizeNo1536

Available Tools

ingest_document

Upload and index a PDF or TXT document into a Qdrant collection.

  • Parameters: file_path (str, required), collection_name (str, required), chunk_size (int, default 512), chunk_overlap (int, default 50)
  • Returns: Summary with document_id, chunk count, and collection name

search_collection

Semantic search across a collection with optional metadata filters.

  • Parameters: query (str, required), collection_name (str, required), top_k (int, default 5), filters (dict, optional)
  • Returns: Ranked results with text, score, and metadata

list_collections

List all Qdrant collections with document counts.

  • Returns: Array of collection names and point counts

delete_document

Remove a document and all its chunks from a collection.

  • Parameters: document_id (str, required), collection_name (str, required)
  • Returns: Confirmation with deleted point count

get_collection_stats

Get detailed statistics for a collection.

  • Parameters: collection_name (str, required)
  • Returns: Point count, vector dimension, and segment info

reindex_collection

Re-embed all documents in a collection with a new or updated model.

  • Parameters: collection_name (str, required), model (str, optional)
  • Returns: Summary with reindexed point count

Example Prompts

  1. "Ingest the quarterly report PDF into the 'financial-docs' collection"
  2. "Search the knowledge-base collection for information about return policies"
  3. "How many documents are in each of my collections?"
  4. "Delete document abc-123 from the contracts collection"
  5. "Reindex the support-tickets collection using text-embedding-3-large"

Pricing

PlanPriceIncludes
Monthly$49/moUnlimited tool calls, all 6 tools, priority support

Local Development

# Start Qdrant locally
docker run -p6333:6333 qdrant/qdrant
# Install dependencies
pip install-r requirements.txt
# Run the server
exportEMBEDDING_API_KEY=your-key
python src/server.py

Troubleshooting

  • Connection refused to Qdrant: Ensure QDRANT_URL is correct and Qdrant is running
  • Embedding API errors: Verify EMBEDDING_API_KEY is valid and EMBEDDING_BASE_URL points to an OpenAI-compatible endpoint
  • Large file timeouts: For PDFs over 100 pages, consider splitting into smaller files before ingestion
  • Collection not found: Use list_collections to verify collection names โ€” they are case-sensitive

You might also like

RAG Pipeline

labrat011/rag-pipeline

One-click RAG pipeline: chunks text, generates embeddings, and stores vectors in Pinecone or Qdrant. Provide your content and API keys -- the orchestrator handles the rest.

Zillow Property Manager Scraper

crawlerbros/zillow-property-manager-scraper

Scrape Zillow property manager profiles: business name, phone, email, ratings, reviews, rental listings, specialties, and service areas. Search by city/state or provide direct profile URLs

RAG Pipeline Data Collector

scraper_guru/rag-pipeline-data-collector

AI-ready web content extraction for RAG systems, LLMs, and AI agents. Single-page or multi-page scraping with parallel processing.

๐Ÿ‘ User avatar

LIAICHI MUSTAPHA

5

GPT Crawler MCP โ€” Knowledge files for ChatGPT, Claude, RAG

kazkn/gpt-crawler-mcp

Crawl any website and turn it into a clean knowledge file for your custom GPT, Claude Project, or RAG pipeline. Native MCP server in Standby mode + classic batch mode.

Pharma Pipeline Intelligence MCP Server

ryanclinton/pharma-pipeline-intelligence-mcp

Drug pipeline competitive intelligence and threat scoring for pharmaceutical companies, biotech investors, and medical affairs teams.