VOOZH about

URL: https://apify.com/joeslade/docs-mcp-server-starter

⇱ Docs MCP Server β€” Live Docs Search for Claude & Cursor Β· Apify


πŸ‘ Docs MCP Server Starter β€” Live Docs: Claude, Cursor & AI Agents avatar

Docs MCP Server Starter β€” Live Docs: Claude, Cursor & AI Agents

Pricing

Pay per usage

Go to Apify Store

Docs MCP Server Starter β€” Live Docs: Claude, Cursor & AI Agents

Persistent MCP server that gives Claude, Cursor, and any MCP-compatible AI assistant queryable access to technical documentation. Indexes any docs site, exposes search and fetch tools over MCP, caches pages for speed. Ships with templates for Next.js, Tailwind, React, TypeScript, Prisma.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

πŸ‘ Joe Slade

Joe Slade

Maintained by Community

Actor stats

1

Bookmarked

1

Total users

1

Monthly active users

7 days ago

Last modified

Share

Docs MCP Server Starter

Give Claude, Cursor, and any MCP-compatible AI assistant queryable access to up-to-date technical documentation. This Apify Actor runs a persistent MCP server that indexes docs sites, exposes search and fetch tools over the Model Context Protocol, and caches pages for fast follow-up queries.

Ships with five ready-to-use templates (Next.js, Tailwind CSS, React, TypeScript, Prisma). Fork it for any docs site by supplying a URL and a CSS content selector β€” no code changes required.

πŸ‘ Docs MCP Server Starter demo β€” get_toc, search_docs, and get_page running against a local Standby server

Who is this for?

  • Developers using AI coding assistants who want answers grounded in the current docs, not training-data snapshots that may be months out of date
  • Teams with internal documentation who want their AI tooling to answer questions against their own docs, not just public sources
  • MCP builders who want a working Standby-mode reference implementation to fork

What it does

  • Crawls and indexes up to 10 documentation sources on startup
  • Exposes four MCP tools over JSON-RPC on a persistent HTTP endpoint
  • Caches fetched pages in an LRU cache (default 50) so repeat queries return instantly
  • Returns content as clean markdown by default, or raw text on request

Use cases

  • Keep your AI assistant current. Point it at the Next.js, React, or TypeScript docs so Claude or Cursor answers from today's API surface instead of a months-old training snapshot.
  • Query your team's internal docs. Index a private or internal documentation site (any static HTML) and let your AI tooling answer against your own sources, not just public ones.
  • A RAG-free docs layer. Give an AI agent searchable, fetchable docs without standing up a vector store, embedding pipeline, or re-indexing job β€” keyword search over live pages.
  • Fork it as an MCP reference implementation. Shipping your own Standby-mode MCP server? Start here β€” the indexing, caching, and JSON-RPC wiring is already done.

Input

FieldTypeDefaultNotes
sourcesarray (1–10)requiredEach source needs a name plus either a template ID or a custom url + contentSelector. Optional sitemapUrl.
maxPagesPerSourceinteger (1–500)200Cap on how many pages per source get indexed at startup. Lower this to speed up boot for large docs sites.
cacheMaxPagesinteger (1–200)50LRU page cache shared across all sources. Raise this if you query the same pages repeatedly.
markdownOutputbooleantrueConvert extracted page HTML to markdown. Set false to return raw text.

Curated templates

nextjs, tailwind, react, typescript, prisma

Custom source example

{
"sources":[
{"name":"Apify SDK","url":"https://docs.apify.com/sdk/js","contentSelector":"main article"}
]
}

MCP tools

ToolPurposeRequired args
list_sourcesList configured sources with page countsβ€”
get_tocReturn the page index (table of contents) for a sourcesource
search_docsCase-insensitive keyword search across titles and cached contentquery (optional: source, maxResults ≀ 30)
get_pageFetch full page content; checks LRU cache firsturl

Example tool calls

{"jsonrpc":"2.0","id":1,"method":"tools/call",
"params":{"name":"list_sources","arguments":{}}}
{"jsonrpc":"2.0","id":2,"method":"tools/call",
"params":{"name":"search_docs",
"arguments":{"query":"server actions","source":"Next.js Docs","maxResults":5}}}
{"jsonrpc":"2.0","id":3,"method":"tools/call",
"params":{"name":"get_page",
"arguments":{"url":"https://nextjs.org/docs/app/building-your-application/data-fetching/server-actions"}}}

Example tool output

Each tool returns JSON inside the standard MCP result.content[].text envelope. The inner payloads look like this:

search_docs β†’

{
"query":"server actions",
"source":"Next.js Docs",
"results":[
{
"title":"Server Actions and Mutations",
"url":"https://nextjs.org/docs/app/building-your-application/data-fetching/server-actions",
"source":"Next.js Docs",
"snippet":"Server Actions are asynchronous functions executed on the server…",
"matchType":"content"
}
]
}

get_page β†’ page content as markdown (or raw text when markdownOutput is false):

{
"url":"https://nextjs.org/docs/app/building-your-application/data-fetching/server-actions",
"title":"Server Actions and Mutations",
"source":"Next.js Docs",
"content":"# Server Actions and Mutations\n\nServer Actions are asynchronous functions…",
"cachedAt":"2026-05-30T12:00:00.000Z"
}

list_sources β†’ { "sources": [{ "name": "Next.js Docs", "url": "https://nextjs.org/docs", "pageCount": 200 }] }

How it works

  1. Boot: Actor.init(), validate input, resolve any curated templates
  2. Index: for each source, fetch the sitemap (or discover from the entry URL), cap at maxPagesPerSource, and build a page index (url + title)
  3. Serve: start an HTTP server on ACTOR_STANDBY_PORT (default 4321); accept POST requests carrying JSON-RPC MCP messages
  4. Cache: get_page returns cached content when available; on miss it fetches, extracts via the source's contentSelector, optionally converts to markdown, and stores in the LRU
  5. Search: search_docs scans page titles in all source indexes and content in the LRU cache only β€” page bodies are not pre-indexed; warm the cache by calling get_page on the pages you want searchable

Running on Apify

This actor runs in Standby mode β€” a persistent HTTP server, not a batch job. Connect an MCP client to the Standby URL exposed by Apify after deploy. The server starts after indexing completes; check the run logs for MCP server listening. before sending requests.

Running on Apify also means the platform's advantages come for free: scheduling to re-index on a cadence, monitoring and alerts on the server's health, API access to every run, and integrations with the rest of your stack β€” none of which you'd get from a docs server you host yourself.

Pricing

This Actor runs in Standby mode, so it bills for the Apify compute time the server is active β€” there's no per-result charge. Because Standby scales to zero when idle, you pay only while it's actually serving MCP requests (plus a short keep-warm window), not around the clock.

  • Indexing happens once at boot, then queries are answered from the in-memory page index and the LRU cache β€” so steady-state cost stays low even under frequent use.
  • Lower maxPagesPerSource for faster, cheaper boots on large docs sites; raise cacheMaxPages if you query the same pages repeatedly.
  • Try it on the Apify free tier to see real compute usage for your sources before committing to a plan.

See the Pricing section on the Actor's detail page for current rates.

Connect your AI assistant

This actor speaks MCP over JSON-RPC at its Standby URL. Grab the exact URL and your access token from the actor's Standby tab in the Apify Console after the first run.

Claude Desktop / Cursor (stdio clients)

Most desktop MCP clients speak stdio, so bridge to the remote HTTP endpoint with mcp-remote. Add this to your MCP config β€” claude_desktop_config.json for Claude Desktop, or your Cursor MCP settings:

{
"mcpServers":{
"docs":{
"command":"npx",
"args":[
"-y","mcp-remote",
"https://<your-standby-url>.apify.actor?token=<APIFY_TOKEN>"
]
}
}
}

Replace <your-standby-url> and <APIFY_TOKEN> with the values from the Standby tab. (The token can also be sent as an Authorization: Bearer header if your client supports custom headers.)

HTTP / streamable MCP clients

Clients that speak MCP over HTTP can point straight at the Standby URL and supply the Apify token per their auth settings. Once connected, the four tools β€” list_sources, get_toc, search_docs, get_page β€” appear automatically.

Design choices (v1)

  • Keyword search, not vectors. No embedding costs, no vector store to maintain, predictable behavior. Good fit for docs lookups where terminology is precise.
  • Static HTTP fetching only. Works with any docs site served by a static generator (Hugo, Next.js static export, Docusaurus, WordPress, MkDocs, etc.). Sites that require JS execution or sit behind bot challenges (Cloudflare, login walls) won't index β€” pick the underlying static source instead.
  • Search reads cached bodies; uncached pages match on title. Warm the cache by calling get_page on the pages you want full-text searchable.
  • Caps. Max 10 sources, 500 pages per source, 200 pages in cache.

FAQ

How do I connect this to Claude Desktop or Cursor? See Connect your AI assistant. Desktop clients bridge to the Standby URL with mcp-remote; HTTP-capable clients point at the URL directly.

Does it work with private or internal docs? Yes β€” any static-HTML docs site reachable over HTTP. Supply a url + contentSelector instead of a curated template. Sites behind login walls or bot challenges (Cloudflare, auth gates) won't index; point it at the underlying static source.

How is this different from a vector RAG pipeline? No embeddings, no vector store. It runs keyword search over page titles and cached page bodies β€” so there are no embedding costs and the results are predictable and debuggable. That's a good fit when docs terminology is already precise. For fuzzy semantic recall across a huge corpus, a vector approach may suit you better.

Which docs sites are supported out of the box? Five curated templates: Next.js, Tailwind CSS, React, TypeScript, and Prisma. Any other static docs site works via a custom url + contentSelector.

Why does search miss some pages? search_docs matches titles across every indexed page, but full-text matching only covers pages already in the cache. Warm the cache by calling get_page on the pages you want fully searchable (see How it works).

Does it support JavaScript-rendered docs? No β€” v1 uses static HTTP fetching only. JS-rendered or bot-challenged sites won't index; use the underlying static source instead.

Local development

npminstall
npmtest# pipeline, cache, indexer, extractor, mcp, searcher, sitemap suites
apify run # local Standby β€” server listens on ACTOR_STANDBY_PORT or 4321

Other Actors in this collection

Part of a small suite of focused, composable dev-tool Actors β€” same philosophy: do one thing, stay testable, wire into a pipeline.

  • GitHub Repo Intelligence MCP β€” an MCP server that gives your AI agent an opinionated verdict on whether a GitHub repo is actively maintained or abandoned.
  • Changelog Triage Agent β€” monitors product changelogs and classifies every entry as BREAKING, WARNING, or INFO so you catch deprecations early.
  • SERP Topic Gap Monitor β€” finds the topics your competitors rank for that your site is missing.

License

Apache-2.0

You might also like

Airbnb MCP Server β€” Claude, Cursor & AI Agents

makework36/airbnb-mcp-server

Plug-and-play Airbnb MCP server for Claude Desktop, Cursor and any MCP-compatible AI agent. Search listings, compare prices across cities, pull ratings, photos and coordinates β€” no login, no API key.

πŸ‘ User avatar

deusex machine

18

MCP tools – Turn Any Website into an AI Tool in 60 Seconds

clever_fashion/mcp-website-tool

Automatically extract buttons, inputs & forms from any site and get ready-to-use MCP (Model Context Protocol) tools for Cursor, Claude, Claude Desktop, Windsurf, Cline, and any MCP-compatible AI.

πŸ‘ User avatar

Data Farming Team

8

Competitive Intelligence MCP Server

alizarin_refrigerator-owner/competitive-intelligence-mcp-server

9-in-1 MCP server for AI-powered competitor analysis. Use with Claude, GPT or any AI assistant that supports MCP tools. This MCP server bundles 9 competitive intelligence tools into one endpoint. Monitor competitors Analyze Facebook Ads Detect tech stack Aggregate B2B review Research companies

Lead Generation MCP Server β€” AI Agent Tools

samstorm/lead-gen-mcp-server

MCP server for AI agents. Connect Claude, GPT, or any MCP client to find business leads, hiring companies, building permits, and Amazon product research. One endpoint, 13 lead gen tools.

1

Related articles

Best MCP servers for developers
Read more
Build and deploy MCP servers in minutes with a TypeScript template
Read more
How to add Apify MCP Server to Cursor IDE
Read more