Sponsored by Deepsite.site

Go Docs Mcp

Created By
drolosofta month ago
Multi-format document MCP server — read, search, OCR, and extract from PDF, TXT, MD, DOCX, CSV, and images. Single Go binary, 12 tools, smart caching.
Overview

go-docs-mcp

go-docs-mcp

Go License: MIT MCP

Install and Go. One command, single binary. Your AI reads any document — PDF, text, Markdown, DOCX, images.

Go MCP server for multi-format document access — read, search, extract images, OCR, and fetch documents from URLs via the Model Context Protocol.

go install github.com/drolosoft/go-docs-mcp@latest
# That's it. Single binary, starts in milliseconds.

For a deeper look at why an MCP server beats a direct tool, see Why MCP?


🏆 Why go-docs-mcp?

Capabilitygo-docs-mcpNode/TSPythonRust
Single binary, no runtime❌ needs Node❌ needs Python
📦go install one-liner❌ npm+deps❌ pip+venv❌ cargo
📄Multi-format (6 types)❌ one format❌ one format❌ one format
🔍Full-text search⚠️ partial
👁️OCR (scanned PDFs)⚠️ partial
🖼️Image extraction⚠️ partial
📊Table extraction⚠️ partial
📑Document outline⚠️ partial
🌐Fetch from URL⚠️ partial
🔒Dir-locked, read-only⚠️ varies⚠️ varies
💾Smart caching
🏠Fully offline

Every other document MCP server handles one format — a PDF server for PDFs, a DOCX server for DOCX. You'd need three tools to read three formats. go-docs-mcp reads them all from a single binary.


📋 Features — 12 Tools in 5 Categories

CategoryToolDescription
Discoverylist_documentsList all available documents with metadata (filename, format, page count, size)
Discoverylist_formatsList supported document formats and their dependencies
Readingread_documentRead full text, a specific page, or page ranges from any supported document
Readingread_urlDownload a document from a URL and extract its text content
Readingget_document_summaryGet the first 3 pages of text as a quick overview
Searchsearch_documentCase-insensitive full-text search with context and page hints
Analysisget_document_metadataGet full document metadata (title, author, dates, version, etc.)
Analysisget_document_outlineExtract document outline / table of contents
Analysisextract_tablesExtract tables from documents as structured data
Analysisextract_imagesExtract images from a document as base64-encoded data (max 10 per call)
OCRocr_documentForce OCR on a PDF — for scanned/image-based documents or garbled text
OCRread_imageExtract text from an image file (PNG, JPG, TIFF) via OCR
  • Fast — mtime-based in-memory caching avoids redundant extraction
  • Multi-format — PDF, TXT, MD, CSV, DOCX, and images from a single server
  • OCR — automatic fallback to tesseract for image-based/scanned documents
  • Secure — directory-locked access with path traversal prevention
  • Simple — single binary, stdio transport, zero configuration required
  • Portable — works on macOS and Linux

📄 Supported Formats

FormatDependenciesNotes
PDFpoppler (pdftotext, pdfinfo, pdfimages, pdftoppm)Full support — text, images, metadata, OCR fallback
TXT, MD, CSVNoneNative, zero dependencies
DOCXpandoc or Go lib (optional)Word document extraction
Images (PNG, JPG, TIFF)tesseract (optional)OCR text extraction from image files

📦 Prerequisites

  • Go 1.25+ (install)
  • poppler (provides pdftotext, pdfinfo, pdfimages, pdftoppm) — required for PDF support
  • tesseract (optional, for OCR support — scanned PDFs and images)
  • pandoc (optional, for DOCX support)
# macOS
brew install poppler
brew install tesseract        # optional: enables OCR for scanned docs + images
brew install pandoc           # optional: enables DOCX support

# Debian/Ubuntu
apt install poppler-utils
apt install tesseract-ocr     # optional: enables OCR for scanned docs + images
apt install pandoc            # optional: enables DOCX support

# Fedora/RHEL
dnf install poppler-utils
dnf install tesseract         # optional: enables OCR for scanned docs + images
dnf install pandoc            # optional: enables DOCX support

Format note: TXT, MD, and CSV work out of the box with zero dependencies. PDF requires poppler. DOCX requires pandoc. Images require tesseract. Install only what you need.


🚀 Installation

From source

go install github.com/drolosoft/go-docs-mcp@latest

Build locally

git clone https://github.com/drolosoft/go-docs-mcp.git
cd go-docs-mcp
make build      # produces ./go-docs-mcp
make install    # installs to /usr/local/bin/

⚙️ Configuration

The server reads documents from a configured directory. Set DOCS_MCP_DIR to change it:

VariableDefaultDescription
DOCS_MCP_DIR~/.docs-mcp/documents/Directory containing document files to serve
PDF_MCP_DIR(backward compat alias)Legacy alias — works the same as DOCS_MCP_DIR

Place your documents in the directory and the server will find them automatically. All supported formats (PDF, TXT, MD, CSV, DOCX, images) are detected.


💡 Usage

With Claude Code

Add to your .claude/settings.json:

{
  "mcpServers": {
    "docs": {
      "command": "go-docs-mcp",
      "env": {
        "DOCS_MCP_DIR": "/path/to/your/documents"
      }
    }
  }
}

With Claude Desktop

Add to your Claude Desktop configuration (~/Library/Application Support/Claude/claude_desktop_config.json on macOS):

{
  "mcpServers": {
    "docs": {
      "command": "/usr/local/bin/go-docs-mcp",
      "env": {
        "DOCS_MCP_DIR": "/path/to/your/documents"
      }
    }
  }
}

With any MCP client

The server communicates over stdio using JSON-RPC 2.0. Launch the binary and pipe JSON-RPC messages to stdin:

echo '{"jsonrpc":"2.0","id":1,"method":"tools/list","params":{}}' | go-docs-mcp

📖 Tool Reference

list_documents

Lists all documents in the configured directory with format detection.

Parameters: None

Example output:

[
  {
    "filename": "architecture-guide.pdf",
    "format": "pdf",
    "title": "architecture-guide",
    "pages": 42,
    "size_bytes": 1048576
  },
  {
    "filename": "notes.md",
    "format": "markdown",
    "title": "notes",
    "size_bytes": 4096
  }
]

list_formats

Lists all supported document formats and their dependency status.

Parameters: None


read_document

Reads the extracted text content of a document. For PDFs, automatically falls back to OCR if the document is image-based/scanned and pdftotext returns empty text.

Parameters:

NameTypeRequiredDescription
filenamestringYesThe document filename to read
pagenumberNoSingle page number (1-based). Omit for full text.
pagesstringNoPage ranges, e.g. "1-5", "10", "1-3,7,10-12". Overrides page.

Example input:

{
  "filename": "architecture-guide.pdf",
  "pages": "1-3,10-12"
}

search_document

Searches within a document for lines matching a query. Returns matches with 2 lines of context and approximate page numbers.

Parameters:

NameTypeRequiredDescription
filenamestringYesThe document filename to search
querystringYesSearch query (case-insensitive)

Example output:

Found 3 matches for 'microservice' in architecture-guide.pdf:

--- Match 1 (page ~2, line 45) ---
  The system is composed of several
> microservice components that communicate
  via gRPC and message queues.

get_document_summary

Returns the text from the first 3 pages of a document as a quick summary.

Parameters:

NameTypeRequiredDescription
filenamestringYesThe document filename to summarize

get_document_metadata

Returns full document metadata.

Parameters:

NameTypeRequiredDescription
filenamestringYesThe document filename to get metadata for

Example output:

{
  "title": "Architecture Guide",
  "author": "Jane Doe",
  "subject": "System Design",
  "creator": "LaTeX",
  "producer": "pdfTeX",
  "creation_date": "Thu May 15 10:30:00 2025",
  "modification_date": "Thu May 15 10:30:00 2025",
  "pages": 42,
  "file_size_bytes": 1048576,
  "pdf_version": "1.5"
}

get_document_outline

Extracts the document outline (table of contents / bookmarks) as a structured list.

Parameters:

NameTypeRequiredDescription
filenamestringYesThe document filename to extract outline from

extract_tables

Extracts tables from a document as structured data.

Parameters:

NameTypeRequiredDescription
filenamestringYesThe document filename to extract tables from
pagenumberNoSpecific page to extract from. Omit for all pages.

extract_images

Extracts images from a document as base64-encoded data. Returns up to 10 images per call.

Parameters:

NameTypeRequiredDescription
filenamestringYesThe document filename to extract images from
pagenumberNoSpecific page to extract from. Omit for all pages.

Example output:

[
  {
    "page": 1,
    "index": 0,
    "format": "jpeg",
    "width": 800,
    "height": 600,
    "data_base64": "/9j/4AAQSkZJRg..."
  }
]

read_url

Downloads a document from a URL and extracts its text content. Maximum file size: 50MB.

Parameters:

NameTypeRequiredDescription
urlstringYesThe URL of the document to download and read
pagesstringNoPage ranges to extract, e.g. "1-5". Omit for full text.

Example input:

{
  "url": "https://example.com/report.pdf",
  "pages": "1-3"
}

ocr_document

Forces OCR (Optical Character Recognition) on a PDF document using tesseract. Useful for scanned/image-based PDFs or when pdftotext returns garbled text. Requires tesseract and pdftoppm to be installed.

Note: read_document already auto-detects image-based PDFs and falls back to OCR. Use ocr_document when you want to force OCR regardless, or need to specify a non-English language.

Parameters:

NameTypeRequiredDescription
filenamestringYesThe PDF filename to OCR
pagenumberNoSpecific page to OCR (1-based). Omit to OCR all pages.
languagestringNoTesseract language code (default: eng). Use spa for Spanish, fra for French, etc.

Example input:

{
  "filename": "scanned-contract.pdf",
  "page": 1,
  "language": "spa"
}

read_image

Extracts text from an image file using OCR. Supports PNG, JPG, and TIFF. Requires tesseract to be installed.

Parameters:

NameTypeRequiredDescription
filenamestringYesThe image filename to read (PNG, JPG, TIFF)
languagestringNoTesseract language code (default: eng).

Example input:

{
  "filename": "receipt.png",
  "language": "eng"
}

🔒 Security

  • Directory-locked: Only files within the configured DOCS_MCP_DIR are accessible
  • Path traversal prevention: Filenames are sanitized to their base component; ../ is rejected
  • Extension filter: Only supported document formats are served; requests for other file types are denied
  • No write operations: The server is strictly read-only
  • URL downloads: Limited to 50MB, Content-Type validated, temp files cleaned up immediately

🛠️ Development

make build     # Build the binary
make test      # Run tests with race detector
make clean     # Remove build artifacts

Project structure

go-docs-mcp/
  main.go              # MCP server setup, 12 tool registrations
  internal/
    pdf/
      reader.go        # Document extraction, caching, search, metadata, images, OCR
  Makefile             # Build targets
  go.mod               # Module definition

📄 License

MIT - Copyright 2026 Drolosoft


💛 Support

Buy Me A Coffee


DrolosoftTools we wish existed

Server Config

{
  "mcpServers": {
    "docs": {
      "command": "go-docs-mcp",
      "env": {
        "DOCS_MCP_DIR": "/path/to/your/documents"
      }
    }
  }
}
Recommend Servers
TraeBuild with Free GPT-4.1 & Claude 3.7. Fully MCP-Ready.
Amap Maps高德地图官方 MCP Server
AiimagemultistyleA Model Context Protocol (MCP) server for image generation and manipulation using fal.ai's Stable Diffusion model.
DeepChatYour AI Partner on Desktop
RedisA Model Context Protocol server that provides access to Redis databases. This server enables LLMs to interact with Redis key-value stores through a set of standardized tools.
CursorThe AI Code Editor
MCP AdvisorMCP Advisor & Installation - Use the right MCP server for your needs
EdgeOne Pages MCPAn MCP service designed for deploying HTML content to EdgeOne Pages and obtaining an accessible public URL.
ChatWiseThe second fastest AI chatbot™
Tavily Mcp
Jina AI MCP ToolsA Model Context Protocol (MCP) server that integrates with Jina AI Search Foundation APIs.
MiniMax MCPOfficial MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
Baidu Map百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。
Howtocook Mcp基于Anduin2017 / HowToCook (程序员在家做饭指南)的mcp server,帮你推荐菜谱、规划膳食,解决“今天吃什么“的世纪难题; Based on Anduin2017/HowToCook (Programmer's Guide to Cooking at Home), MCP Server helps you recommend recipes, plan meals, and solve the century old problem of "what to eat today"
Visual Studio Code - Open Source ("Code - OSS")Visual Studio Code
BlenderBlenderMCP connects Blender to Claude AI through the Model Context Protocol (MCP), allowing Claude to directly interact with and control Blender. This integration enables prompt assisted 3D modeling, scene creation, and manipulation.
Zhipu Web SearchZhipu Web Search MCP Server is a search engine specifically designed for large models. It integrates four search engines, allowing users to flexibly compare and switch between them. Building upon the web crawling and ranking capabilities of traditional search engines, it enhances intent recognition capabilities, returning results more suitable for large model processing (such as webpage titles, URLs, summaries, site names, site icons, etc.). This helps AI applications achieve "dynamic knowledge acquisition" and "precise scenario adaptation" capabilities.
Y GuiA web-based graphical interface for AI chat interactions with support for multiple AI models and MCP (Model Context Protocol) servers.
Serper MCP ServerA Serper MCP Server
WindsurfThe new purpose-built IDE to harness magic
Playwright McpPlaywright MCP server