Sponsored by Deepsite.site

📚 arXiv MCP Server

Created By
Tejas2427 months ago
A streamlined Model Context Protocol server that connects AI assistants to arXiv's vast collection of academic papers. Search, analyze, and download research papers directly from your AI workflow.
Content

📚 arXiv MCP Server

Python MCP Compatible arXiv API License Code Quality CI/CD

Access the world's largest repository of academic papers through the Model Context Protocol

A streamlined Model Context Protocol server that connects AI assistants to arXiv's vast collection of academic papers. Search, analyze, and download research papers directly from your AI workflow.

🚀 Quick Start

Prerequisites

  • Python 3.12+
  • uv package manager

Installation

# Clone and setup
git clone https://github.com/tejas242/arxiv-mcp.git
cd arxiv-mcp
uv sync

# Test the server
uv run main.py

🛠️ Available Functions

FunctionStatusDescriptionParameters
search_papersWorkingSearch arXiv papers with flexible query syntaxquery, max_results, sort_by, sort_order
get_paper_detailsWorkingRetrieve complete metadata for any arXiv paperarxiv_id
build_advanced_queryWorkingConstruct complex search queries with multiple fieldstitle_keywords, author_name, category, abstract_keywords
get_arxiv_categoriesWorkingList all available arXiv subject categoriesNone
search_by_author⚠️ LimitedFind papers by specific author (use search_papers instead)author_name, max_results
search_by_category⚠️ LimitedBrowse papers by category (use search_papers instead)category, max_results
download_paper_pdf🔧 Needs FixDownload paper PDFs (redirect handling issue)arxiv_id, save_path

Function Details

✅ Fully Working Functions

search_papers - The primary search function

  • Supports full arXiv query syntax
  • Handles keywords, authors, categories, titles
  • Configurable sorting and pagination
  • Returns formatted results with abstracts and links

get_paper_details - Detailed paper information

  • Complete metadata extraction
  • Author information with affiliations
  • Category classifications and links
  • Publication dates and updates

build_advanced_query - Query construction helper

  • Combines multiple search criteria
  • Supports title, author, category, and abstract searches
  • Returns properly formatted query strings

get_arxiv_categories - Category reference

  • Complete list of arXiv subject categories
  • Descriptions for each category
  • Helpful for constructing targeted searches

⚠️ Limited Functions (Workarounds Available)

search_by_author - Use search_papers('au:"Author Name"') instead search_by_category - Use search_papers('cat:category_code') instead

🔧 Functions Needing Fixes

download_paper_pdf - HTTP redirect handling needs improvement

  • Currently fails due to HTTPS/HTTP redirect issues
  • PDFs can be accessed directly via the links provided in search results

⚙️ Configuration

Claude Desktop Setup

Configuration Instructions

Add to your Claude Desktop config file:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%/Claude/claude_desktop_config.json

{
  "mcpServers": {
    "arxiv-mcp": {
      "command": "uv",
      "args": [
        "--directory",
        "/absolute/path/to/arxiv-mcp",
        "run",
        "main.py"
      ]
    }
  }
}

VS Code MCP Extension

VS Code Configuration
{
  "mcp": {
    "servers": {
      "arxiv-mcp": {
        "command": "uv",
        "args": ["--directory", "/path/to/arxiv-mcp", "run", "main.py"]
      }
    }
  }
}

💡 Usage Examples

Core Search Operations

# Search for papers about transformers
search_papers("transformer architecture")

# Advanced query with specific fields
search_papers('ti:"attention mechanism" AND cat:cs.LG')

# Author-specific search (recommended approach)
search_papers('au:"Geoffrey Hinton"')

# Category browsing (recommended approach)
search_papers('cat:cs.AI')

Research Workflow

# 1. Find the famous "Attention" paper
search_papers('ti:"Attention Is All You Need"')
get_paper_details("1706.03762")

# 2. Explore related work
search_papers("transformer neural networks")

# 3. Build complex queries
query = build_advanced_query(
    title_keywords="few-shot learning",
    author_name="Tom Brown",
    category="cs.LG"
)
search_papers(query)

📊 arXiv Categories Reference

Popular Categories
CodeDescriptionExample Topics
cs.AIArtificial IntelligenceMachine learning, neural networks, AI theory
cs.LGMachine LearningDeep learning, reinforcement learning, statistical learning
cs.CVComputer VisionImage processing, object detection, visual recognition
cs.CLComputation and LanguageNLP, language models, text processing
cs.CRCryptography and SecuritySecurity protocols, encryption, privacy
stat.MLMachine Learning (Statistics)Statistical learning theory, Bayesian methods
physics.gen-phGeneral PhysicsTheoretical physics, quantum mechanics
math.NANumerical AnalysisComputational mathematics, algorithms
q-bio.NCQuantitative BiologyNeuroscience, computational biology

Use get_arxiv_categories() for the complete list of available categories.

🧪 Testing Results

Based on comprehensive testing of all functions:

Working Functions Limited Functions Needs Fix

✅ Reliable Functions

  • Paper search with keywords, authors, categories: 100% success rate
  • Paper detail retrieval: Complete metadata extraction working
  • Query construction: All syntax combinations supported
  • Category listing: All arXiv categories accessible
  • Author search: Use search_papers('au:"Author Name"') instead of search_by_author()
  • Category browsing: Use search_papers('cat:category') instead of search_by_category()

🔧 Known Issues

  • PDF downloads: Redirect handling needs improvement (PDFs accessible via direct links)

🔧 Development

Project Structure

arxiv-mcp/
├── src/arxiv_mcp/          # Main package
│   ├── server.py           # MCP server implementation
│   ├── arxiv_client.py     # arXiv API wrapper
│   ├── models.py           # Pydantic data models
│   └── utils.py            # Helper functions
├── tests/                  # Test suite
├── main.py                 # Entry point
└── pyproject.toml         # Project config

Running Tests

uv run pytest tests/ -v

Debug Mode

# Enable detailed logging
PYTHONPATH=src uv run python -c "
import logging
logging.basicConfig(level=logging.DEBUG)
from arxiv_mcp.server import main
main()
"

⚠️ Troubleshooting

Common Issues & Solutions

Server Not Detected

  • ✅ Verify absolute paths in MCP config
  • ✅ Test server runs: uv run main.py
  • ✅ Restart Claude Desktop after config changes

Search Issues

  • ✅ Use arXiv query syntax (see examples above)
  • ✅ Check category names: get_arxiv_categories()
  • ✅ Try broader search terms
  • ✅ Use search_papers() instead of specific search functions

PDF Download Failures

  • ✅ Access PDFs via links in search results
  • ✅ Check internet connection
  • ✅ Verify arXiv ID format (e.g., "1706.03762")

🙏 Acknowledgments


GitHub Issues Contribute



Made with ⚡ by screenager

Recommend Servers
TraeBuild with Free GPT-4.1 & Claude 3.7. Fully MCP-Ready.
MiniMax MCPOfficial MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
AiimagemultistyleA Model Context Protocol (MCP) server for image generation and manipulation using fal.ai's Stable Diffusion model.
Playwright McpPlaywright MCP server
Context7Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors
Amap Maps高德地图官方 MCP Server
Visual Studio Code - Open Source ("Code - OSS")Visual Studio Code
TimeA Model Context Protocol server that provides time and timezone conversion capabilities. This server enables LLMs to get current time information and perform timezone conversions using IANA timezone names, with automatic system timezone detection.
Tavily Mcp
Zhipu Web SearchZhipu Web Search MCP Server is a search engine specifically designed for large models. It integrates four search engines, allowing users to flexibly compare and switch between them. Building upon the web crawling and ranking capabilities of traditional search engines, it enhances intent recognition capabilities, returning results more suitable for large model processing (such as webpage titles, URLs, summaries, site names, site icons, etc.). This helps AI applications achieve "dynamic knowledge acquisition" and "precise scenario adaptation" capabilities.
Baidu Map百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。
ChatWiseThe second fastest AI chatbot™
Serper MCP ServerA Serper MCP Server
CursorThe AI Code Editor
DeepChatYour AI Partner on Desktop
WindsurfThe new purpose-built IDE to harness magic
MCP AdvisorMCP Advisor & Installation - Use the right MCP server for your needs
EdgeOne Pages MCPAn MCP service designed for deploying HTML content to EdgeOne Pages and obtaining an accessible public URL.
Howtocook Mcp基于Anduin2017 / HowToCook (程序员在家做饭指南)的mcp server,帮你推荐菜谱、规划膳食,解决“今天吃什么“的世纪难题; Based on Anduin2017/HowToCook (Programmer's Guide to Cooking at Home), MCP Server helps you recommend recipes, plan meals, and solve the century old problem of "what to eat today"
BlenderBlenderMCP connects Blender to Claude AI through the Model Context Protocol (MCP), allowing Claude to directly interact with and control Blender. This integration enables prompt assisted 3D modeling, scene creation, and manipulation.
Jina AI MCP ToolsA Model Context Protocol (MCP) server that integrates with Jina AI Search Foundation APIs.