Sponsored by Deepsite.site

MCP Iceberg Catalog

Created By
ahodroj8 months ago
MCP server for interacting with Apache Iceberg catalog from Claude, enabling data lake discovery and metadata search through a LLM prompt.
Content

MCP Iceberg Catalog

smithery badge

A MCP (Model Context Protocol) server implementation for interacting with Apache Iceberg. This server provides a SQL interface for querying and managing Iceberg tables through Claude desktop.

Claude Desktop as your Iceberg Data Lake Catalog

image

How to Install in Claude Desktop

Installing via Smithery

To install MCP Iceberg Catalog for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install @ahodroj/mcp-iceberg-service --client claude
  1. Prerequisites

    • Python 3.10 or higher
    • UV package installer (recommended) or pip
    • Access to an Iceberg REST catalog and S3-compatible storage
  2. How to install in Claude Desktop Add the following configuration to claude_desktop_config.json:

{
  "mcpServers": {
    "iceberg": {
      "command": "uv",
      "args": [
        "--directory",
        "PATH_TO_/mcp-iceberg-service",
        "run",
        "mcp-server-iceberg"
      ],
      "env": {
        "ICEBERG_CATALOG_URI" : "http://localhost:8181",
        "ICEBERG_WAREHOUSE" : "YOUR ICEBERG WAREHOUSE NAME",
        "S3_ENDPOINT" : "OPTIONAL IF USING S3",
        "AWS_ACCESS_KEY_ID" : "YOUR S3 ACCESS KEY",
        "AWS_SECRET_ACCESS_KEY" : "YOUR S3 SECRET KEY"
      }
    }
  }
}

Design

Architecture

The MCP server is built on three main components:

  1. MCP Protocol Handler

    • Implements the Model Context Protocol for communication with Claude
    • Handles request/response cycles through stdio
    • Manages server lifecycle and initialization
  2. Query Processor

    • Parses SQL queries using sqlparse
    • Supports operations:
      • LIST TABLES
      • DESCRIBE TABLE
      • SELECT
      • INSERT
  3. Iceberg Integration

    • Uses pyiceberg for table operations
    • Integrates with PyArrow for efficient data handling
    • Manages catalog connections and table operations

PyIceberg Integration

The server utilizes PyIceberg in several ways:

  1. Catalog Management

    • Connects to REST catalogs
    • Manages table metadata
    • Handles namespace operations
  2. Data Operations

    • Converts between PyIceberg and PyArrow types
    • Handles data insertion through PyArrow tables
    • Manages table schemas and field types
  3. Query Execution

    • Translates SQL to PyIceberg operations
    • Handles data scanning and filtering
    • Manages result set conversion

Further Implementation Needed

  1. Query Operations

    • Implement UPDATE operations
    • Add DELETE support
    • Support for CREATE TABLE with schema definition
    • Add ALTER TABLE operations
    • Implement table partitioning support
  2. Data Types

    • Support for complex types (arrays, maps, structs)
    • Add timestamp with timezone handling
    • Support for decimal types
    • Add nested field support
  3. Performance Improvements

    • Implement batch inserts
    • Add query optimization
    • Support for parallel scans
    • Add caching layer for frequently accessed data
  4. Security Features

    • Add authentication mechanisms
    • Implement role-based access control
    • Add row-level security
    • Support for encrypted connections
  5. Monitoring and Management

    • Add metrics collection
    • Implement query logging
    • Add performance monitoring
    • Support for table maintenance operations
  6. Error Handling

    • Improve error messages
    • Add retry mechanisms for transient failures
    • Implement transaction support
    • Add data validation
Recommend Servers
TraeBuild with Free GPT-4.1 & Claude 3.7. Fully MCP-Ready.
WindsurfThe new purpose-built IDE to harness magic
ChatWiseThe second fastest AI chatbot™
Tavily Mcp
DeepChatYour AI Partner on Desktop
Baidu Map百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。
EdgeOne Pages MCPAn MCP service designed for deploying HTML content to EdgeOne Pages and obtaining an accessible public URL.
Amap Maps高德地图官方 MCP Server
Playwright McpPlaywright MCP server
MCP AdvisorMCP Advisor & Installation - Use the right MCP server for your needs
CursorThe AI Code Editor
Zhipu Web SearchZhipu Web Search MCP Server is a search engine specifically designed for large models. It integrates four search engines, allowing users to flexibly compare and switch between them. Building upon the web crawling and ranking capabilities of traditional search engines, it enhances intent recognition capabilities, returning results more suitable for large model processing (such as webpage titles, URLs, summaries, site names, site icons, etc.). This helps AI applications achieve "dynamic knowledge acquisition" and "precise scenario adaptation" capabilities.
Jina AI MCP ToolsA Model Context Protocol (MCP) server that integrates with Jina AI Search Foundation APIs.
TimeA Model Context Protocol server that provides time and timezone conversion capabilities. This server enables LLMs to get current time information and perform timezone conversions using IANA timezone names, with automatic system timezone detection.
AiimagemultistyleA Model Context Protocol (MCP) server for image generation and manipulation using fal.ai's Stable Diffusion model.
MiniMax MCPOfficial MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
Howtocook Mcp基于Anduin2017 / HowToCook (程序员在家做饭指南)的mcp server,帮你推荐菜谱、规划膳食,解决“今天吃什么“的世纪难题; Based on Anduin2017/HowToCook (Programmer's Guide to Cooking at Home), MCP Server helps you recommend recipes, plan meals, and solve the century old problem of "what to eat today"
Context7Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors
BlenderBlenderMCP connects Blender to Claude AI through the Model Context Protocol (MCP), allowing Claude to directly interact with and control Blender. This integration enables prompt assisted 3D modeling, scene creation, and manipulation.
Serper MCP ServerA Serper MCP Server
Visual Studio Code - Open Source ("Code - OSS")Visual Studio Code