
MCP YOLOE: Zero-Shot Object Detection & Segmentation

Created By
rjn32s, 25 days ago
Provide your AI agents with "eyes." This server enables open-vocabulary object detection and instance segmentation using naturally phrased text prompts (e.g., "detect the laptop next to the coffee").
Overview

MCP-YOLO

MCP-YOLO is a Model Context Protocol (MCP) server that gives AI agents computer vision capabilities. Unlike traditional YOLO models, which detect only a fixed list of object classes, this server uses zero-shot learning to detect and segment objects you describe in a text prompt.

Key Features

  • Zero-Shot Detection: Detect arbitrary objects using natural language prompts.
  • Precision Segmentation: Get exact polygon masks for every detected object.
  • Flexible Inputs: Works with local file paths, remote image URLs, and Base64 strings.
  • Agent-First: Designed specifically for integration with Claude, IDEs, and autonomous workspace agents.
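The three input kinds listed under Flexible Inputs can be told apart with a small heuristic. The sketch below is only an illustration of that idea: the function name `classify_image_source` and the 64-character threshold are assumptions made for this example, not part of the server's documented API.

```python
import base64
import binascii
from urllib.parse import urlparse


def classify_image_source(source: str) -> str:
    """Return 'url', 'base64', or 'path' for an image reference string.

    Hypothetical helper: the real server may use different rules.
    """
    parsed = urlparse(source)
    if parsed.scheme in ("http", "https") and parsed.netloc:
        return "url"
    # Data URIs carry their Base64 payload after a "data:image/..." prefix.
    if source.startswith("data:image/"):
        return "base64"
    # Assumed threshold: short strings are far more likely to be file names
    # than encoded images, so only long strings are tested as Base64.
    if len(source) >= 64:
        try:
            # validate=True rejects characters outside the Base64 alphabet,
            # which rules out typical paths containing '.' or '\\'.
            base64.b64decode(source, validate=True)
            return "base64"
        except (binascii.Error, ValueError):
            pass
    return "path"
```

A caller could use the returned label to decide whether to download the image, decode it, or open it from disk before running detection.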

Example Usage

Ask your agent to:

"Find the 'vintage typewriter' in this image and give me its exact coordinates."
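When the agent reports "exact coordinates", it will often need to turn a polygon mask into a bounding box or an area. The following is a minimal sketch that assumes a mask arrives as a list of `(x, y)` pixel pairs; the server's actual response format is not documented here, and both function names are hypothetical.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def polygon_to_bbox(polygon: List[Point]) -> Tuple[float, float, float, float]:
    """Return (x_min, y_min, x_max, y_max) enclosing the polygon."""
    if not polygon:
        raise ValueError("polygon must contain at least one point")
    xs = [x for x, _ in polygon]
    ys = [y for _, y in polygon]
    return (min(xs), min(ys), max(xs), max(ys))


def polygon_area(polygon: List[Point]) -> float:
    """Return the polygon's area in square pixels using the shoelace formula."""
    total = 0.0
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]  # wrap around to close the polygon
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0
```

For a rectangular mask with corners `(10, 20)`, `(110, 20)`, `(110, 70)`, and `(10, 70)`, the bounding box is `(10, 20, 110, 70)` and the area is 5000 square pixels.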

Performance

Uses the YOLOE26-L architecture, with a reported precision of 55.0 mAP and inference times of about 6.2 ms on T4 GPUs.

Server Config

{
  "mcpServers": {
    "mcp-yolo": {
      "command": "uvx",
      "args": [
        "mcp-yolo"
      ]
    }
  }
}
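
If the client already has other servers configured, the entry above has to be merged into the existing `mcpServers` object instead of replacing it. A minimal sketch of that merge follows; the helper name `add_server` is an assumption for illustration, and where a given client stores its configuration file is not covered here.

```python
import json

# The same entry shown in the Server Config section above.
SERVER_ENTRY = {"command": "uvx", "args": ["mcp-yolo"]}


def add_server(config: dict, name: str = "mcp-yolo", entry: dict = SERVER_ENTRY) -> dict:
    """Return a copy of a client config with the given server entry added.

    Hypothetical helper: existing servers are preserved, and the input
    dictionary is not modified.
    """
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers[name] = dict(entry)
    merged["mcpServers"] = servers
    return merged


if __name__ == "__main__":
    existing = {"mcpServers": {"other-server": {"command": "npx", "args": ["other"]}}}
    print(json.dumps(add_server(existing), indent=2))
```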