Sponsored by Deepsite.site

MCPControl

Created By
Cheffromspacea year ago
MCP server for Windows OS automation
Content

MCPControl

Project Status: De-prioritized. This project was created as an experiment, but Windows automation isn't something I'm looking to get deep into. I will review and merge PRs but will not be adding features soon. May revisit in the future. Looking for a maintainer.

MCPControl Logo

Latest Release

Windows control server for the Model Context Protocol, providing programmatic control over system operations including mouse, keyboard, window management, and screen capture functionality.

Note: This project currently supports Windows only.

🔥 Why MCPControl?

MCPControl bridges the gap between AI models and your desktop, enabling secure, programmatic control of:

  • 🖱️ Mouse movements and clicks
  • ⌨️ Keyboard input and shortcuts
  • 🪟 Window management
  • 📸 Screen capture and analysis
  • 📋 Clipboard operations

🔌 Quick Start

  1. Install Node.js (if not already installed)

    # Visit https://nodejs.org and download the latest LTS version
    
  2. Configure MCP settings in your Claude client:

    {
      "mcpServers": {
        "MCPControl": {
          "command": "npx",
          "args": [
            "--no-cache",
            "-y",
            "mcp-control"
          ]
        }
      }
    }
    
  3. Restart your client and MCPControl will appear in your MCP menu!

Assisted Automation

  • Application Testing: Delegate repetitive UI testing to Claude, allowing AI to navigate through applications and report issues
  • Workflow Automation: Have Claude operate applications on your behalf, handling repetitive tasks while you focus on creative work
  • Form Filling: Let Claude handle data entry tasks with your supervision

AI Experimentation

  • AI Gaming: Watch Claude learn to play simple games through visual feedback
  • Visual Reasoning: Test Claude's ability to navigate visual interfaces and solve visual puzzles
  • Human-AI Collaboration: Explore new interaction paradigms where Claude can see your screen and help with complex tasks

Development and Testing

  • Cross-Application Integration: Bridge applications that don't normally communicate
  • UI Testing Framework: Create robust testing scenarios with visual validation
  • Demo Creation: Automate the creation of product demonstrations

⚠️ IMPORTANT DISCLAIMER

THIS SOFTWARE IS EXPERIMENTAL AND POTENTIALLY DANGEROUS

By using this software, you acknowledge and accept that:

  • Giving AI models direct control over your computer through this tool is inherently risky
  • This software can control your mouse, keyboard, and other system functions which could potentially cause unintended consequences
  • You are using this software entirely at your own risk
  • The creators and contributors of this project accept NO responsibility for any damage, data loss, or other consequences that may arise from using this software
  • This tool should only be used in controlled environments with appropriate safety measures in place

USE AT YOUR OWN RISK

🌟 Features

🪟 Window Management

  • List all windows
  • Get active window info
  • Focus, resize & reposition

🖱️ Mouse Control

  • Precision movement
  • Click & drag operations
  • Scrolling & position tracking

⌨️ Keyboard Control

  • Text input & key combos
  • Key press/release control
  • Hold key functionality

📸 Screen Operations

  • High-quality screenshots
  • Screen size detection
  • Active window capture

🛠️ Development Setup

If you're interested in contributing or building from source, please see CONTRIBUTING.md for detailed instructions.

Development Requirements

To build this project for development, you'll need:

  1. Windows operating system (required for the keysender dependency)
  2. Node.js 18 or later (install using the official Windows installer which includes build tools)
  3. npm package manager
  4. Native build tools:
    • node-gyp: npm install -g node-gyp
    • cmake-js: npm install -g cmake-js

The keysender dependency relies on Windows-specific native modules that require these build tools.

📋 Project Structure

  • /src
    • /handlers - Request handlers and tool management
    • /tools - Core functionality implementations
    • /types - TypeScript type definitions
    • index.ts - Main application entry point

🔖 Repository Branches

  • main - Main development branch with the latest features and changes
  • release - Stable release branch that mirrors the latest stable tag (currently v0.1.22)

Version Installation

You can install specific versions of MCPControl using npm:

# Install the latest stable release (from release branch)
npm install mcp-control

# Install a specific version
npm install mcp-control@0.1.22

📚 Dependencies

🚧 Known Limitations

  • Window minimize/restore operations are currently unsupported
  • Multiple screen functions may not work as expected, depending on setup
  • The get_screenshot utility does not work with the VS Code Extension Cline. See GitHub issue #1865
  • Some operations may require elevated permissions depending on the target application
  • Only Windows is supported
  • MCPControl works best at 1280x720 resolution, single screen. Click accuracy is optimized for this resolution. We're working on an offset/scaling bug and looking for testers or help creating testing tools

👥 Contributing

See CONTRIBUTING.md

Note: As this project is currently in a maintenance-only mode, we're primarily focusing on bug fixes. However, fully implemented and well-tested features will be considered for merging. Looking for maintainers to help keep the project active.

⚖️ License

This project is licensed under the MIT License - see the LICENSE file for details.

📖 References

MseeP.ai Security Assessment Badge

Recommend Servers
TraeBuild with Free GPT-4.1 & Claude 3.7. Fully MCP-Ready.
Jina AI MCP ToolsA Model Context Protocol (MCP) server that integrates with Jina AI Search Foundation APIs.
CursorThe AI Code Editor
BlenderBlenderMCP connects Blender to Claude AI through the Model Context Protocol (MCP), allowing Claude to directly interact with and control Blender. This integration enables prompt assisted 3D modeling, scene creation, and manipulation.
Zhipu Web SearchZhipu Web Search MCP Server is a search engine specifically designed for large models. It integrates four search engines, allowing users to flexibly compare and switch between them. Building upon the web crawling and ranking capabilities of traditional search engines, it enhances intent recognition capabilities, returning results more suitable for large model processing (such as webpage titles, URLs, summaries, site names, site icons, etc.). This helps AI applications achieve "dynamic knowledge acquisition" and "precise scenario adaptation" capabilities.
MiniMax MCPOfficial MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
Howtocook Mcp基于Anduin2017 / HowToCook (程序员在家做饭指南)的mcp server,帮你推荐菜谱、规划膳食,解决“今天吃什么“的世纪难题; Based on Anduin2017/HowToCook (Programmer's Guide to Cooking at Home), MCP Server helps you recommend recipes, plan meals, and solve the century old problem of "what to eat today"
EdgeOne Pages MCPAn MCP service designed for deploying HTML content to EdgeOne Pages and obtaining an accessible public URL.
Visual Studio Code - Open Source ("Code - OSS")Visual Studio Code
DeepChatYour AI Partner on Desktop
Baidu Map百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。
TimeA Model Context Protocol server that provides time and timezone conversion capabilities. This server enables LLMs to get current time information and perform timezone conversions using IANA timezone names, with automatic system timezone detection.
Amap Maps高德地图官方 MCP Server
MCP AdvisorMCP Advisor & Installation - Use the right MCP server for your needs
AiimagemultistyleA Model Context Protocol (MCP) server for image generation and manipulation using fal.ai's Stable Diffusion model.
ChatWiseThe second fastest AI chatbot™
WindsurfThe new purpose-built IDE to harness magic
Y GuiA web-based graphical interface for AI chat interactions with support for multiple AI models and MCP (Model Context Protocol) servers.
Serper MCP ServerA Serper MCP Server
Playwright McpPlaywright MCP server
Tavily Mcp