- 🚀 operative.sh web-eval-agent MCP Server
🚀 operative.sh web-eval-agent MCP Server
An MCP server that autonomously evaluates web applications.
Content
🚀 operative.sh web-eval-agent MCP Server
Let the coding agent debug itself, you've got better things to do.

🔥 Supercharge Your Debugging
operative.sh's MCP Server launches a browser-use powered agent to autonomously execute and debug web apps directly in your code editor.
⚡ Features
- 🌐 Navigate your webapp using BrowserUse (2x faster with operative backend)
- 📊 Capture network traffic - requests are intelligently filtered and returned into the context window
- 🚨 Collect console errors - captures logs & errors
- 🤖 Autonomous debugging - the Cursor agent calls the web QA agent mcp server to test if the code it wrote works as epected end-to-end.
🧰 MCP Tool Reference
| Tool | Purpose |
|---|---|
web_eval_agent | 🤖 Automated UX evaluator that drives the browser, captures screenshots, console & network logs, and returns a rich UX report. |
setup_browser_state | 🔒 Opens an interactive (non-headless) browser so you can sign in once; the saved cookies/local-storage are reused by subsequent web_eval_agent runs. |
Key arguments
-
web_eval_agenturl(required) – address of the running app (e.g.http://localhost:3000)task(required) – natural-language description of what to test ("run through the signup flow and note any UX issues")headless_browser(optional, defaultfalse) – set totrueto hide the browser window
-
setup_browser_stateurl(optional) – page to open first (handy to land directly on a login screen)
You can trigger these tools straight from your IDE chat, for example:
Evaluate my app at http://localhost:3000 – run web_eval_agent with the task "Try the full signup flow and report UX issues".
🏁 Quick Start (macOS/Linux)
- Pre-requisites (typically not needed):
- brew:
/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)" - npm: (
brew install npm) - jq:
brew install jq
- Run the installer after getting an api key (free)
- Installs playwright
- Installs uv
- Inserts JSON into your code editor (Cursor/Cline/Windsurf) for you!
curl -LSf https://operative.sh/install.sh -o install.sh && bash install.sh && rm install.sh
- Visit your favorite IDE and restart to apply the changes
- Send a prompt in chat mode to call the web eval agent tool! e.g.
Test my app on http://localhost:3000. Use web-eval-agent.
🛠️ Manual Installation
- Get your API key at operative.sh/mcp
- Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh
- Source environment variables after installing UV
Mac
source ~/.zshrc
Linux
source ~/.bashrc
- Install playwright:
npm install -g chromium playwright && uvx --with playwright playwright install --with-deps
- Add below JSON to your relevant code editor with api key
- Restart your code editor
🔃 Updating
uv cache clean- refresh MCP server
"web-eval-agent": {
"command": "uvx",
"args": [
"--refresh-package",
"webEvalAgent",
"--from",
"git+https://github.com/Operative-Sh/web-eval-agent.git",
"webEvalAgent"
],
"env": {
"OPERATIVE_API_KEY": "<YOUR_KEY>"
}
}
Operative Discord Server
🛠️ Manual Installation (Mac + Cursor/Cline/Windsurf)
- Get your API key at operative.sh/mcp
- Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh)
- Install playwright:
npm install -g chromium playwright && uvx --with playwright playwright install --with-deps
- Add below JSON to your relevant code editor with api key
- Restart your code editor
Manual Installation (Windows + Cursor/Cline/Windsurf)
We're refining this, please open an issue if you have any issues!
- Do all this in your code editor terminal
curl -LSf https://operative.sh/install.sh -o install.sh && bash install.sh && rm install.sh- Get your API key at operative.sh/mcp
- Install uv
(curl -LsSf https://astral.sh/uv/install.sh | sh) uvx --from git+https://github.com/Operative-Sh/web-eval-agent.git playwright install- Restart code editor
🚨 Issues
- Updates aren't being received in code editors, update or reinstall for latest version: Run
uv cache cleanfor latest - Any issues feel free to open an Issue on this repo or in the discord!
- 5/5 - static apps without changes weren't screencasting, fixed!
uv clean+ restart to get fix
Changelog
- 4/29 - Agent overlay update - pause/play/stop agent run in the browser
📋 Example MCP Server Output Report
📊 Web Evaluation Report for http://localhost:5173 complete!
📝 Task: Test the API-key deletion flow by navigating to the API Keys section, deleting a key, and judging the UX.
🔍 Agent Steps
📍 1. Navigate → http://localhost:5173
📍 2. Click "Login" (button index 2)
📍 3. Click "API Keys" (button index 4)
📍 4. Click "Create Key" (button index 9)
📍 5. Type "Test API Key" (input index 2)
📍 6. Click "Done" (button index 3)
📍 7. Click "Delete" (button index 10)
📍 8. Click "Delete" (confirm index 3)
🏁 Flow tested successfully – UX felt smooth and intuitive.
🖥️ Console Logs (10)
1. [debug] [vite] connecting…
2. [debug] [vite] connected.
3. [info] Download the React DevTools …
…
🌐 Network Requests (10)
1. GET /src/pages/SleepingMasks.tsx 304
2. GET /src/pages/MCPRegistryRegistry.tsx 304
…
⏱️ Chronological Timeline
01:16:23.293 🖥️ Console [debug] [vite] connecting…
01:16:23.303 🖥️ Console [debug] [vite] connected.
01:16:23.312 ➡️ GET /src/pages/SleepingMasks.tsx
01:16:23.318 ⬅️ 304 /src/pages/SleepingMasks.tsx
…
01:17:45.038 🤖 🏁 Flow finished – deletion verified
01:17:47.038 🤖 📋 Conclusion repeated above
👁️ See the "Operative Control Center" dashboard for live logs.
Star History
Built with <3 @ operative.sh
Recommend Servers
TraeBuild with Free GPT-4.1 & Claude 3.7. Fully MCP-Ready.
AiimagemultistyleA Model Context Protocol (MCP) server for image generation and manipulation using fal.ai's Stable Diffusion model.
Context7Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors
TimeA Model Context Protocol server that provides time and timezone conversion capabilities. This server enables LLMs to get current time information and perform timezone conversions using IANA timezone names, with automatic system timezone detection.
Jina AI MCP ToolsA Model Context Protocol (MCP) server that integrates with Jina AI Search Foundation APIs.
DeepChatYour AI Partner on Desktop
WindsurfThe new purpose-built IDE to harness magic
Baidu Map百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。
MiniMax MCPOfficial MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
Zhipu Web SearchZhipu Web Search MCP Server is a search engine specifically designed for large models. It integrates four search engines, allowing users to flexibly compare and switch between them. Building upon the web crawling and ranking capabilities of traditional search engines, it enhances intent recognition capabilities, returning results more suitable for large model processing (such as webpage titles, URLs, summaries, site names, site icons, etc.). This helps AI applications achieve "dynamic knowledge acquisition" and "precise scenario adaptation" capabilities.
Amap Maps高德地图官方 MCP Server
ChatWiseThe second fastest AI chatbot™
BlenderBlenderMCP connects Blender to Claude AI through the Model Context Protocol (MCP), allowing Claude to directly interact with and control Blender. This integration enables prompt assisted 3D modeling, scene creation, and manipulation.
MCP AdvisorMCP Advisor & Installation - Use the right MCP server for your needs
Serper MCP ServerA Serper MCP Server
EdgeOne Pages MCPAn MCP service designed for deploying HTML content to EdgeOne Pages and obtaining an accessible public URL.
Visual Studio Code - Open Source ("Code - OSS")Visual Studio Code
Howtocook Mcp基于Anduin2017 / HowToCook (程序员在家做饭指南)的mcp server,帮你推荐菜谱、规划膳食,解决“今天吃什么“的世纪难题;
Based on Anduin2017/HowToCook (Programmer's Guide to Cooking at Home), MCP Server helps you recommend recipes, plan meals, and solve the century old problem of "what to eat today"
CursorThe AI Code Editor
Playwright McpPlaywright MCP server
Tavily Mcp