- nova-act-mcp-server
nova-act-mcp-server
Content
nova-act-mcp-server
nova‑act‑mcp‑server is a zero‑install Model Context Protocol (MCP) server that exposes Amazon Nova Act browser‑automation tools for AI agents.
What's New in v3.2.0
- File-Based Screenshots: Screenshots now save to files instead of inline base64, eliminating MCP message size limits
- Optimized Image Quality: Reduced screenshot quality (60/30) for smaller file sizes (~50KB vs 500KB+)
- Enhanced Error Handling: Improved exception handling for browser operations
- Better Testing: Comprehensive test coverage for screenshot functionality
- MCP Inspector Integration: New testing script for easy validation
Quick Start (uvx)
Step 1: Get a Nova Act API Key
Obtain your API key from Nova Act.
Step 2: Add to MCP Client Configuration
{
"mcpServers": {
"nova_act_browser_tools": {
"command": "uvx",
"args": ["nova-act-mcp-server"],
"env": { "NOVA_ACT_API_KEY": "YOUR_NOVA_ACT_API_KEY_HERE" }
}
}
}
Step 3: Start Using Browser Tools
AI agents can now call tools like start_session, execute_instruction, etc. through any MCP-compatible client such as Claude Desktop or VS Code.
Core Tools Overview
| Tool | Description | Key Parameters |
|---|---|---|
start_session | Starts a new browser session | url, headless=True |
execute_instruction | Runs a natural language instruction in the browser | session_id, instruction |
inspect_browser | Gets a screenshot and state of the current browser | session_id, include_screenshot=True |
end_session | Closes a browser session and cleans up resources | session_id |
list_browser_sessions | Lists all active browser sessions | None |
Local Development & Testing
Setup
# Clone the repository
git clone https://github.com/madtank/nova-act-mcp.git
cd nova-act-mcp
# Create and activate virtual environment
uv venv
source .venv/bin/activate # On Windows: .venv\Scripts\activate
# Install with development dependencies
uv pip install -e .[dev]
Running Tests
# Run all tests (integration tests require NOVA_ACT_API_KEY)
pytest
# Run only unit tests (no API key required)
pytest tests/unit
# Run integration tests (requires API key)
NOVA_ACT_API_KEY="your_key_here" pytest tests/integration
Running Locally with MCP Inspector UI
# Start the server with the MCP Inspector
npx @modelcontextprotocol/inspector -e PYTHONUNBUFFERED=1 -e NOVA_ACT_API_KEY="YOUR_KEY" -- python -m nova_mcp_server
Then visit http://localhost:6274 in your browser. For optimal experience, set the Inspector UI timeout to 60 seconds for start_session operations.
Advanced Tools
| Tool | Description |
|---|---|
fetch_file | Downloads a file from the current page |
view_html_log | Gets the HTML content of the current page |
compress_logs | Creates and returns a ZIP of session logs and screenshots |
License
Recommend Servers
TraeBuild with Free GPT-4.1 & Claude 3.7. Fully MCP-Ready.
ChatWiseThe second fastest AI chatbot™
Zhipu Web SearchZhipu Web Search MCP Server is a search engine specifically designed for large models. It integrates four search engines, allowing users to flexibly compare and switch between them. Building upon the web crawling and ranking capabilities of traditional search engines, it enhances intent recognition capabilities, returning results more suitable for large model processing (such as webpage titles, URLs, summaries, site names, site icons, etc.). This helps AI applications achieve "dynamic knowledge acquisition" and "precise scenario adaptation" capabilities.
Visual Studio Code - Open Source ("Code - OSS")Visual Studio Code
MiniMax MCPOfficial MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
CursorThe AI Code Editor
Playwright McpPlaywright MCP server
AiimagemultistyleA Model Context Protocol (MCP) server for image generation and manipulation using fal.ai's Stable Diffusion model.
Baidu Map百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。
WindsurfThe new purpose-built IDE to harness magic
MCP AdvisorMCP Advisor & Installation - Use the right MCP server for your needs
TimeA Model Context Protocol server that provides time and timezone conversion capabilities. This server enables LLMs to get current time information and perform timezone conversions using IANA timezone names, with automatic system timezone detection.
Serper MCP ServerA Serper MCP Server
Jina AI MCP ToolsA Model Context Protocol (MCP) server that integrates with Jina AI Search Foundation APIs.
Amap Maps高德地图官方 MCP Server
Howtocook Mcp基于Anduin2017 / HowToCook (程序员在家做饭指南)的mcp server,帮你推荐菜谱、规划膳食,解决“今天吃什么“的世纪难题;
Based on Anduin2017/HowToCook (Programmer's Guide to Cooking at Home), MCP Server helps you recommend recipes, plan meals, and solve the century old problem of "what to eat today"
EdgeOne Pages MCPAn MCP service designed for deploying HTML content to EdgeOne Pages and obtaining an accessible public URL.
Context7Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors
BlenderBlenderMCP connects Blender to Claude AI through the Model Context Protocol (MCP), allowing Claude to directly interact with and control Blender. This integration enables prompt assisted 3D modeling, scene creation, and manipulation.
DeepChatYour AI Partner on Desktop
Tavily Mcp