Selenium MCP Server

Created By

Raghvendra-Raghuvanshia year ago

Model Context Protocol server for Selenium WebDriver - enables LLMs to control web browsers

Content

Selenium MCP Server

A Model Context Protocol (MCP) server for browser automation using Selenium WebDriver. Enables LLMs and teams to automate browsers via a standard protocol, similar to Playwright MCP.

🚀 Key Features

Multi-browser support (Chrome, Firefox, Edge, Safari)
Team-friendly: visible browser by default
MCP-compliant API (navigate, click, type, screenshot, etc.)
Easy onboarding for teams and CI

📦 Exposing MCP Server for Team Usage

1. Publish to npm (for team-wide access)

Ensure your package.json is correct (see below)
Run:

npm publish --access public

Or for private registry:

npm publish --access restricted

2. Install and Run (for any team)

Global install (recommended for teams)

npm install -g selenium-mcp-server
selenium-mcp-server --browser chrome

Or use npx (no install needed)

npx selenium-mcp-server@latest --browser chrome

3. Configure in MCP Client (e.g., Cursor, CI, or custom)

Add to your .cursor/mcp.json or equivalent:

{
  "mcpServers": {
    "selenium": {
      "command": "selenium-mcp-server",
      "args": ["--browser", "chrome"]
    }
  }
}

For CI, run the server as a background process and point clients to its port.

📝 Onboarding for New Teams

Install Node.js 18+ and Java 11+
Install the MCP server globally or use npx
Share the npm package name and README with your team
Provide example configs and usage (see above)
For support, use the GitHub repo issues page

Example `package.json` for Publishing

{
  "name": "selenium-mcp-server",
  "version": "1.0.0",
  "description": "Selenium MCP server for browser automation",
  "main": "index.js",
  "bin": { "selenium-mcp-server": "index.js" },
  "repository": { "type": "git", "url": "https://github.com/your-org/selenium-mcp-server.git" },
  "author": "Your Name",
  "license": "MIT"
}

📋 Available Tools

browser_navigate - Navigate to any URL
browser_snapshot - Get page accessibility snapshot with element references
browser_click - Click on elements using references from snapshots
browser_type - Type text into form fields with optional submission
browser_wait_for - Wait for time, text to appear, or text to disappear
browser_take_screenshot - Capture screenshots of the current page

🛠️ Installation & Setup

For Cursor Users

Add to Cursor Settings:
- Go to Cursor Settings → MCP → Add new MCP Server
- Name: selenium-mcp
- Command type: command
- Command: npx
- Arguments: ["selenium-mcp-server@latest"]
Alternative JSON Configuration:

{
  "mcpServers": {
    "selenium-mcp": {
      "command": "npx",
      "args": ["selenium-mcp-server@latest"]
    }
  }
}

For Other MCP Clients

Use the same configuration pattern with your MCP client:

{
  "mcpServers": {
    "selenium-mcp": {
      "command": "npx",
      "args": ["selenium-mcp-server@latest"]
    }
  }
}

Alternative Installation Methods

Click to see other installation options

Method 1: Local Installation

npm install selenium-mcp-server

Method 2: Global Installation

npm install -g selenium-mcp-server

Quick Start

🎯 Zero-Installation (Recommended)

1. Add Configuration

Create .cursor/mcp.json in your project root:

{
  "mcpServers": {
    "selenium": {
      "command": "npx",
      "args": [
        "selenium-mcp-server@latest",
        "--browser", "chrome"
      ]
    }
  }
}

2. Restart Cursor

Quit Cursor completely
Restart Cursor
npx will automatically download and run the server

Alternative Setup Methods

Option A: Local Installation

1. Install in your project

npm install selenium-mcp-server

2. Configure Cursor

{
  "mcpServers": {
    "selenium": {
      "command": "node",
      "args": ["./node_modules/selenium-mcp-server/dist/index.js", "--browser", "chrome", "--headless"]
    }
  }
}

Option B: Global Installation

1. Install globally

npm install -g selenium-mcp-server

2. Configure Cursor

{
  "mcpServers": {
    "selenium": {
      "command": "selenium-mcp",
      "args": ["--browser", "chrome", "--headless"]
    }
  }
}

3. Use in Cursor

Ask Cursor to perform browser automation:

"Navigate to google.com and take a screenshot"
"Fill out the contact form on example.com"
"Extract all links from the current page"

Configuration Options

selenium-mcp [options]

Options:
  --browser <browser>        Browser to use (chrome, firefox, edge, safari) (default: "chrome")
  --headless                 Run browser in headless mode
  --executable-path <path>   Path to browser executable
  --user-data-dir <path>     Path to user data directory
  --isolated                 Keep browser profile in memory
  --viewport-size <size>     Browser viewport size (e.g., 1280,720)
  --output-dir <path>        Path to directory for output files
  -h, --help                 Display help for command

Available Tools

browser_navigate - Navigate to URLs
browser_navigate_back - Go back in history
browser_navigate_forward - Go forward in history

Page Interaction

browser_click - Click on elements
browser_type - Type text into inputs
browser_hover - Hover over elements
browser_drag - Drag and drop elements
browser_select_option - Select dropdown options
browser_press_key - Press keyboard keys

Information Gathering

browser_snapshot - Capture accessibility tree
browser_take_screenshot - Take screenshots
browser_network_requests - List network requests
browser_console_messages - Get console messages

Tab Management

browser_tab_list - List open tabs
browser_tab_new - Open new tabs
browser_tab_select - Switch tabs
browser_tab_close - Close tabs

Utilities

browser_wait_for - Wait for elements/text
browser_file_upload - Upload files
browser_handle_dialog - Handle alerts/dialogs
browser_resize - Resize browser window
browser_pdf_save - Save page as PDF

Usage Examples

// In Cursor, ask:
"Navigate to https://example.com and take a screenshot"

Form Interaction

// In Cursor, ask:
"Go to the contact page and fill out the form with name 'John Doe' and email 'john@example.com'"

Data Extraction

// In Cursor, ask:
"Visit the product page and extract all product names and prices"

Requirements

Node.js 18+
Java 11+ (for Selenium WebDriver)
Browser drivers (automatically managed by WebDriverManager)

Configuration Examples

Local Development

{
  "mcpServers": {
    "selenium": {
      "command": "selenium-mcp",
      "args": ["--browser", "chrome"]
    }
  }
}

Headless Production

{
  "mcpServers": {
    "selenium": {
      "command": "selenium-mcp",
      "args": [
        "--browser", "chrome",
        "--headless",
        "--viewport-size", "1920,1080"
      ]
    }
  }
}

Custom Browser Path

{
  "mcpServers": {
    "selenium": {
      "command": "selenium-mcp",
      "args": [
        "--browser", "chrome",
        "--executable-path", "/path/to/chrome"
      ]
    }
  }
}

Troubleshooting

Browser Not Found

If you get browser not found errors:

Install the browser (Chrome, Firefox, etc.)
Use --executable-path to specify browser location
Check that Java is installed and accessible

Permission Issues

On macOS, you might need to grant accessibility permissions:

Go to System Preferences > Security & Privacy > Privacy
Select "Accessibility"
Add your terminal application

Memory Issues

For large pages or long-running sessions:

{
  "mcpServers": {
    "selenium": {
      "command": "selenium-mcp",
      "args": ["--isolated", "--headless"]
    }
  }
}

Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests
Submit a pull request

License

MIT License - see LICENSE file for details.

Playwright MCP - Similar server using Playwright
Model Context Protocol - The protocol specification
Selenium WebDriver - The underlying browser automation framework

References

For more, see CONTRIBUTING.md and TEAM_SETUP_GUIDE.md

Recommend Servers

TraeBuild with Free GPT-4.1 & Claude 3.7. Fully MCP-Ready.

BlenderBlenderMCP connects Blender to Claude AI through the Model Context Protocol (MCP), allowing Claude to directly interact with and control Blender. This integration enables prompt assisted 3D modeling, scene creation, and manipulation.

Baidu Map百度地图核心API现已全面兼容MCP协议，是国内首家兼容MCP协议的地图服务商。

AiimagemultistyleA Model Context Protocol (MCP) server for image generation and manipulation using fal.ai's Stable Diffusion model.

Tavily Mcp

Serper MCP ServerA Serper MCP Server

Howtocook Mcp基于Anduin2017 / HowToCook （程序员在家做饭指南）的mcp server，帮你推荐菜谱、规划膳食，解决“今天吃什么“的世纪难题； Based on Anduin2017/HowToCook (Programmer's Guide to Cooking at Home), MCP Server helps you recommend recipes, plan meals, and solve the century old problem of "what to eat today"

Visual Studio Code - Open Source ("Code - OSS")Visual Studio Code

DeepChatYour AI Partner on Desktop

WindsurfThe new purpose-built IDE to harness magic

EdgeOne Pages MCPAn MCP service designed for deploying HTML content to EdgeOne Pages and obtaining an accessible public URL.

Amap Maps高德地图官方 MCP Server

Playwright McpPlaywright MCP server

ChatWiseThe second fastest AI chatbot™

MCP AdvisorMCP Advisor & Installation - Use the right MCP server for your needs

CursorThe AI Code Editor

Jina AI MCP ToolsA Model Context Protocol (MCP) server that integrates with Jina AI Search Foundation APIs.

RedisA Model Context Protocol server that provides access to Redis databases. This server enables LLMs to interact with Redis key-value stores through a set of standardized tools.

MiniMax MCPOfficial MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.

Zhipu Web SearchZhipu Web Search MCP Server is a search engine specifically designed for large models. It integrates four search engines, allowing users to flexibly compare and switch between them. Building upon the web crawling and ranking capabilities of traditional search engines, it enhances intent recognition capabilities, returning results more suitable for large model processing (such as webpage titles, URLs, summaries, site names, site icons, etc.). This helps AI applications achieve "dynamic knowledge acquisition" and "precise scenario adaptation" capabilities.

Y GuiA web-based graphical interface for AI chat interactions with support for multiple AI models and MCP (Model Context Protocol) servers.

Selenium MCP Server

Selenium MCP Server

🚀 Key Features

📦 Exposing MCP Server for Team Usage

1. Publish to npm (for team-wide access)

2. Install and Run (for any team)

Global install (recommended for teams)

Or use npx (no install needed)

3. Configure in MCP Client (e.g., Cursor, CI, or custom)

📝 Onboarding for New Teams

Example package.json for Publishing

📋 Available Tools

🛠️ Installation & Setup

For Cursor Users

For Other MCP Clients

Alternative Installation Methods

Method 1: Local Installation

Method 2: Global Installation

Quick Start

🎯 Zero-Installation (Recommended)

1. Add Configuration

2. Restart Cursor

Option A: Local Installation

1. Install in your project

2. Configure Cursor

Option B: Global Installation

1. Install globally

2. Configure Cursor

3. Use in Cursor

Configuration Options

Available Tools

Navigation

Page Interaction

Information Gathering

Tab Management

Utilities

Usage Examples

Basic Navigation and Screenshot

Form Interaction

Data Extraction

Requirements

Configuration Examples

Local Development

Headless Production

Custom Browser Path

Troubleshooting

Browser Not Found

Permission Issues

Memory Issues

Contributing

License

Related Projects

References

For more, see CONTRIBUTING.md and TEAM_SETUP_GUIDE.md

Example `package.json` for Publishing