Sponsored by Deepsite.site

Browser Automation MCP Server

Created By
samihalawa9 months ago
A Model Context Protocol (MCP) server that provides browser automation capabilities for Claude and other MCP-compatible AI assistants
Content

Browser Automation MCP Server

A Model Context Protocol (MCP) server that provides browser automation capabilities for Claude and other MCP-compatible AI assistants.

Features

  • Web Automation: Control web browsers programmatically
  • Screenshot Capture: Take screenshots of web pages
  • Element Interaction: Click, type, and interact with web elements
  • Navigation: Navigate between pages and manage browser state
  • Form Filling: Automate form filling and submission
  • Data Extraction: Extract data from web pages

Installation

# Clone the repository
git clone https://github.com/samihalawa/browser-automation-server.git
cd browser-automation-server

# Install dependencies
npm install

# Build the server
npm run build

Usage

Starting the Server

npm start

Configuration

Add the server to your MCP configuration:

{
  "servers": {
    "browser-automation": {
      "command": "/path/to/node",
      "args": ["/path/to/browser-automation-server/build/index.js"],
      "enabled": true,
      "port": 3008,
      "environment": {
        "NODE_PATH": "/path/to/node_modules",
        "PATH": "/usr/local/bin:/usr/bin:/bin"
      }
    }
  }
}

Available Tools

Navigate to a URL.

Parameters:

  • url (string, required): URL to navigate to
  • waitUntil (string, optional): Navigation wait condition. Options: 'load', 'domcontentloaded', 'networkidle'. Default: 'load'

screenshot

Take a screenshot of the current page.

Parameters:

  • fullPage (boolean, optional): Whether to capture full page or just viewport. Default: false
  • path (string, optional): Path to save the screenshot to. If not provided, returns base64 encoded image

click

Click on an element.

Parameters:

  • selector (string, required): CSS selector of the element to click
  • waitForSelector (boolean, optional): Whether to wait for the selector to appear. Default: true

type

Type text into an input field.

Parameters:

  • selector (string, required): CSS selector of the input element
  • text (string, required): Text to type
  • delay (number, optional): Delay between keystrokes in milliseconds. Default: 0

extract

Extract data from the page.

Parameters:

  • selector (string, required): CSS selector of the elements to extract
  • attribute (string, optional): Attribute to extract. If not provided, extracts text content

evaluate

Evaluate JavaScript in the browser context.

Parameters:

  • script (string, required): JavaScript code to evaluate
  • args (array, optional): Arguments to pass to the script

Example Usage

  1. Navigate to a website:

    navigate(url: "https://example.com")
    
  2. Take a screenshot:

    screenshot(fullPage: true)
    
  3. Click a button:

    click(selector: "#submit-button")
    
  4. Fill a form:

    type(selector: "#username", text: "user123")
    type(selector: "#password", text: "password123")
    click(selector: "#login-button")
    
  5. Extract data:

    extract(selector: ".product-title", attribute: "innerText")
    

Requirements

  • Node.js 14+
  • Playwright for browser automation

License

MIT

Recommend Servers
TraeBuild with Free GPT-4.1 & Claude 3.7. Fully MCP-Ready.
Playwright McpPlaywright MCP server
Context7Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors
MCP AdvisorMCP Advisor & Installation - Use the right MCP server for your needs
Serper MCP ServerA Serper MCP Server
BlenderBlenderMCP connects Blender to Claude AI through the Model Context Protocol (MCP), allowing Claude to directly interact with and control Blender. This integration enables prompt assisted 3D modeling, scene creation, and manipulation.
ChatWiseThe second fastest AI chatbot™
CursorThe AI Code Editor
Zhipu Web SearchZhipu Web Search MCP Server is a search engine specifically designed for large models. It integrates four search engines, allowing users to flexibly compare and switch between them. Building upon the web crawling and ranking capabilities of traditional search engines, it enhances intent recognition capabilities, returning results more suitable for large model processing (such as webpage titles, URLs, summaries, site names, site icons, etc.). This helps AI applications achieve "dynamic knowledge acquisition" and "precise scenario adaptation" capabilities.
Tavily Mcp
Howtocook Mcp基于Anduin2017 / HowToCook (程序员在家做饭指南)的mcp server,帮你推荐菜谱、规划膳食,解决“今天吃什么“的世纪难题; Based on Anduin2017/HowToCook (Programmer's Guide to Cooking at Home), MCP Server helps you recommend recipes, plan meals, and solve the century old problem of "what to eat today"
WindsurfThe new purpose-built IDE to harness magic
AiimagemultistyleA Model Context Protocol (MCP) server for image generation and manipulation using fal.ai's Stable Diffusion model.
Amap Maps高德地图官方 MCP Server
TimeA Model Context Protocol server that provides time and timezone conversion capabilities. This server enables LLMs to get current time information and perform timezone conversions using IANA timezone names, with automatic system timezone detection.
Baidu Map百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。
MiniMax MCPOfficial MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
EdgeOne Pages MCPAn MCP service designed for deploying HTML content to EdgeOne Pages and obtaining an accessible public URL.
Visual Studio Code - Open Source ("Code - OSS")Visual Studio Code
Jina AI MCP ToolsA Model Context Protocol (MCP) server that integrates with Jina AI Search Foundation APIs.
DeepChatYour AI Partner on Desktop