Bright Data MCP

Created by luminati-io, 8 months ago
A powerful Model Context Protocol (MCP) server that provides an all-in-one solution for public web access.

Enhance AI Agents with Real-Time Web Data

🌟 Overview

Welcome to the official Bright Data Model Context Protocol (MCP) server, which lets LLMs, agents and apps access, discover and extract web data in real time. It allows MCP clients such as Claude Desktop, Cursor, Windsurf and others to search the web, navigate websites, take actions and retrieve data without getting blocked.

✨ Features

  • Real-time Web Access: Access up-to-date information directly from the web
  • Bypass Geo-restrictions: Access content regardless of location constraints
  • Web Unlocker: Navigate websites that use bot-detection protection
  • Browser Control: Optional remote browser automation capabilities
  • Seamless Integration: Works with all MCP-compatible AI assistants

🔧 Available Tools

| Tool | Description |
|------|-------------|
| search_engine | Scrape search results from Google, Bing or Yandex. Returns SERP results in Markdown (URL, title, description). |
| scrape_as_markdown | Scrape a single webpage URL with advanced options for content extraction and get the results back in Markdown. This tool can unlock any webpage even if it uses bot detection or CAPTCHA. |
| scrape_as_html | Scrape a single webpage URL with advanced options for content extraction and get the results back in HTML. This tool can unlock any webpage even if it uses bot detection or CAPTCHA. |
| session_stats | Tell the user about tool usage during this session. |
| web_data_amazon_product | Quickly read structured Amazon product data. Requires a valid product URL with /dp/ in it. This can be a cache lookup, so it can be more reliable than scraping. |
| web_data_amazon_product_reviews | Quickly read structured Amazon product review data. Requires a valid product URL with /dp/ in it. This can be a cache lookup, so it can be more reliable than scraping. |
| web_data_linkedin_person_profile | Quickly read structured LinkedIn person profile data. This can be a cache lookup, so it can be more reliable than scraping. |
| web_data_linkedin_company_profile | Quickly read structured LinkedIn company profile data. This can be a cache lookup, so it can be more reliable than scraping. |
| web_data_zoominfo_company_profile | Quickly read structured ZoomInfo company profile data. Requires a valid ZoomInfo company URL. This can be a cache lookup, so it can be more reliable than scraping. |
| web_data_instagram_profiles | Quickly read structured Instagram profile data. Requires a valid Instagram URL. This can be a cache lookup, so it can be more reliable than scraping. |
| web_data_instagram_posts | Quickly read structured Instagram post data. Requires a valid Instagram URL. This can be a cache lookup, so it can be more reliable than scraping. |
| web_data_instagram_reels | Quickly read structured Instagram reel data. Requires a valid Instagram URL. This can be a cache lookup, so it can be more reliable than scraping. |
| web_data_instagram_comments | Quickly read structured Instagram comments data. Requires a valid Instagram URL. This can be a cache lookup, so it can be more reliable than scraping. |
| web_data_facebook_posts | Quickly read structured Facebook post data. Requires a valid Facebook post URL. This can be a cache lookup, so it can be more reliable than scraping. |
| web_data_facebook_marketplace_listings | Quickly read structured Facebook Marketplace listing data. Requires a valid Facebook Marketplace listing URL. This can be a cache lookup, so it can be more reliable than scraping. |
| web_data_facebook_company_reviews | Quickly read structured Facebook company reviews data. Requires a valid Facebook company URL and the number of reviews. This can be a cache lookup, so it can be more reliable than scraping. |
| web_data_x_posts | Quickly read structured X post data. Requires a valid X post URL. This can be a cache lookup, so it can be more reliable than scraping. |
| web_data_zillow_properties_listing | Quickly read structured Zillow property listing data. Requires a valid Zillow property listing URL. This can be a cache lookup, so it can be more reliable than scraping. |
| web_data_booking_hotel_listings | Quickly read structured Booking hotel listing data. Requires a valid Booking hotel listing URL. This can be a cache lookup, so it can be more reliable than scraping. |
| web_data_youtube_videos | Quickly read structured YouTube video data. Requires a valid YouTube video URL. This can be a cache lookup, so it can be more reliable than scraping. |
| scraping_browser_navigate | Navigate a scraping browser session to a new URL. |
| scraping_browser_go_back | Go back to the previous page. |
| scraping_browser_go_forward | Go forward to the next page. |
| scraping_browser_click | Click on an element. Avoid calling this unless you know the element selector (you can use other tools to find those). |
| scraping_browser_links | Get all links on the current page, with their text and selectors. It's strongly recommended that you call the links tool to check that your click target is valid. |
| scraping_browser_type | Type text into an element. |
| scraping_browser_wait_for | Wait for an element to be visible on the page. |
| scraping_browser_screenshot | Take a screenshot of the current page. |
| scraping_browser_get_html | Get the HTML content of the current page. Avoid using the full_page option unless it is important to see things like script tags, since the output can be large. |
| scraping_browser_get_text | Get the text content of the current page. |
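
MCP clients invoke the tools listed above with a JSON-RPC 2.0 `tools/call` request. The sketch below builds such a request for web_data_amazon_product and checks the /dp/ requirement first. The helper name `buildAmazonProductCall` and the argument key `url` are assumptions made for illustration; the tool's real input schema (as reported by your client's tool listing) is the authority on parameter names.

```javascript
// Hypothetical helper: builds an MCP `tools/call` request (JSON-RPC 2.0)
// for the web_data_amazon_product tool.
// ASSUMPTION: the tool takes its input under the key "url".
function buildAmazonProductCall(id, productUrl) {
  const parsed = new URL(productUrl); // throws TypeError on a malformed URL
  // The tool list says the product URL must contain /dp/.
  if (!parsed.pathname.includes('/dp/')) {
    throw new Error('Amazon product URL must contain /dp/');
  }
  return {
    jsonrpc: '2.0',
    id,
    method: 'tools/call',
    params: {
      name: 'web_data_amazon_product',
      arguments: { url: parsed.href },
    },
  };
}

// Placeholder product URL, used only to show the request shape.
const request = buildAmazonProductCall(1, 'https://www.amazon.com/dp/B000000000');
console.log(JSON.stringify(request, null, 2));
```

Most MCP clients build this request for you; the sketch is mainly useful for seeing what travels over the wire and for rejecting unsuitable URLs before a call is made.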

🚀 Quickstart with Claude Desktop

  1. Install Node.js to get the npx command (the Node.js package runner). Installation instructions can be found on the Node.js website.

  2. In Claude Desktop, go to Settings > Developer > Edit Config and add the following to claude_desktop_config.json:

{
  "mcpServers": {
    "Bright Data": {
      "command": "npx",
      "args": ["@brightdata/mcp"],
      "env": {
        "API_TOKEN": "<insert-your-api-token-here>",
        "WEB_UNLOCKER_ZONE": "<optional if you want to override the default mcp_unlocker zone name>",
        "BROWSER_AUTH": "<optional if you want to enable remote browser control tools>"
      }
    }
  }
}
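
Only API_TOKEN is required in the config above; the other two keys are optional. A minimal sketch of generating the entry with the optional keys left out when unused follows. `buildBrightDataEntry` is a hypothetical helper name, and only the key names shown in the config come from the source.

```javascript
// Hypothetical helper: builds the "mcpServers" entry shown above,
// including the optional env keys only when a value is supplied.
function buildBrightDataEntry({ apiToken, webUnlockerZone, browserAuth }) {
  if (!apiToken) {
    throw new Error('API_TOKEN is required');
  }
  const env = { API_TOKEN: apiToken };
  if (webUnlockerZone) env.WEB_UNLOCKER_ZONE = webUnlockerZone; // overrides mcp_unlocker
  if (browserAuth) env.BROWSER_AUTH = browserAuth; // enables browser control tools
  return {
    mcpServers: {
      'Bright Data': { command: 'npx', args: ['@brightdata/mcp'], env },
    },
  };
}

// Print JSON that can be merged into claude_desktop_config.json.
const entry = buildBrightDataEntry({ apiToken: '<insert-your-api-token-here>' });
console.log(JSON.stringify(entry, null, 2));
```

If the config file already lists other servers, merge the "Bright Data" key into the existing mcpServers object rather than replacing the file.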

🔧 Account Setup

  1. Make sure you have an account on brightdata.com (new users get free credit for testing, and pay-as-you-go options are available)

  2. Get your API key from the user settings page (this is the value to use for API_TOKEN)

  3. Create a Web Unlocker proxy zone called mcp_unlocker in your control panel

    • You can override this zone in your MCP server with the env variable WEB_UNLOCKER_ZONE
  4. (Optional) To enable browser control tools:

    • Visit your Bright Data control panel at brightdata.com/cp/zones
    • Create a new 'Browser API' zone
    • Once created, copy the authentication string from the Browser API overview tab
    • The authentication string will be formatted like: brd-customer-[your-customer-ID]-zone-[your-zone-ID]:[your-password]
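
A quick shape check can catch a mis-pasted authentication string before it reaches BROWSER_AUTH. This is a minimal sketch based only on the format described above; `parseBrowserAuth` is a hypothetical helper, and the pattern assumes the zone ID contains no colon. It does not verify the credentials with Bright Data.

```javascript
// Shape check for the Browser API authentication string described above:
//   brd-customer-[customer-ID]-zone-[zone-ID]:[password]
// ASSUMPTION: the zone ID contains no ":"; everything after the first ":"
// that follows the zone ID is treated as the password.
const AUTH_PATTERN = /^brd-customer-(.+?)-zone-([^:]+):(.+)$/;

function parseBrowserAuth(auth) {
  const match = AUTH_PATTERN.exec(auth);
  if (!match) return null; // not in the expected format
  const [, customerId, zone, password] = match;
  return { customerId, zone, password };
}

// Made-up values, used only to exercise the pattern.
console.log(parseBrowserAuth('brd-customer-abc123-zone-my_browser:s3cret'));
console.log(parseBrowserAuth('not-an-auth-string')); // null
```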

🔌 Other MCP Clients

To use this MCP server with other agent types, you should adapt the following to your specific software:

  • The full command to run the MCP server is npx @brightdata/mcp
  • The environment variable API_TOKEN=<your-token> must exist when running the server
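
For a client written in Node.js, the two points above could be wired up roughly as follows. This is a sketch, not the project's own launcher: `launchBrightDataMcp` is a hypothetical helper, and it assumes the server exchanges MCP messages over stdin/stdout, as it does when a client starts it through a command/args entry.

```javascript
const { spawn } = require('child_process');

// Hypothetical helper: starts the server with `npx @brightdata/mcp` and
// makes sure API_TOKEN exists in the child's environment.
function launchBrightDataMcp(apiToken, extraEnv = {}) {
  if (!apiToken) {
    throw new Error('API_TOKEN must be set');
  }
  return spawn('npx', ['@brightdata/mcp'], {
    env: { ...process.env, ...extraEnv, API_TOKEN: apiToken },
    // stdin/stdout are piped for the client to use; stderr goes to the terminal.
    stdio: ['pipe', 'pipe', 'inherit'],
    // On Windows, npx is a .cmd shim and needs a shell to run.
    shell: process.platform === 'win32',
  });
}
```

Calling `launchBrightDataMcp(process.env.API_TOKEN)` returns a ChildProcess; optional variables such as WEB_UNLOCKER_ZONE can be passed through `extraEnv`.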

🎮 Try Bright Data MCP Playgrounds

Want to try Bright Data MCP without setting up anything?

Check out the Bright Data MCP playground on Smithery.

This platform provides an easy way to explore the capabilities of Bright Data MCP without any local setup. Just sign in and start experimenting with web data collection!

💡 Usage Examples

Some example queries that this MCP server will be able to help with:

  • "Google some movies that are releasing soon in [your area]"
  • "What's Tesla's current market cap?"
  • "What's the Wikipedia article of the day?"
  • "What's the 7-day weather forecast in [your location]?"
  • "Of the 3 highest paid tech CEOs, how long have their careers been?"

🎬 Demo

[Videos: two demos of a minimal use case for Claude Desktop]

⚠️ Troubleshooting

Timeouts when using certain tools

Some tools read live web data, and page load times can vary widely, especially in extreme cases.

To make sure your agent can consume the data, set a sufficiently high timeout in your agent settings.

A value of 180s should be enough for 99% of requests, but some sites load more slowly than others, so tune this to your needs.
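
If your agent code has no timeout setting of its own, a generic wrapper is one way to apply the 180s budget. The sketch below is plain JavaScript and not part of @brightdata/mcp; `withTimeout` and `fakeToolCall` are hypothetical names, and the wrapper only stops waiting, it does not cancel the underlying request.

```javascript
// Hypothetical helper: rejects if `promise` has not settled within `ms`.
// It only stops waiting; it does not cancel the underlying work.
function withTimeout(promise, ms) {
  let timer;
  const timeout = new Promise((_, reject) => {
    timer = setTimeout(() => reject(new Error(`Timed out after ${ms} ms`)), ms);
  });
  // Clear the timer either way so it does not keep the process alive.
  return Promise.race([promise, timeout]).finally(() => clearTimeout(timer));
}

const TOOL_TIMEOUT_MS = 180 * 1000; // the 180 s suggested above

// Stand-in for a real tool call made through an MCP client.
const fakeToolCall = new Promise((resolve) => {
  setTimeout(() => resolve('page content'), 50);
});

withTimeout(fakeToolCall, TOOL_TIMEOUT_MS).then((result) => console.log(result));
```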

spawn npx ENOENT

This error occurs when your system cannot find the npx command. To fix it:

Finding npm/Node Path

macOS:

which node

Shows path like /usr/local/bin/node

Windows:

where node

Shows path like C:\Program Files\nodejs\node.exe

Update your MCP configuration:

Replace the bare npx command with a full path. npx is normally installed in the same directory as node, so the path found above shows where to look. On macOS, for example, it would look like this:

"command": "/usr/local/bin/npx"

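The path lookup above can also be scripted, which helps when the MCP client runs with a different PATH than your terminal. This is a minimal sketch that assumes a Node.js runtime is available to run it; `findOnPath` is a hypothetical helper, not part of @brightdata/mcp.

```javascript
const fs = require('fs');
const path = require('path');

// Hypothetical helper: returns the first full path for `name` found in a
// PATH-style string, or null. On Windows it also tries common extensions.
function findOnPath(name, pathVar = process.env.PATH || '') {
  const exts = process.platform === 'win32' ? ['', '.exe', '.cmd'] : [''];
  for (const dir of pathVar.split(path.delimiter)) {
    if (!dir) continue; // skip empty PATH segments
    for (const ext of exts) {
      const candidate = path.join(dir, name + ext);
      if (fs.existsSync(candidate) && fs.statSync(candidate).isFile()) {
        return candidate;
      }
    }
  }
  return null;
}

console.log('node:', findOnPath('node'));
console.log('npx: ', findOnPath('npx'));
```

A null result for npx in the environment the client uses is consistent with the ENOENT error described above.
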
👨‍💻 Contributing

We welcome contributions to help improve the Bright Data MCP! Here's how you can help:

  1. Report Issues: If you encounter any bugs or have feature requests, please open an issue on our GitHub repository.
  2. Submit Pull Requests: Feel free to fork the repository and submit pull requests with enhancements or bug fixes.
  3. Coding Style: All JavaScript code should follow Bright Data's JavaScript coding conventions. This ensures consistency across the codebase.
  4. Documentation: Improvements to documentation, including this README, are always appreciated.
  5. Examples: Share your use cases by contributing examples to help other users.

For major changes, please open an issue first to discuss your proposed changes. This ensures your time is well spent and aligned with project goals.

📞 Support

If you encounter any issues or have questions, please reach out to the Bright Data support team or open an issue in the repository.
