
MinionWorks – Modular browser agents that work for bananas 🍌

Created by minionworks, 8 months ago


Modular. Extensible. AI-native browser agents for modern web automation.


🚀 Overview

MinionWorks is a modular AI agent framework that connects to your browser and executes complex tasks autonomously. Built for developers, researchers, and curious builders.

✨ Features

  • 🌐 Perform Google searches and scrape content
  • 🤖 Use LLMs (like GPT-4) to plan actions
  • 🔗 Modular architecture for plug-and-play use cases
  • 🔎 DOM interaction & content extraction
  • 🔄 Run workflows via Python or UI

🛠️ Installation

  1. Install the package

    pip install minion-agent
    
  2. Set up environment variables

    cp .env.example .env
    # Edit your .env file with OpenAI or other API keys
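
Before starting an agent, it can help to confirm that the keys from `.env` actually reached the process. The following is a minimal sketch, assuming the key is named `OPENAI_API_KEY` (the variable `langchain-openai` reads by default). `missing_keys` and `REQUIRED_KEYS` are hypothetical names used for illustration, not part of MinionWorks:

```python
import os

# Assumption: OPENAI_API_KEY is the only key this setup needs.
REQUIRED_KEYS = ("OPENAI_API_KEY",)

def missing_keys(required=REQUIRED_KEYS, env=None):
    """Return the names in `required` that are unset or empty in `env`."""
    env = os.environ if env is None else env
    return [name for name in required if not env.get(name)]

if __name__ == "__main__":
    missing = missing_keys()
    if missing:
        print("Missing environment variables:", ", ".join(missing))
    else:
        print("All required keys are set.")
```

If you load the file with `load_dotenv()`, call `missing_keys()` after it, since the check only sees variables already present in the process environment.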
    

🧪 Quick Start

Here’s a complete example using MinionAgent with langchain-openai:

from langchain_openai import ChatOpenAI
from minion_agent.browser import MinionAgent
import asyncio
from dotenv import load_dotenv

# Load environment variables from .env file
load_dotenv()

async def main():
    # ChatOpenAI reads OPENAI_API_KEY from the environment loaded above
    agent = MinionAgent(
        task="Compare the price of GPT-4 and DeepSeek-V3",
        llm=ChatOpenAI(model="gpt-4o"),
        headless=True,  # run the browser without a visible window
    )
    result = await agent.run()
    print("Result:", result)

if __name__ == "__main__":
    asyncio.run(main())
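
Browser tasks can stall on slow pages, so it may be worth bounding how long a run is allowed to take. Here is a minimal sketch using the standard library's `asyncio.wait_for`. It assumes only that the agent exposes the awaitable `run()` shown above. `run_with_timeout` and `SlowAgent` are hypothetical names, and `SlowAgent` stands in for `MinionAgent` so the snippet runs without a browser or API key:

```python
import asyncio

class SlowAgent:
    """Hypothetical stand-in for MinionAgent: sleeps, then returns a string."""
    def __init__(self, delay):
        self.delay = delay

    async def run(self):
        await asyncio.sleep(self.delay)
        return "done"

async def run_with_timeout(agent, seconds):
    """Await agent.run(), returning None if it exceeds `seconds`."""
    try:
        return await asyncio.wait_for(agent.run(), timeout=seconds)
    except asyncio.TimeoutError:
        # wait_for cancels the pending run() before raising
        return None

if __name__ == "__main__":
    print(asyncio.run(run_with_timeout(SlowAgent(0.01), 1.0)))
    print(asyncio.run(run_with_timeout(SlowAgent(1.0), 0.05)))
```

Whether `MinionAgent` closes its browser session cleanly when a run is cancelled is not stated in this README, so treat the timeout as a guard for scripts rather than a guaranteed cleanup mechanism.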

💡 Example Use Cases

Invoke MinionAgent with any of these example prompts to see it in action. Each snippet uses await, so run it inside an async function such as main() from the Quick Start:

  1. List of Indian B2B YC Companies

    agent = MinionAgent(task="Give me the list of all B2B YC companies in India", llm=ChatOpenAI(model="gpt-4o"))
    result = await agent.run()
    print(result)
    
  2. Kerala CPI Index (Feb 2024)

    agent = MinionAgent(task="What is the CPI index of Kerala for the month of February 2024", llm=ChatOpenAI(model="gpt-4o"))
    result = await agent.run()
    print(result)
    
  3. 3-Day Ladakh Itinerary from Delhi

    agent = MinionAgent(task="Give me the itinerary for the three day plan for visiting Ladakh from Delhi", llm=ChatOpenAI(model="gpt-4o"))
    result = await agent.run()
    print(result)
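
The three prompts above share the same shape, so they can be run from one loop. This is a sketch, assuming each task gets a fresh agent. `run_all` and the `make_agent` factory are hypothetical helpers, and `EchoAgent` is a stand-in so the example runs without a browser. With the real library, `make_agent` would be something like `lambda task: MinionAgent(task=task, llm=ChatOpenAI(model="gpt-4o"))`:

```python
import asyncio

TASKS = [
    "Give me the list of all B2B YC companies in India",
    "What is the CPI index of Kerala for the month of February 2024",
    "Give me the itinerary for the three day plan for visiting Ladakh from Delhi",
]

class EchoAgent:
    """Hypothetical stand-in for MinionAgent; echoes its task back."""
    def __init__(self, task):
        self.task = task

    async def run(self):
        return f"result for: {self.task}"

async def run_all(tasks, make_agent):
    """Run tasks one at a time and map each task to its result."""
    results = {}
    for task in tasks:
        results[task] = await make_agent(task).run()
    return results

if __name__ == "__main__":
    for task, result in asyncio.run(run_all(TASKS, EchoAgent)).items():
        print(task, "->", result)
```

The loop runs tasks sequentially so that only one browser session is active at a time. `asyncio.gather` would be the concurrent alternative if parallel sessions are acceptable.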
    

📂 Examples & Recordings

Play back the recorded demos in the examples/videos/ directory and see MinionAgent tackle each prompt below.

1. Indian B2B YC Companies

Prompt:

Give me the list of all B2B YC companies in India

2. Kerala CPI Index (Feb 2024)

Prompt:

What is the CPI index of Kerala for the month of February 2024

3. Ladakh Itinerary from Delhi

Prompt:

Give me the itinerary for the three day plan for visiting Ladakh from Delhi

🧪 Testing

pytest --maxfail=1 --disable-warnings -q

Run this from the repository root, where tests/ lives.


🤝 Contributing

We welcome PRs, feedback, and creative ideas!

  1. Fork → Branch → Commit
  2. Add tests
  3. Submit a Pull Request
  4. Tell your friends 🚀

📖 Citation

@software{minion_works2025,
  author = {Sairaam and Aman and Cheena},
  title = {Minion Works: Let AI take the helm of your browser.},
  year = {2025},
  publisher = {GitHub},
  url = {https://github.com/minionworks/minions}
}