Sponsored by Deepsite.site

Gemini Terminal Agent

Created By
Nghiaueta year ago
create agent with mcp server
Content

Gemini Terminal Agent

A powerful terminal-based agent using Google's Gemini model with web search capabilities. This agent lets you interact with Gemini through your terminal while leveraging real-time web search for up-to-date information.

Features

  • 🤖 Conversational AI Interface - Talk with Google's Gemini models directly from your terminal
  • 🔍 Web Search Integration - Get real-time information from the web
  • 💬 Conversation History - Maintain context throughout your conversation
  • 🛠️ Advanced Search Options - Filter by domains, exclude sites, and more
  • 📝 Clean, Modular Architecture - Well-structured codebase that's easy to extend

Installation

Prerequisites

  • Python 3.9+
  • Google API key for Gemini models
  • Google Custom Search Engine (CSE) API key and ID

Setup

  1. Clone the repository:

    git clone https://github.com/yourusername/gemini-terminal-agent.git
    cd gemini-terminal-agent
    
  2. Create a virtual environment (recommended):

    python -m venv venv
    source venv/bin/activate  # On Windows: venv\Scripts\activate
    
  3. Install dependencies:

    pip install -r requirements.txt
    
  4. Create a .env file in the project root with your API keys:

    GOOGLE_GENAI_API_KEY=your_gemini_api_key_here
    SEARCH_ENGINE_API_KEY=your_google_api_key_here
    SEARCH_ENGINE_CSE_ID=your_cse_id_here
    DEFAULT_MODEL=gemini-2.5-flash-preview-04-17
    

Setting Up Google Search Engine

To use the web search functionality, you need to set up a Google Custom Search Engine:

  1. Get a Google API Key:

    • Go to Google Cloud Console
    • Create a new project or select an existing one
    • Navigate to "APIs & Services" > "Library"
    • Search for "Custom Search API" and enable it
    • Go to "APIs & Services" > "Credentials"
    • Create an API key and copy it (this will be your SEARCH_ENGINE_API_KEY)
  2. Create a Custom Search Engine:

    • Go to Programmable Search Engine
    • Click "Create a Programmable Search Engine"
    • Add sites to search (use *.com to search the entire web)
    • Give your search engine a name
    • In "Customize" > "Basics", enable "Search the entire web"
    • Get your Search Engine ID from the "Setup" > "Basics" page (this will be your SEARCH_ENGINE_CSE_ID)
  3. Get a Gemini API Key:

    • Go to Google AI Studio
    • Sign in with your Google account
    • Go to "API Keys" and create a new API key
    • Copy the API key (this will be your GOOGLE_GENAI_API_KEY)

Usage

Run the agent from the terminal:

python main.py

Commands

  • Type your question or prompt to interact with the agent
  • Type help to see available tools and commands
  • Type clear to clear the conversation history
  • Type exit, quit, or q to exit the program

Example Queries

>>> What is the capital of France?
Paris is the capital of France. It is located in the north-central part of the country on the Seine River.

>>> search for recent developments in quantum computing
Searching the web for recent developments in quantum computing...
[Agent response with up-to-date information]

>>> help
🔍 Available Tools:
  - search: Search for information online based on a query
  - advanced_search: Perform an advanced search with domain filtering and time range options

⌨️ Terminal Commands:
  - help: Show this help message
  - clear: Clear conversation history
  - exit/quit/q: Exit the program

Project Structure

gemini-terminal-agent/
├── main.py               # Main entry point
├── search_server.py      # Search server entry point
├── .env                  # Environment variables (not versioned)
├── agent/                # Agent implementation
│   ├── __init__.py
│   ├── terminal_agent.py # Core agent implementation
│   └── config.py         # Agent configuration
├── search/               # Search functionality
│   ├── __init__.py
│   ├── server.py         # MCP search server
│   ├── engine.py         # Search engine implementation
│   └── content.py        # Web content extraction 
└── utils/                # Shared utilities
    ├── __init__.py
    ├── config.py         # Global configuration
    └── logging.py        # Logging setup

Advanced Configuration

You can customize the agent's behavior by modifying settings in your .env file:

# Model settings
DEFAULT_MODEL=gemini-2.5-flash-preview-04-17
# Other models: gemini-1.5-pro, gemini-1.5-flash

# Search settings
MAX_CONCURRENT_REQUESTS=5
CONNECTION_TIMEOUT=10
CONTENT_TIMEOUT=15
MAX_CONTENT_LENGTH=5000
CACHE_TTL=3600

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add some amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

  • This project uses LangChain for the agent framework
  • Web search functionality powered by Google Custom Search Engine
  • Built with Google's Gemini models
Recommend Servers
TraeBuild with Free GPT-4.1 & Claude 3.7. Fully MCP-Ready.
RedisA Model Context Protocol server that provides access to Redis databases. This server enables LLMs to interact with Redis key-value stores through a set of standardized tools.
Playwright McpPlaywright MCP server
AiimagemultistyleA Model Context Protocol (MCP) server for image generation and manipulation using fal.ai's Stable Diffusion model.
Tavily Mcp
Howtocook Mcp基于Anduin2017 / HowToCook (程序员在家做饭指南)的mcp server,帮你推荐菜谱、规划膳食,解决“今天吃什么“的世纪难题; Based on Anduin2017/HowToCook (Programmer's Guide to Cooking at Home), MCP Server helps you recommend recipes, plan meals, and solve the century old problem of "what to eat today"
Zhipu Web SearchZhipu Web Search MCP Server is a search engine specifically designed for large models. It integrates four search engines, allowing users to flexibly compare and switch between them. Building upon the web crawling and ranking capabilities of traditional search engines, it enhances intent recognition capabilities, returning results more suitable for large model processing (such as webpage titles, URLs, summaries, site names, site icons, etc.). This helps AI applications achieve "dynamic knowledge acquisition" and "precise scenario adaptation" capabilities.
Visual Studio Code - Open Source ("Code - OSS")Visual Studio Code
MiniMax MCPOfficial MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
Jina AI MCP ToolsA Model Context Protocol (MCP) server that integrates with Jina AI Search Foundation APIs.
MCP AdvisorMCP Advisor & Installation - Use the right MCP server for your needs
EdgeOne Pages MCPAn MCP service designed for deploying HTML content to EdgeOne Pages and obtaining an accessible public URL.
DeepChatYour AI Partner on Desktop
Baidu Map百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。
Y GuiA web-based graphical interface for AI chat interactions with support for multiple AI models and MCP (Model Context Protocol) servers.
CursorThe AI Code Editor
Serper MCP ServerA Serper MCP Server
ChatWiseThe second fastest AI chatbot™
Amap Maps高德地图官方 MCP Server
BlenderBlenderMCP connects Blender to Claude AI through the Model Context Protocol (MCP), allowing Claude to directly interact with and control Blender. This integration enables prompt assisted 3D modeling, scene creation, and manipulation.
WindsurfThe new purpose-built IDE to harness magic