- Tool Router Mode: Composio's intelligent tool routing for accessing Gmail, Slack, GitHub, and 500+ integrations
- Browser Tools Mode: Gemini 2.5 Computer Use for visual browser automation with screenshots, clicks, typing, scrolling, and navigation
- Sidebar Chat Interface: Clean, modern React-based chat UI accessible from any tab
- Direct Browser Automation: No backend required - all API calls are made directly from the extension
- Visual Feedback: Blue click indicators and element highlighting during automation
- Safety Features: Confirmation dialogs for sensitive actions (checkout, payment, etc.)
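The confirmation step for sensitive actions could look roughly like the sketch below. It is illustrative only: `BrowserAction`, `SENSITIVE_KEYWORDS`, `requiresConfirmation`, and `runWithConfirmation` are hypothetical names, and the keyword list contains only the two examples named above (checkout, payment). The extension's actual detection logic may differ.

```typescript
// Hypothetical shape of an action the model asks the extension to perform.
interface BrowserAction {
  type: "click" | "type" | "navigate" | "scroll";
  // Human-readable description of the target, e.g. button text or a URL.
  target: string;
}

// Assumed keyword list, taken from the examples in the feature description.
const SENSITIVE_KEYWORDS = ["checkout", "payment"];

// Returns true when the user should confirm the action before it runs.
function requiresConfirmation(action: BrowserAction): boolean {
  // Assumption: scrolling never commits anything, so it is never gated.
  if (action.type === "scroll") return false;
  const target = action.target.toLowerCase();
  return SENSITIVE_KEYWORDS.some((keyword) => target.includes(keyword));
}

// Executes the action only if it is not sensitive or the user approves it.
// `confirm` and `execute` are injected so the sketch needs no browser APIs.
function runWithConfirmation(
  action: BrowserAction,
  confirm: (message: string) => boolean,
  execute: (action: BrowserAction) => void
): boolean {
  if (requiresConfirmation(action) && !confirm(`Allow ${action.type} on "${action.target}"?`)) {
    return false; // user declined, nothing is executed
  }
  execute(action);
  return true;
}
```

In a real extension, `confirm` would be backed by a dialog in the sidebar UI and `execute` by the automation layer; both are left abstract here.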
- Node.js 18+ and npm
- Chrome or Edge browser (Manifest V3 support)
- Google API key for Gemini (required)
- Composio API key (optional, for Tool Router mode)
- Clone this repository
- Install dependencies:
  npm install
- Build the extension:
  npm run build
- Load the extension in Chrome:
  - Open Chrome and navigate to chrome://extensions/
  - Enable "Developer mode" in the top right
  - Click "Load unpacked"
  - Select the dist folder
- Open Settings (⚙️ icon) to configure your API keys
- Google API Key (Required)
  - Get your key from Google AI Studio
  - Add it in Settings under "Google API Key"
  - Supports: Gemini 2.5 Pro, Flash, and Flash Lite
- Composio API Key (Optional - for Tool Router mode)
  - Get your key from Composio Dashboard
  - Add it in Settings under "Composio API Key"
  - Enables access to 500+ app integrations
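Because the extension talks to the APIs directly, the Google API key stored in Settings ends up attached to each outgoing request. The sketch below shows one way such a request could be assembled for the public Gemini REST endpoint. `GeminiRequest` and `buildGeminiRequest` are hypothetical names, the default model id is an assumption, and the extension itself may well use an SDK rather than raw HTTP.

```typescript
// Plain description of an HTTP request, so the sketch needs no browser or Node APIs.
interface GeminiRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

// Builds a generateContent request for the Gemini REST API.
// The default model id is an assumption; any supported Gemini model id can be passed.
function buildGeminiRequest(
  apiKey: string,
  prompt: string,
  model: string = "gemini-2.5-flash"
): GeminiRequest {
  const key = apiKey.trim();
  if (key === "") {
    // The Google API key is required, so fail early instead of sending a bad request.
    throw new Error("Google API key is required");
  }
  return {
    url: `https://generativelanguage.googleapis.com/v1beta/models/${model}:generateContent`,
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      "x-goog-api-key": key,
    },
    body: JSON.stringify({
      contents: [{ role: "user", parts: [{ text: prompt }] }],
    }),
  };
}
```

The returned object can be passed straight to `fetch(req.url, { method: req.method, headers: req.headers, body: req.body })` from extension code.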
- Enable Browser Tools by clicking its toggle button in the chat header
- The extension automatically uses Gemini 2.5 Computer Use Preview
- Provide natural language instructions to control the browser
Example prompts:
- "Navigate to reddit.com and scroll down"
- "Click on the search box and type 'puppies'"
- "Take a screenshot of this page"
- "Click the first image on the page"
- Add your Composio API key in Settings
- Click the Browser Tools toggle in the chat header to disable Browser Tools (or keep it off)
- Chat normally - the AI will automatically use Composio tools when needed
Example prompts:
- "Check my Gmail for unread messages"
- "Create a GitHub issue titled 'Bug in login flow'"
- "Send a Slack message to #general with 'Hello team!'"
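The choice between the two modes described above can be summarized in a small sketch. The names `Settings`, `Mode`, and `selectMode` are hypothetical, and the `"chat-only"` fallback (Browser Tools off, no Composio key) is an assumption about what "chat normally" means when no tools are available.

```typescript
type Mode = "browser-tools" | "tool-router" | "chat-only";

// Hypothetical view of the values configured in Settings and the chat header.
interface Settings {
  googleApiKey: string;          // required
  composioApiKey?: string;       // optional, only needed for Tool Router mode
  browserToolsEnabled: boolean;  // state of the Browser Tools toggle
}

// Decides which mode a chat message is handled in.
function selectMode(settings: Settings): Mode {
  if (settings.googleApiKey.trim() === "") {
    throw new Error("Google API key is required");
  }
  // The Browser Tools toggle takes priority when it is on.
  if (settings.browserToolsEnabled) {
    return "browser-tools";
  }
  // With the toggle off, a Composio key enables Tool Router.
  if (settings.composioApiKey !== undefined && settings.composioApiKey.trim() !== "") {
    return "tool-router";
  }
  // Assumption: without either, the assistant answers without tools.
  return "chat-only";
}
```

For example, `selectMode({ googleApiKey: "g", composioApiKey: "c", browserToolsEnabled: false })` yields `"tool-router"`, matching the steps above where Browser Tools is switched off and a Composio key is present.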
Run with hot reload:
npm run dev
Then reload the extension in Chrome after each change.
- FAQ - Frequently asked questions and quick troubleshooting
- TROUBLESHOOTING.md - Detailed troubleshooting guide for common issues
- Composio Platform - Intelligent tool routing for AI agents
- Composio Tool Router Documentation - Learn how to use Tool Router to route tool calls across 500+ integrations
- Composio GitHub - Python and TS SDK
- ChatGPT Atlas - OpenAI's browser automation AI agent
- Gemini Computer Use Model - Google's AI model for browser automation
- Gemini API Documentation - Official documentation for Gemini Computer Use
