Skip to content

Meet Glitch: The JARVIS you dreamt of. A multimodal agent that doesn't just chat; it acts. Sees your screen, automates your OS. #AgenticAI

License

Notifications You must be signed in to change notification settings

KirthanNB/AI-Companion

Repository files navigation

๐Ÿค– AI Desktop Companion โ€” Glitch

The AI that lives on your desktop โ€” sees, thinks, and acts.


๐Ÿง  What is Glitch?

(Just kidding, don't try that!!)

We grew up dreaming of companions like **JARVIS** โ€” agents that don๏ฟฝ๏ฟฝt just listen, but *act*. Somewhere along the way, assistants got stuck in browser tabs.

Glitch breaks the fourth wall of the operating system.

AI Desktop Companion (Glitch) is a fully multimodal, autonomous desktop agent that:

  • ๐Ÿ‘๏ธ Sees your screen
  • ๐ŸŽค Talks with you
  • ๐Ÿค– Controls your system
  • ๐Ÿ–ฅ๏ธ Lives directly on your desktop as a playful character
  • โ€ฆand a lot more (I want you to explore ๐Ÿ˜„)

This isnโ€™t just an assistant you use.
Itโ€™s one you work with.


๐ŸŽฌ Live Demo

โ–ถ๏ธ Full Demo (Vimeo)
https://vimeo.com/1150677379

The Vimeo demo shows Glitch executing real tasks end-to-end.

(Also check out our landing page here!!)


โœจ What Can Glitch Do?

๐Ÿ–ฅ๏ธ Lives on Your Desktop

Glitch runs as a transparent, click-through desktop overlay.
He shares your workspace instead of hiding in a window or sidebar.


๐ŸŽจ Interactive & Playful Characters

  • Interactive pixel-style characters
  • Drag, click, and interact
  • Customizable appearance and behavior

Inspired by classic desktop pets, powered by modern multimodal AI.


โš™๏ธ All Modes, One Companion

Everything is built in:

  • ๐ŸŽค Voice Mode
  • ๐Ÿ‘๏ธ Vision Mode
  • ๐Ÿค– Agent Mode
  • โš™๏ธ Settings (character & voice customization)

No switching apps. No broken context.


๐Ÿค– Agent Mode โ€” Real Automation

This is not just another chatbot.

Agent Mode lets Glitch:

  • Control mouse & keyboard
  • Open applications
  • Execute multi-step workflows
  • Do real things on your system

Thereโ€™s always a stop button. Safety matters.


๐Ÿš€ Developer Accelerator

Glitch is especially useful while building.

Here, it creates a complete Next.js project structure from a single voice command โ€” turning ideas into runnable code instantly.


๐Ÿ“ Smart Summarization & Notes

Glitch can:

  • Summarize information
  • Extract key points
  • Save them directly to Notepad or files

Your AI remembers for you.


๐ŸŒ On-Demand Web Search

Ask once โ€” Glitch searches Google, parses results, and gives you the useful bits.

Hands-free.


๐Ÿง  The Personality: Glitch

Glitch isnโ€™t robotic.

He has personality.
He reacts.
He feels present.

Working with AI finally feels alive, not transactional.


๐Ÿงฌ How It Works (High Level)

Glitch uses a hybrid multimodal agent architecture:

  • ๐Ÿง  Brain โ€” Google Gemini 2.0 Flash (chat + vision)
  • ๐Ÿ‘๏ธ Vision โ€” Screen understanding via screenshots
  • ๐ŸŽค Voice โ€” ElevenLabs (low-latency TTS)
  • ๐Ÿค– Automation โ€” nut.js (mouse, keyboard, OS control)
  • ๐Ÿ–ฅ๏ธ UI Soul โ€” Electron + PixiJS (desktop overlay)

๐Ÿš€ Quick Start

Prerequisites

Installation

  1. Clone the repository

    git clone https://github.com/KirthanNB/AI-Companion.git
    cd AI-Companion
  2. Install Node.js dependencies

    npm install
  3. Install Python dependencies (Required for Agent Mode automation)

    pip install -r requirements.txt
  4. Configure Environment Create a .env file in the root directory (copy .env.example):

    GOOGLE_API_KEY=your_gemini_key
    ELEVEN_API_KEY=your_elevenlabs_key
    ELEVEN_VOICE_ID=your_voice_id
  5. Run the application

    npm start

๐Ÿ“– User Manual

๐ŸŽฎ Controls

Icon Name Description
๐ŸŽค Mic Click to speak to the AI.
๐Ÿค– Agent Mode Toggle autonomous mode for complex tasks.
๐Ÿ›‘ Stop Emergency stop for any active automation.

๐Ÿ—ฃ๏ธ Voice Commands

  • "Create a portfolio website" -> Generates a project folder and opens VS Code.
  • "What is on my screen?" -> Analyzes the current window content.
  • "Open YouTube and search for lofi beats" -> Automates the browser.
  • "Type a python script to calculate fibonacci" -> Tyupes code into your active editor.

๐Ÿ› ๏ธ Development

Project Structure

ai-companion/
โ”œโ”€โ”€ src/
โ”‚   โ”œโ”€โ”€ ai/                 # AI logic & GameAgent
โ”‚   โ”œโ”€โ”€ services/           # Automation & helper services
โ”‚   โ”œโ”€โ”€ renderer.js         # Frontend logic (PixiJS)
โ”‚   โ””โ”€โ”€ main.js             # Electron main process
โ”œโ”€โ”€ assets/                 # Images & sounds
โ””โ”€โ”€ package.json            # Dependencies & scripts

Building for Production

To create an installer for your OS:

# Windows
npm run build:win

๐Ÿค Contributing

We welcome contributions! Please see CONTRIBUTING.md for details on how to get started.

๐Ÿ“„ License

This project is licensed under the MIT License.


Made with โค๏ธ by Kirthan NB & Rohith M

About

Meet Glitch: The JARVIS you dreamt of. A multimodal agent that doesn't just chat; it acts. Sees your screen, automates your OS. #AgenticAI

Resources

License

Contributing

Stars

Watchers

Forks

Packages

 
 
 

Contributors