The AI that lives on your desktop – it sees, thinks, and acts.

Glitch breaks the fourth wall of the operating system.
AI Desktop Companion (Glitch) is a fully multimodal, autonomous desktop agent that:
- 👁️ Sees your screen
- 🎤 Talks with you
- 🤖 Controls your system
- 🖥️ Lives directly on your desktop as a playful character
- …and a lot more (I want you to explore 😉)
This isn't just an assistant you use.
It's one you work with.
https://vimeo.com/1150677379
The Vimeo demo shows Glitch executing real tasks end-to-end.
(Also check out our landing page here!!)
Glitch runs as a transparent, click-through desktop overlay.
He shares your workspace instead of hiding in a window or sidebar.
- Interactive pixel-style characters
- Drag, click, and interact
- Customizable appearance and behavior
Inspired by classic desktop pets, powered by modern multimodal AI.
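The transparent, click-through behaviour above corresponds to a handful of Electron `BrowserWindow` options. Here is a minimal sketch of that configuration; the option names come from Electron's API, but the concrete values are assumptions, not necessarily what Glitch ships with:

```javascript
// Sketch of an overlay window configuration (illustrative values,
// not the project's actual settings in main.js).
function overlayOptions() {
  return {
    transparent: true, // no window background: only the sprite is drawn
    frame: false,      // no OS title bar or borders
    alwaysOnTop: true, // the character floats above your workspace
  };
}

// In the Electron main process you would then do something like:
//   const win = new BrowserWindow(overlayOptions());
//   win.setIgnoreMouseEvents(true, { forward: true }); // click-through
console.log(overlayOptions().transparent); // → true
```

`setIgnoreMouseEvents(true, { forward: true })` is what makes the window click-through while still letting the renderer see hover events, so the character can react when you point at it.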
Everything is built in:
- 🎤 Voice Mode
- 👁️ Vision Mode
- 🤖 Agent Mode
- ⚙️ Settings (character & voice customization)
No switching apps. No broken context.
This is not just another chatbot.
Agent Mode lets Glitch:
- Control mouse & keyboard
- Open applications
- Execute multi-step workflows
- Do real things on your system
There's always a stop button. Safety matters.
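A stop button implies the agent loop checks a cancellation flag between actions. A hypothetical sketch of that pattern (the names and step shape are illustrative, not the repo's actual code):

```javascript
// Run a list of async steps, bailing out as soon as the stop flag is set.
async function runAgent(steps, state) {
  for (const step of steps) {
    if (state.stopped) {   // the Stop button handler flips this flag
      return "stopped";
    }
    await step();          // e.g. move the mouse, type, open an app
  }
  return "done";
}

// Usage: the second step simulates the user hitting Stop mid-run.
const state = { stopped: false };
const log = [];
runAgent(
  [async () => log.push("open app"),
   async () => { state.stopped = true; },
   async () => log.push("never runs")],
  state
).then((result) => console.log(result, log)); // logs: stopped [ 'open app' ]
```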
Glitch is especially useful while building.
Here, it creates a complete Next.js project structure from a single voice command, turning ideas into runnable code instantly.
Glitch can:
- Summarize information
- Extract key points
- Save them directly to Notepad or files
Your AI remembers for you.
Ask once, and Glitch searches Google, parses results, and gives you the useful bits.
Hands-free.
Glitch isn't robotic.
He has personality.
He reacts.
He feels present.
Working with AI finally feels alive, not transactional.
Glitch uses a hybrid multimodal agent architecture:
- 🧠 Brain – Google Gemini 2.0 Flash (chat + vision)
- 👁️ Vision – Screen understanding via screenshots
- 🎤 Voice – ElevenLabs (low-latency TTS)
- 🤖 Automation – nut.js (mouse, keyboard, OS control)
- 🖥️ UI Soul – Electron + PixiJS (desktop overlay)
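One way the brain and automation layers can meet is a small dispatcher: the model emits structured actions, and each action name maps to a handler that would, in the real app, drive nut.js. The action vocabulary below is an assumption for illustration, not Glitch's actual protocol:

```javascript
// Stub handlers: in the real app these would call nut.js
// (mouse.move, keyboard.type, etc.); here they just describe the action.
const handlers = {
  click: ({ x, y }) => `click at ${x},${y}`,
  type:  ({ text }) => `type "${text}"`,
  open:  ({ app })  => `open ${app}`,
};

// Route one structured action from the model to its handler.
function dispatch(action) {
  const handler = handlers[action.name];
  if (!handler) throw new Error(`unknown action: ${action.name}`);
  return handler(action.args);
}

console.log(dispatch({ name: "open", args: { app: "notepad" } })); // → "open notepad"
```

Keeping the vocabulary small and rejecting unknown actions is also a natural place to enforce the safety boundary mentioned above.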
To run Glitch you'll need:
- Node.js (v16 or higher)
- Python (v3.8 or higher) – required for mouse/keyboard automation
- A Google Gemini API Key (Get it here)
- An ElevenLabs API Key (Get it here)
1. Clone the repository:

   ```bash
   git clone https://github.com/KirthanNB/AI-Companion.git
   cd AI-Companion
   ```

2. Install Node.js dependencies:

   ```bash
   npm install
   ```

3. Install Python dependencies (required for Agent Mode automation):

   ```bash
   pip install -r requirements.txt
   ```

4. Configure the environment: create a `.env` file in the root directory (copy `.env.example`):

   ```
   GOOGLE_API_KEY=your_gemini_key
   ELEVEN_API_KEY=your_elevenlabs_key
   ELEVEN_VOICE_ID=your_voice_id
   ```

5. Run the application:

   ```bash
   npm start
   ```
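If one of those keys is missing, it helps to fail fast at startup instead of debugging a silent failure later. An optional sketch (not code from the repo) that checks the three keys the `.env` file defines:

```javascript
// Required keys, as listed in .env.example.
const REQUIRED_KEYS = ["GOOGLE_API_KEY", "ELEVEN_API_KEY", "ELEVEN_VOICE_ID"];

// Return the names of required keys that are absent or empty.
function missingKeys(env) {
  return REQUIRED_KEYS.filter((key) => !env[key]);
}

const missing = missingKeys(process.env);
if (missing.length) {
  console.error(`Missing keys in .env: ${missing.join(", ")}`);
}
```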
| Icon | Name | Description |
|---|---|---|
| 🎤 | Mic | Click to speak to the AI. |
| 🤖 | Agent Mode | Toggle autonomous mode for complex tasks. |
| 🛑 | Stop | Emergency stop for any active automation. |
- "Create a portfolio website" -> Generates a project folder and opens VS Code.
- "What is on my screen?" -> Analyzes the current window content.
- "Open YouTube and search for lofi beats" -> Automates the browser.
- "Type a python script to calculate fibonacci" -> Tyupes code into your active editor.
```
ai-companion/
├── src/
│   ├── ai/           # AI logic & GameAgent
│   ├── services/     # Automation & helper services
│   ├── renderer.js   # Frontend logic (PixiJS)
│   └── main.js       # Electron main process
├── assets/           # Images & sounds
└── package.json      # Dependencies & scripts
```
To create an installer for your OS:
```bash
# Windows
npm run build:win
```

We welcome contributions! Please see CONTRIBUTING.md for details on how to get started.
This project is licensed under the MIT License.
Made with ❤️ by Kirthan NB & Rohith M









