Kate

Kate is a multimodal live assistant that listens to your user and discuss website contents.

Examples

Kate Playlist: https://www.youtube.com/playlist?list=PLPCnNL6Y2PbTzUxmsFICoQj0rx_PmVnk-

Demo: Find Textbooks to Learn Web Programming

Demo: Search Information about an Accounting Book

Demo: Recommend Textbooks for Project Management

Demo: Browse the Website FAQ

Architecture: How it Works

Key Features

AI-Powered Live Assistant: Kate acts as a helpful AI assistant, providing users with a natural language interface to search for and learn from websites.
Multimodal Interaction: Kate leverages multimodal capabilities of Gemini 2.0, including audio transcription, text generation, and potentially visual elements (talking animation), to create an engaging and interactive user experience.
Real-Time Communication Integration: Built on pipecat framework, Kate seamlessly integrates with real-time communication platforms like Daily.co, allowing users to interact with the assistant within a live meeting or call.
Website Search Tool: Kate uses the Vertex AI Search to accurately search the website and provide users with precise answers to their queries.
Customizable and Extensible: The project is designed with modular components and uses environment variables for configuration, making it adaptable to different environments and use cases. New bots and tools can be created for new use cases.
Pipecat Framework: Utilizes the pipecat framework to handle media and events.

Dependencies

Python 3.12+
invoke
uv
yarn (for client-side dependencies)
Docker (for containerization)
Other dependencies are managed by uv and listed in pyproject.toml.

Installation

Clone the GitHub repository on your computer:

git clone https://github.com/fmind/kate

Install the project and its dependencies:

uv sync --all-groups

Architecture

Data Ingestion

This projects uses Vertex AI Search to ingest website content (either online or offline) and retrieve them in the server code.

Please refer to this documentation to start the ingestion and configure your bot:

https://cloud.google.com/generative-ai-app-builder/docs/enterprise-search-introduction

Configuration

Create an API Key from Daily Website for WebRTC communication: https://dashboard.daily.co/

Create an API key from Google AI Studio for LLM interactions: https://aistudio.google.com/

Configure the environment variables of the web client from client/.env.example:

client/.env.development: configuration for development mode.
client/.env.production: configuration for production mode.

Configure the environment variables of the web server from .env.example:

.env.development: configuration for development mode.
.env.production: configuration for production mode.

Configure the web exposition through ngrok from ngrok.yml:

ngrok.yml: configuration for ngrok.

Development

Start the web server in development mode:

inv runs.server --mode=development

Start the web client in development mode:

inv clients.dev

Production

Start the client and the embedded client:

inv runs.server --mode=production

Expose your application on the internet:

inv apps.expose
# ngrok start --config=ngrok.yml kate

Tasks

The project uses invoke for task management. Here's an overview of the available tasks:

Install tasks:
- inv installs.uv: Install uv packages.
- inv installs.pre-commit: Install pre-commit hooks on git.
- inv installs.all: Run all install tasks.
Server tasks:
- inv servers.run --mode=development: Run the project server in development mode.
- inv servers.run --mode=production: Run the project server in production mode.
- inv servers.all: Run all server tasks.
Spider tasks:
- inv spiders.otl: Scrape Open Textbook Library dataset.
- inv spiders.all: Run all spider tasks.
Format tasks:
- inv formats.imports: Format python imports with ruff.
- inv formats.sources: Format python sources with ruff.
- inv formats.all: Run all format tasks.
Package tasks:
- inv packages.build: Build the python package.
- inv packages.requirements: Generate a requirements.txt.
- inv packages.all: Run all package tasks.
Client tasks:
- inv clients.install: Install the project client.
- inv clients.dev: Start the project client.
- inv clients.build: Build the project client.
- inv clients.lint: Lint the project client.
- inv clients.preview: Preview the project client.
- inv clients.all: Run all client tasks.
Docs tasks:
- inv docs.serve --format=google --port=8088: Serve the API docs with pdoc.
- inv docs.api --format=google --output_dir=docs/: Generate the API docs with pdoc.
- inv docs.all: Run all docs tasks.
Checks tasks:
- inv checks.format: Check the formats with ruff.
- inv checks.type: Check the types with mypy.
- inv checks.code: Check the codes with ruff.
- inv checks.test: Check the tests with pytest.
- inv checks.security: Check the security with bandit.
- inv checks.coverage: Check the coverage with coverage.
- inv checks.all: Run all check tasks.
Container tasks:
- inv containers.compose: Start up docker compose.
- inv containers.build --tag=latest: Build the container image.
- inv containers.run --mode=production --port=8080 --tag=latest: Run the container image.
- inv containers.all: Run all container tasks.
Clean tasks:
- inv cleans.mypy: Clean the mypy tool.
- inv cleans.ruff: Clean the ruff tool.
- inv cleans.pytest: Clean the pytest tool.
- inv cleans.coverage: Clean the coverage tool.
- inv cleans.dist: Clean the dist folder.
- inv cleans.docs: Clean the docs folder.
- inv cleans.cache: Clean the cache folder.
- inv cleans.venv: Clean the venv folder.
- inv cleans.uv: Clean uv lock file.
- inv cleans.python: Clean python caches and bytecodes.
- inv cleans.tools: Run all tools tasks.
- inv cleans.folders: Run all folders tasks.
- inv cleans.sources: Run all sources tasks.
- inv cleans.all: Run all tools and folders tasks.
- inv cleans.reset: Run all tools, folders and sources tasks.
App tasks:
- inv apps.expose: Expose the application on the internet.
- inv apps.all: Run all app tasks.

To execute a task, use the inv command followed by the task name (e.g., inv checks.all).

Contributions

We welcome contributions to enhance this project.

Feel free to open issues or pull requests for any improvements, bug fixes, or feature requests.

License

This project is licensed under the MIT License. See the LICENSE.txt file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.github		.github
client		client
diagrams		diagrams
docs		docs
spiders		spiders
src/kate		src/kate
tasks		tasks
tests		tests
.aiexclude		.aiexclude
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
CHANGELOG.md		CHANGELOG.md
Dockerfile		Dockerfile
LICENSE.txt		LICENSE.txt
README.md		README.md
docker-compose.yml		docker-compose.yml
invoke.yaml		invoke.yaml
kate.code-workspace		kate.code-workspace
ngrok.example.yml		ngrok.example.yml
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Kate

Examples

Demo: Find Textbooks to Learn Web Programming

Demo: Search Information about an Accounting Book

Demo: Recommend Textbooks for Project Management

Demo: Browse the Website FAQ

Architecture: How it Works

Key Features

Dependencies

Installation

Architecture

Data Ingestion

Configuration

Development

Production

Tasks

Contributions

License

About

Uh oh!

Packages

Uh oh!

Uh oh!

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

Kate

Examples

Demo: Find Textbooks to Learn Web Programming

Demo: Search Information about an Accounting Book

Demo: Recommend Textbooks for Project Management

Demo: Browse the Website FAQ

Architecture: How it Works

Key Features

Dependencies

Installation

Architecture

Data Ingestion

Configuration

Development

Production

Tasks

Contributions

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Packages 0

Uh oh!

Uh oh!

Contributors 1

Languages

Packages