Kon is a minimal coding agent with a tiny harness: about 215 tokens for the system prompt and around 600 tokens for tool definitions – so under 1k tokens before conversation context.
At the time of writing this README (25 Feb 2026), this repo has 112 files and is easy to understand in a weekend. Here’s a rough file-count comparison against a couple of popular OSS coding agents:
Others are of course more mature, support more models, include broader test coverage, and cover more surfaces. But if you want a truly minimal coding agent with batteries included – something you can understand, fork, and extend quickly – Kon might be interesting.
```
$ fd . | cut -d/ -f1 | sort | uniq -c | sort -rn
4107 opencode
 740 pi-mono
 108 kon
```

Kon is named after Kon from Bleach, an artificial soul.
Requires Python 3.12+ and uv.
```
uv tool install kon-coding-agent
```

This installs `kon` globally as a CLI tool.
```
git clone https://github.com/kuutsav/kon
cd kon
uv tool install .
```

> **Warning**
> Platform support: macOS and Linux are supported; Windows is not tested yet.
```
kon
```

CLI options:
```
usage: kon [-h] [--model MODEL]
           [--provider {github-copilot,openai,openai-codex,openai-responses,zhipu}]
           [--api-key API_KEY] [--base-url BASE_URL] [--continue]
           [--resume RESUME_SESSION]

Kon TUI

options:
  -h, --help            show this help message and exit
  --model, -m MODEL     Model to use
  --provider, -p {github-copilot,openai,openai-codex,openai-responses,zhipu}
                        Provider to use
  --api-key, -k API_KEY
                        API key
  --base-url, -u BASE_URL
                        Base URL for API
  --continue, -c        Resume the most recent session
  --resume, -r RESUME_SESSION
                        Resume a specific session by ID (full or unique prefix)
```
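The unique-prefix resolution that `--resume` accepts can be sketched in a few lines. This is an illustration, not Kon's actual implementation; the function name and session IDs are made up:

```python
def resolve_session(prefix: str, session_ids: list[str]) -> str:
    """Return the single session ID that `prefix` identifies (full ID or unique prefix)."""
    matches = [sid for sid in session_ids if sid.startswith(prefix)]
    if len(matches) != 1:
        raise ValueError(f"{prefix!r} matches {len(matches)} sessions, need exactly 1")
    return matches[0]

sessions = ["a1b2c3d4", "a1f9e8d7", "9c0d1e2f"]
print(resolve_session("a1b", sessions))  # unique prefix -> a1b2c3d4
print(resolve_session("9", sessions))    # -> 9c0d1e2f
```

An ambiguous prefix such as `"a1"` matches two sessions and is rejected.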
- fd – required for fast file discovery; Kon auto-downloads it only if it's missing.
- ripgrep (`rg`) – required for fast content search; Kon auto-downloads it only if it's missing.
- eza (optional) – supports `.gitignore`-aware listings and usually emits fewer tokens than `ls`.
- GitHub Copilot OAuth: run `/login` and choose GitHub Copilot.
- OpenAI OAuth (Codex): run `/login` and choose OpenAI. Kon supports the callback flow plus a manual-paste fallback.
- OpenAI-compatible providers (for example ZhiPu): set an API key via environment variable (`OPENAI_API_KEY` or `ZAI_API_KEY`).
| Tool | Purpose |
|---|---|
| `read` | Read file contents (pagination for large files, image support) |
| `edit` | Surgical find-and-replace edits |
| `write` | Create or overwrite files |
| `bash` | Execute shell commands |
| `grep` | Search file contents with regex |
| `find` | Find files by glob pattern |
Type `/` at the start of input to see available commands.
| Command | Description |
|---|---|
| `/new` | Start a new conversation and reload project context/skills |
| `/resume` | Browse and restore a saved session |
| `/model` | Switch model via interactive picker |
| `/session` | Show session metadata and token stats |
| `/compact` | Compact the current conversation immediately |
| `/export` | Export current session to HTML |
| `/copy` | Copy last assistant response to clipboard |
| `/login` | Authenticate with a provider |
| `/logout` | Log out from a provider |
| `/clear` | Clear current conversation |
| `/help` | Show commands and keybindings |
| `/quit` (`/exit`, `/q`) | Quit Kon |
Type `@` + query to fuzzy-search files/folders in the current project and insert paths into your prompt.
Press `Tab` in the input box to complete paths (`~`, `./`, `../`, absolute paths, quoted paths, etc.).
If the agent is currently running, you can still submit more prompts. Kon queues them and runs them in order once the current task finishes (up to 5 queued prompts).
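The queueing behavior described above can be sketched as a bounded FIFO. This is a toy model, not Kon's internals; only the limit of 5 comes from the text:

```python
from collections import deque

MAX_QUEUED = 5
queue: deque[str] = deque()

def submit(prompt: str) -> bool:
    """Queue a prompt while the agent is busy; reject prompts beyond the limit."""
    if len(queue) >= MAX_QUEUED:
        return False  # queue full, prompt rejected
    queue.append(prompt)
    return True

for i in range(7):
    submit(f"prompt {i}")

print(len(queue))        # 5 -- prompts 5 and 6 were rejected
print(queue.popleft())   # runs in FIFO order: "prompt 0"
```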
Sessions are append-only JSONL files under `~/.kon/sessions/`.
- `/resume` to reopen past sessions
- `/session` for message/token stats
- `/export` for standalone HTML transcripts
- `--continue` / `-c` to continue the most recent session from the CLI
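Because sessions are plain JSONL, you can inspect them with a few lines of Python. The record fields below (`role`, `content`) are assumptions for illustration, not a documented schema:

```python
import json
import pathlib
import tempfile

# Build a tiny stand-in session file (real ones live under ~/.kon/sessions/).
session = pathlib.Path(tempfile.mkdtemp()) / "example.jsonl"
session.write_text(
    '{"role": "user", "content": "fix the bug"}\n'
    '{"role": "assistant", "content": "done"}\n'
)

# One JSON object per line, so parsing is line-by-line.
records = [json.loads(line) for line in session.read_text().splitlines()]
by_role: dict[str, int] = {}
for rec in records:
    by_role[rec["role"]] = by_role.get(rec["role"], 0) + 1
print(by_role)  # {'user': 1, 'assistant': 1}
```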
Kon loads project guidelines from `AGENTS.md` (or `CLAUDE.md`) files into the system prompt:
- Global: `~/.kon/AGENTS.md`
- Ancestor directories from the git root (or home) down to the current working directory
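The ancestor-directory lookup could look roughly like this. It is a sketch under stated assumptions (Kon's actual discovery order and git-root detection may differ; the priority of `AGENTS.md` over `CLAUDE.md` within a directory is a guess):

```python
import pathlib
import tempfile

def collect_agents_files(root: pathlib.Path, cwd: pathlib.Path) -> list[pathlib.Path]:
    """Guideline files from `root` down to `cwd`, outermost first."""
    dirs = []
    p = cwd
    while True:  # walk up from cwd to root, then reverse
        dirs.append(p)
        if p == root:
            break
        p = p.parent
    dirs.reverse()
    found = []
    for d in dirs:
        for name in ("AGENTS.md", "CLAUDE.md"):
            f = d / name
            if f.exists():
                found.append(f)
                break  # in this sketch, AGENTS.md shadows CLAUDE.md per directory
    return found

root = pathlib.Path(tempfile.mkdtemp())
sub = root / "src"
sub.mkdir()
(root / "AGENTS.md").write_text("repo-wide rules")
(sub / "CLAUDE.md").write_text("src-specific rules")
print([f.name for f in collect_agents_files(root, sub)])  # ['AGENTS.md', 'CLAUDE.md']
```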
Skills are reusable instruction packs loaded from:
- Project: `.kon/skills/`
- Global: `~/.kon/skills/`
Each skill has a `SKILL.md` file with front matter:

```markdown
---
name: my-skill
description: Brief description of what this skill does
---

# My Skill

Detailed instructions for the agent...
```

For skills with scripts, see the Agent Skills Documentation.
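Front matter of this shape can be split off with a small helper. This is a sketch that only handles flat `key: value` lines; Kon's actual parser may support more of YAML:

```python
def parse_front_matter(text: str) -> tuple[dict, str]:
    """Split '---'-delimited front matter from the body; values stay plain strings."""
    parts = text.split("---\n", 2)
    if len(parts) < 3 or parts[0].strip():
        return {}, text  # no front matter present
    meta = {}
    for line in parts[1].splitlines():
        if ":" in line:
            key, _, value = line.partition(":")
            meta[key.strip()] = value.strip()
    return meta, parts[2]

skill = """---
name: my-skill
description: Brief description of what this skill does
---
# My Skill
Detailed instructions for the agent...
"""
meta, body = parse_front_matter(skill)
print(meta["name"])  # my-skill
```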
Some features you might expect in other coding agents are not part of Kon's design philosophy:
- MCP servers – Use skills instead; they're simpler and give you full control
- Sandbox environments – Kon runs directly on your machine for simplicity; use Docker or VMs if you need isolation
- Checkpoint restores – Not currently supported; may be added in the future
```
LLM Provider
    │
    │  StreamPart (TextPart, ThinkPart, ToolCallStart, ToolCallDelta, ...)
    ▼
Single Turn (turn.py)
    │
    │  StreamEvent (ThinkingStart/Delta/End, TextStart/Delta/End, ToolStart/End, ToolResult, ...)
    ▼
Agentic Loop (loop.py)
    │
    │  Event (AgentStart, TurnStart, TurnEnd, AgentEnd + all StreamEvents)
    ▼
UI (app.py)
```
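The layering in the diagram can be sketched as generators wrapping generators, each layer translating or passing through the events below it. The event names loosely follow the diagram, but this is a toy model, not Kon's code:

```python
from typing import Iterator

Event = tuple[str, str]

def provider_stream() -> Iterator[Event]:
    """Raw StreamParts from the LLM provider (hardcoded for illustration)."""
    yield ("TextPart", "Hello")
    yield ("ToolCallStart", "bash")

def single_turn(parts: Iterator[Event]) -> Iterator[Event]:
    """Translate provider StreamParts into per-turn StreamEvents."""
    for kind, payload in parts:
        if kind == "TextPart":
            yield ("TextDelta", payload)
        elif kind == "ToolCallStart":
            yield ("ToolStart", payload)

def agentic_loop(events: Iterator[Event]) -> Iterator[Event]:
    """Wrap a turn's StreamEvents with loop-level events for the UI."""
    yield ("TurnStart", "")
    yield from events  # StreamEvents pass through unchanged
    yield ("TurnEnd", "")

for event in agentic_loop(single_turn(provider_stream())):
    print(event)
```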
Kon works well with local models exposed through an OpenAI-compatible `/v1` API.
To run a local model using llama-server:
```
./llama-server -m <models-dir>/GLM-4.7-Flash-GGUF/GLM-4.7-Flash-Q4_K_M.gguf \
    -n 8192 \
    -c 64000

# Then use Kon with:
kon --model zai-org/glm-4.7-flash \
    --provider openai \
    --base-url http://localhost:8080/v1 \
    --api-key ""
```

GLM-4.7-Flash-Q4 ran at 80–90 tps on my i7-14700F × 28, 64 GB RAM, 24 GB VRAM (RTX 3090).
| Model (local=*) | Provider | Thinking | Vision |
|---|---|---|---|
| *`zai-org/glm-4.7-flash` | OpenAI Completions | Yes | No |
| *`qwen/qwen3-coder-next` | OpenAI Completions | Yes | No |
| `glm-4.7` | ZhiPu (OpenAI Completions) | Yes | No |
| `glm-5` | ZhiPu (OpenAI Completions) | Yes | No |
| `claude-sonnet-4.5` | GitHub Copilot | Yes | Yes |
| `claude-opus-4.5` | GitHub Copilot | Yes | Yes |
| `claude-sonnet-4.6` | GitHub Copilot | Yes | Yes |
| `claude-opus-4.6` | GitHub Copilot | Yes | Yes |
| `gpt-5.3-codex` | GitHub Copilot | Yes | Yes |
| `gpt-5.3-codex` | OpenAI Codex Responses | Yes | Yes |
Config lives at `~/.kon/config.toml` (auto-created on first run).
Most important knobs:
- `llm.default_provider`
- `llm.default_model`
- `llm.default_thinking_level`
- `llm.system_prompt` (you can fully override Kon's system prompt here)
- `llm.tool_call_idle_timeout_seconds` (fallback timeout for stalled tool-call streaming)
- `compaction.on_overflow`, `compaction.buffer_tokens`
- `agent.max_turns`, `agent.default_context_window`
You can also theme the UI via `[ui.colors]` values.
Example:
```toml
[llm]
default_provider = "openai-codex"
default_model = "gpt-5.3-codex"
default_thinking_level = "high"
tool_call_idle_timeout_seconds = 60
system_prompt = """Your custom system prompt here"""

[compaction]
on_overflow = "continue"
buffer_tokens = 20000
```

For hacking on Kon locally:
```
uv sync
uv run kon
uv run ruff format .
uv run pytest
```

- Kon takes significant inspiration from the `pi-mono` coding agent, especially in terms of the overall philosophy and UI design.
- Why not just use pi? Pi is no longer a small project, and I want to be in complete control of my coding agent.
- I mostly agree with Mario (author of pi), but I have different beliefs on some matters - for example, subagents (especially useful for context gathering in larger repos when paired with semantic search tools).
- Over time, I also want to give more preference to local LLMs I can run. `glm-4.7-flash` and `qwen-3-coder-next` look promising, so I may make decisions that do not necessarily optimize for SOTA paid models.
- Kon also borrows ideas from Amp, Claude Code, and other coding agents.
MIT