We build systems that build themselves. An indie studio running experiments in the open — and sharing the patterns that build them.
It's one person and a fleet of agents shipping the work. We build games, simulators, and specialized agents on top of infrastructure we run ourselves, from the hardware up. The patterns that fall out of building them (orchestration, voice-as-code, release automation) get open-sourced as we go, for anyone who wants to read them and build their own.
Infrastructure we ship as patterns to study and steal, not products you wait on us to improve.
release-tools— release-notes generator and Discord embed as one GitHub Action. Runs on every repo here.rabbit-hole.io— research agent that turns a topic and a budget into a knowledge graph instead of a pile of links.protoCLI— an AI agent that lives in your terminal.protoBanana— chat-native image generation and editing, served as an OpenAI-compatible provider on top of ComfyUI.protoWorkstacean— a pluggable pubsub bus for agentic action.
We run 35B-class open models on a pair of prosumer GPUs, and the builds that make that possible — FP8 quants and speculative-decode heads — go up for anyone to pull. huggingface.co/protoLabsAI.
Ornith-1.0-9B-MTP-GGUF— a 9B with a multi-token-prediction head we trained for it; the base shipped without one. KL-divergence against the target distribution gets ~76% token acceptance, for a lossless +49–57% single-stream speedup (75 → 121 tok/s). Packaged for llama.cpp and Ollama — the build most people run; the raw MTP head is up too. MIT.Ornith-1.0-35B-FP8— the 35B in native FP8 E4M3, half the VRAM at full speed.- Our most-pulled build so far is an FP8 quant of a Qwen3.6-35B MoE fine-tune — 66 GB down to 34 GB, 180 → 226 tok/s, 13k+ downloads.
The findings behind them get written up at protolabs.studio: CUDA graphs run 37–470% faster on Blackwell, INT4 holds quality on dense models but corrupts MoE expert routing (so MoE stays BF16), and NCCL_P2P_DISABLE=1 clears TP=2 corruption on PCIe Blackwell for a 9.3× jump on the 35B MoE.
protoAgent is the adaptable engine: a lean, A2A-native agent on LangGraph that ships a small core and grows at runtime instead of by forking. Capability arrives three ways — plugins install from a git URL (tools, subagents, MCP servers, console views), skills load on demand as the model needs them, and delegate agents route work out over A2A, the OpenAI-compatible API, or ACP, so it can spawn other coding agents (protoCLI, Claude Code) as subprocesses. Run one agent or orchestrate a fleet; drive it from the console, the API, or A2A. The boring parts — A2A spec handling, cost and extension emission, tracing, the release pipeline — stay stable across every agent, so forking is close to a rewrite of SOUL.md and a couple of files, not inheriting a pile.
The substrate the experiments sit on. Most of it stays private; the parts worth sharing surface as the repos above.
- protoLab — LLM eval, training, and inference on 2× RTX PRO 6000 Blackwell (192 GB). LiteLLM gateway, Langfuse, vLLM.
- mythxengine-sdk — deterministic multi-agent worlds in Rust. Genre-agnostic; games plug in as packs.
- homelab-iac — the self-hosted Proxmox cluster it all runs on, behind Caddy, Cloudflare, and Tailscale.
- ORBIS — a voice-first AI companion: an orb that talks, remembers you, and routes the heavy lifting to your agents. Live.
- MythXEngine — a TTRPG world-and-story generator on mythxengine-sdk. Generate a world, then play it.
- PilotProtocol — mech-vs-mech agents fighting on a hex grid, with a click-through replay viewer that shows each model's reasoning a move at a time.
idea → decision record → task graph → agent picks it up → worktree
isolation → implementation → antagonistic review → CI → PR → merge
Every transition is a gate. Work that fails one goes back a step or escalates to a stronger model, and nothing merges that didn't pass the same checks a person would.
Rough aggregates across the protoLabsAI org, from GitHub (June 2026):
| Repositories | 119 (56 public, open to fork) |
| Merged pull requests | 9,440 |
| Stack | TypeScript · Python · Rust |
| Operator contributions, last 12 months | ~27,000 |
One person and a fleet of agents. We keep the numbers visible because a claim without one is just marketing.
- protolabs.studio — home and blog
- hello@protolabs.consulting — consulting: pattern transfer, not retainer work
- Hugging Face — open model weights: FP8 quants and MTP heads
- @protoLabsAI — X
- Substack — long-form
MIT licensed. No subscriptions, no microtransactions, no gated tiers — pay-what-you-want and donations only. The infrastructure ships as patterns to study and steal. Bring your own hacking.
