protoLabs.studio

We build systems that build themselves. An indie studio running experiments in the open — and sharing the patterns that build them.

It's one person and a fleet of agents shipping the work. We build games, simulators, and specialized agents on top of infrastructure we run ourselves, from the hardware up. The patterns that fall out of building them (orchestration, voice-as-code, release automation) get open-sourced as we go, for anyone who wants to read them and build their own.

Patterns you can take

Infrastructure we ship as patterns to study and steal, not products you wait on us to improve.

release-tools — release-notes generator and Discord embed as one GitHub Action. Runs on every repo here.
rabbit-hole.io — research agent that turns a topic and a budget into a knowledge graph instead of a pile of links.
protoCLI — an AI agent that lives in your terminal.
protoBanana — chat-native image generation and editing, served as an OpenAI-compatible provider on top of ComfyUI.
protoWorkstacean — a pluggable pubsub bus for agentic action.
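
As a rough illustration of the release-tools idea, the Python sketch below turns a list of commit subjects into grouped release notes and wraps them in a Discord webhook payload. The function name, the prefix-based grouping, and the sample commits are assumptions made for this example, not the action's actual implementation. Only the `embeds` payload shape and the 4096-character description cap come from Discord's webhook documentation.

```python
from __future__ import annotations

from collections import defaultdict

# Discord caps an embed description at 4096 characters.
DISCORD_DESCRIPTION_LIMIT = 4096


def build_release_embed(version: str, commit_subjects: list[str]) -> dict:
    """Group commit subjects by their 'kind:' prefix and build a webhook payload.

    Hypothetical helper: subjects without a colon land in an 'other' group.
    """
    groups: dict[str, list[str]] = defaultdict(list)
    for subject in commit_subjects:
        prefix, sep, rest = subject.partition(":")
        if sep:
            groups[prefix.strip().lower()].append(rest.strip())
        else:
            groups["other"].append(subject.strip())

    lines: list[str] = []
    for kind in sorted(groups):
        lines.append(f"**{kind}**")
        lines.extend(f"- {text}" for text in groups[kind])

    description = "\n".join(lines)[:DISCORD_DESCRIPTION_LIMIT]
    return {"embeds": [{"title": f"Release {version}", "description": description}]}


payload = build_release_embed(
    "v1.2.0",
    ["feat: add replay viewer", "fix: handle empty graph", "bump deps"],
)
```

The resulting dictionary can be sent as the JSON body of a POST to a Discord webhook URL; that step is left out so the sketch stays free of network calls.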

Open weights on Hugging Face

We run 35B-class open models on a pair of prosumer GPUs, and the builds that make that possible (FP8 quants and speculative-decode heads) go up for anyone to pull at huggingface.co/protoLabsAI.

Ornith-1.0-9B-MTP-GGUF — a 9B with a multi-token-prediction head we trained for it; the base shipped without one. KL-divergence against the target distribution gets ~76% token acceptance, for a lossless +49–57% single-stream speedup (75 → 121 tok/s). Packaged for llama.cpp and Ollama — the build most people run; the raw MTP head is up too. MIT.
Ornith-1.0-35B-FP8 — the 35B in native FP8 E4M3, half the VRAM at full speed.
Our most-pulled build so far is an FP8 quant of a Qwen3.6-35B MoE fine-tune — 66 GB down to 34 GB, 180 → 226 tok/s, 13k+ downloads.
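
To make the link between token acceptance and speedup concrete, here is a small sketch using the standard speculative-decoding estimate: with acceptance rate α and k drafted tokens per step, the expected number of tokens produced per target-model pass is (1 − α^(k+1)) / (1 − α), assuming each draft token is accepted independently. The draft length of 1 used below is an assumption (the text does not say how many tokens the MTP head drafts), and the result is a ceiling that ignores the cost of running the head, so a measured speedup below it is expected.

```python
def expected_tokens_per_step(acceptance: float, draft_len: int) -> float:
    """Expected tokens emitted per target-model forward pass.

    Assumes each drafted token is accepted independently with the same
    probability, which is a simplification of real acceptance behaviour.
    """
    if not 0.0 <= acceptance <= 1.0:
        raise ValueError("acceptance must be between 0 and 1")
    if draft_len < 0:
        raise ValueError("draft_len must be non-negative")
    if acceptance == 1.0:
        # Every draft token is accepted, plus the one token the target adds.
        return float(draft_len + 1)
    return (1.0 - acceptance ** (draft_len + 1)) / (1.0 - acceptance)


# ~76% acceptance is the figure quoted above; draft_len=1 is an assumption.
ceiling = expected_tokens_per_step(0.76, draft_len=1)
print(f"upper bound on speedup: {ceiling:.2f}x")
```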

The findings behind them get written up at protolabs.studio: CUDA graphs run 37–470% faster on Blackwell, INT4 holds quality on dense models but corrupts MoE expert routing (so MoE stays BF16), and NCCL_P2P_DISABLE=1 clears TP=2 corruption on PCIe Blackwell for a 9.3× jump on the 35B MoE.
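
A minimal sketch of applying the NCCL workaround when launching a tensor-parallel server from Python. `NCCL_P2P_DISABLE` is a documented NCCL environment variable and `--tensor-parallel-size` is a vLLM flag; the helper name and the model ID are placeholders invented for this example. Whether the workaround is needed at all depends on your hardware and driver stack.

```python
from __future__ import annotations

import os


def build_launch(model_id: str, tp_size: int = 2) -> tuple[list[str], dict[str, str]]:
    """Return the command and environment for a TP launch with NCCL P2P disabled.

    Hypothetical helper: it only builds the arguments and does not start anything.
    """
    env = dict(os.environ)
    # Force NCCL to avoid direct GPU-to-GPU peer transfers.
    env["NCCL_P2P_DISABLE"] = "1"
    cmd = ["vllm", "serve", model_id, "--tensor-parallel-size", str(tp_size)]
    return cmd, env


# "your-org/your-35b-moe" is a placeholder, not a real repository name.
cmd, env = build_launch("your-org/your-35b-moe")
print(" ".join(cmd))
```

Passing the pair to `subprocess.run(cmd, env=env, check=True)` would start the server; the variable has to be in the environment before the worker processes initialise NCCL.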

The engine

protoAgent is the adaptable engine: a lean, A2A-native agent on LangGraph that ships a small core and grows at runtime instead of by forking. Capability arrives three ways — plugins install from a git URL (tools, subagents, MCP servers, console views), skills load on demand as the model needs them, and delegate agents route work out over A2A, the OpenAI-compatible API, or ACP, so it can spawn other coding agents (protoCLI, Claude Code) as subprocesses. Run one agent or orchestrate a fleet; drive it from the console, the API, or A2A. The boring parts — A2A spec handling, cost and extension emission, tracing, the release pipeline — stay stable across every agent, so forking is close to a rewrite of SOUL.md and a couple of files, not inheriting a pile.

What runs it

The substrate the experiments sit on. Most of it stays private; the parts worth sharing surface as the repos above.

protoLab — LLM eval, training, and inference on 2× RTX PRO 6000 Blackwell (192 GB). LiteLLM gateway, Langfuse, vLLM.
mythxengine-sdk — deterministic multi-agent worlds in Rust. Genre-agnostic; games plug in as packs.
homelab-iac — the self-hosted Proxmox cluster it all runs on, behind Caddy, Cloudflare, and Tailscale.

What we build with it

ORBIS — a voice-first AI companion: an orb that talks, remembers you, and routes the heavy lifting to your agents. Live.
MythXEngine — a TTRPG world-and-story generator on mythxengine-sdk. Generate a world, then play it.
PilotProtocol — mech-vs-mech agents fighting on a hex grid, with a click-through replay viewer that shows each model's reasoning a move at a time.

How it works

idea → decision record → task graph → agent picks it up → worktree isolation
→ implementation → antagonistic review → CI → PR → merge

Every transition is a gate. Work that fails one goes back a step or escalates to a stronger model, and nothing merges unless it passes the same checks a person's work would face.
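
The gated flow can be sketched as a small state machine. Everything here is illustrative: the stage names are copied from the pipeline above, but the model tiers, the `WorkItem` type, and the choice to try escalation before stepping back are assumptions, since the text does not say which of the two responses comes first.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Callable

STAGES = [
    "idea", "decision record", "task graph", "agent pickup",
    "worktree isolation", "implementation", "antagonistic review",
    "CI", "PR", "merge",
]
MODEL_TIERS = ["default", "stronger"]  # hypothetical tiers


@dataclass
class WorkItem:
    stage: int = 0  # index into STAGES
    tier: int = 0   # index into MODEL_TIERS
    history: list[str] = field(default_factory=list)


# A gate receives the item and the stage it wants to enter, and returns pass/fail.
Gate = Callable[[WorkItem, str], bool]


def run_pipeline(item: WorkItem, gate: Gate, max_steps: int = 50) -> WorkItem:
    """Advance an item through the stages, one gate per transition."""
    for _ in range(max_steps):
        if item.stage == len(STAGES) - 1:
            break  # reached merge
        target = STAGES[item.stage + 1]
        if gate(item, target):
            item.stage += 1
            item.history.append(f"pass:{target}")
        elif item.tier < len(MODEL_TIERS) - 1:
            # Assumed order: escalate to a stronger model first.
            item.tier += 1
            item.history.append(f"escalate:{MODEL_TIERS[item.tier]}")
        else:
            # No stronger model left, so go back one step.
            item.stage = max(item.stage - 1, 0)
            item.history.append(f"back:{STAGES[item.stage]}")
    return item


def ci_needs_stronger_model(item: WorkItem, target: str) -> bool:
    """Toy gate: CI fails until the item has been escalated once."""
    return not (target == "CI" and item.tier == 0)


result = run_pipeline(WorkItem(), ci_needs_stronger_model)
print(STAGES[result.stage], result.history)
```

The `max_steps` bound is there so that a gate which never passes cannot loop forever; a real system would presumably hand such an item to a person instead.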

By the numbers

Rough aggregates across the protoLabsAI org, from GitHub (June 2026):


Repositories                             119 (56 public, open to fork)
Merged pull requests                     9,440
Stack                                    TypeScript · Python · Rust
Operator contributions, last 12 months   ~27,000

One person and a fleet of agents. We keep the numbers visible because a claim without one is just marketing.

Where to find us

protolabs.studio — home and blog
hello@protolabs.consulting — consulting: pattern transfer, not retainer work
Hugging Face — open model weights: FP8 quants and MTP heads
@protoLabsAI — X
Substack — long-form

MIT licensed. No subscriptions, no microtransactions, no gated tiers — pay-what-you-want and donations only. The infrastructure ships as patterns to study and steal. Bring your own hacking.
