DeepTutor-1.0.0-beta.1
🚀 DeepTutor v1.0.0-beta1 Release Notes
Release Date: 2026.04.04
We're thrilled to announce DeepTutor v1.0.0-beta1, the first beta of the brand-new DeepTutor architecture. This is a ground-up rewrite that transforms DeepTutor from a monolithic RAG tutor into an agent-native learning platform with a two-layer plugin model (Tools + Capabilities), three unified entry points (CLI / WebSocket / Python SDK), and a completely rebuilt web application shell. Discover more surprises on your own!
⚠️ Beta Notice: This is beta 1 of v1.0.0. The core architecture is stable, but some UI interactions and edge-case workflows may still contain bugs. We appreciate your patience and welcome bug reports via Issues.
📌 Knowledge Base Note: In this release, the RAG pipeline has been simplified to LlamaIndex only. LightRAG and RAG-Anything pipelines along with their related knowledge base content have been temporarily removed to focus on stability. They will be re-introduced in upcoming releases.
Tip
Call for Feedback: We are aware that some of the old features are not included in the newest version. If you have any helpful comments, encounter any bugs, or have any feature requests, please open an issue! PRs are also welcome; see our Contributing Guide.
Diff Scope: main...dev (903 files changed, 92,701 insertions, 73,749 deletions)
Quick Summary
- Architecture: Complete rewrite from `src/` to `deeptutor/` + `deeptutor_cli/` with agent-native runtime (Tools + Capabilities).
- Entry Points: Three unified entry points (standalone CLI `deeptutor`, WebSocket API `/api/v1/ws`, and Python SDK facade).
- Capabilities: Five built-in capabilities (`chat`, `deep_solve`, `deep_question`, `deep_research`, `math_animator`).
- Tools: Seven LLM-callable tools (`rag`, `web_search`, `code_execution`, `reason`, `brainstorm`, `paper_search`, `geogebra_analysis`).
- Web App: Rebuilt Next.js app with workspace/utility route groups, new Playground, Co-Writer, Agents, and Guide pages.
- TutorBot: Multi-channel bot agent supporting 12 messaging platforms.
- Infra: SQLite-backed session persistence, turn runtime, provider-level LLM traffic control and telemetry.
✨ Highlights
🏗️ Agent-Native Runtime (Tools + Capabilities)
Introduced a two-layer plugin model that decouples tool execution from high-level agent workflows:
- Core Contracts: `ToolProtocol`, `CapabilityProtocol`, `UnifiedContext`, `StreamEvent`, and `StreamBus`, the foundation of all runtime execution.
- ChatOrchestrator: Central coordinator with two registries:
  - `ToolRegistry`: tool discovery, OpenAI-style schema export, and execution.
  - `CapabilityRegistry`: capability routing, manifest management, and stage-aware streaming.
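To make the registry idea concrete, here is a minimal sketch of a tool registry that supports discovery, OpenAI-style schema export, and execution. The class names `SketchTool` and `SketchToolRegistry`, their fields, and the toy `reason` handler are all hypothetical illustrations; they are not DeepTutor's actual `ToolProtocol` or `ToolRegistry` API.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List


@dataclass
class SketchTool:
    """Hypothetical stand-in for a tool; not DeepTutor's real ToolProtocol."""
    name: str
    description: str
    parameters: Dict[str, Any]      # JSON Schema describing the arguments
    handler: Callable[..., Any]     # callable invoked when the tool runs


@dataclass
class SketchToolRegistry:
    """Hypothetical registry covering discovery, schema export, execution."""
    _tools: Dict[str, SketchTool] = field(default_factory=dict)

    def register(self, tool: SketchTool) -> None:
        if tool.name in self._tools:
            raise ValueError(f"duplicate tool: {tool.name}")
        self._tools[tool.name] = tool

    def export_schemas(self) -> List[Dict[str, Any]]:
        # One OpenAI-style function-calling entry per registered tool.
        return [
            {
                "type": "function",
                "function": {
                    "name": t.name,
                    "description": t.description,
                    "parameters": t.parameters,
                },
            }
            for t in self._tools.values()
        ]

    def execute(self, name: str, **kwargs: Any) -> Any:
        if name not in self._tools:
            raise KeyError(f"unknown tool: {name}")
        return self._tools[name].handler(**kwargs)


registry = SketchToolRegistry()
registry.register(
    SketchTool(
        name="reason",
        description="Toy placeholder that echoes the question.",
        parameters={
            "type": "object",
            "properties": {"question": {"type": "string"}},
            "required": ["question"],
        },
        handler=lambda question: f"thinking about: {question}",
    )
)
```

Keeping the schema next to the handler means the list handed to the LLM and the set of executable tools cannot drift apart, which is one plausible reason to centralize both in a single registry.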
🖥️ Unified Entry Points: CLI / WebSocket / Python SDK
Three entry points share a single ChatOrchestrator runtime:
| Entry Point | Description |
|---|---|
| CLI (`deeptutor`) | Typer-based CLI with sub-commands: `run`, `chat`, `bot`, `kb`, `memory`, `session`, `notebook`, `plugin`, `config`, `provider`, `serve` |
| WebSocket (`/api/v1/ws`) | Unified endpoint with turn lifecycle: `start_turn`, `subscribe_turn`, `subscribe_session`, `resume_from`, `cancel_turn` |
| Python SDK (`deeptutor.app.facade`) | Programmatic facade for SDK-style integrations |
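The turn lifecycle above can be pictured as a small set of typed JSON messages sent over the WebSocket. The sketch below only builds and validates such messages; the message type names and the endpoint path come from the table, while the envelope shape (`type` plus free-form payload fields such as `capability` and `content`) is an assumption for illustration and may differ from the real wire format.

```python
import json
from typing import Any

WS_PATH = "/api/v1/ws"  # endpoint path as listed in the release notes

# Message types named in the release notes.
TURN_MESSAGE_TYPES = frozenset(
    {"start_turn", "subscribe_turn", "subscribe_session", "resume_from", "cancel_turn"}
)


def build_message(msg_type: str, **payload: Any) -> str:
    """Serialize one lifecycle message.

    The {"type": ..., **payload} envelope is hypothetical; consult the
    actual API schema before relying on any field name used here.
    """
    if msg_type not in TURN_MESSAGE_TYPES:
        raise ValueError(f"unknown message type: {msg_type}")
    return json.dumps({"type": msg_type, **payload}, sort_keys=True)


# Hypothetical payload fields, for illustration only.
start = build_message("start_turn", capability="deep_solve", content="Solve x^2 = 4")
cancel = build_message("cancel_turn", turn_id="example-turn")
```

A client would send `start` after connecting to `WS_PATH`, then read streamed events until the turn finishes or it sends `cancel`.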
🧠 Capability Layer
Five built-in capabilities, each a multi-step agent pipeline:
| Capability | Stages | Description |
|---|---|---|
| `chat` | responding | Default tool-augmented conversation |
| `deep_solve` | planning → reasoning → writing | Multi-stage problem solving |
| `deep_question` | ideation → evaluation → generation → validation | Intelligent question generation with follow-up mode |
| `deep_research` | search → analyze → synthesize → report | Multi-agent research with report generation |
| `math_animator` | analysis → design → codegen → review → render | Manim-based math concept video generation |
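As a rough illustration of a multi-stage capability with stage-aware streaming, the sketch below runs the three `deep_solve` stage names from the table in order and yields an event at the start and end of each stage. `SketchEvent`, `run_stages`, and the toy stage functions are hypothetical; the real `StreamEvent` and capability implementations are not described in these notes.

```python
from dataclasses import dataclass
from typing import Callable, Iterator, List, Tuple


@dataclass(frozen=True)
class SketchEvent:
    """Hypothetical event shape; not DeepTutor's real StreamEvent."""
    capability: str
    stage: str
    kind: str          # "stage_start" or "stage_end"
    content: str = ""


Stage = Tuple[str, Callable[[str], str]]


def run_stages(capability: str, stages: List[Stage], user_input: str) -> Iterator[SketchEvent]:
    """Run stages in order, feeding each stage's output into the next."""
    data = user_input
    for stage_name, step in stages:
        yield SketchEvent(capability, stage_name, "stage_start")
        data = step(data)
        yield SketchEvent(capability, stage_name, "stage_end", data)


# Toy stage functions standing in for LLM calls.
DEEP_SOLVE_STAGES: List[Stage] = [
    ("planning", lambda q: f"plan({q})"),
    ("reasoning", lambda p: f"reason({p})"),
    ("writing", lambda r: f"answer({r})"),
]

events = list(run_stages("deep_solve", DEEP_SOLVE_STAGES, "x^2=4"))
```

Because every event carries its stage name, a UI can show which stage is active without parsing the content itself.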
🔧 Tooling System
Seven unified LLM-callable tools with bilingual prompt hints (en/zh):
| Tool | Description |
|---|---|
| `rag` | Knowledge base retrieval via LlamaIndex |
| `web_search` | 10 search providers: Tavily, Exa, Jina, Serper, Perplexity, Brave, Baidu, SearXNG, DuckDuckGo, OpenRouter |
| `code_execution` | Sandboxed Python execution with AST-based safety guards |
| `reason` | Dedicated deep-reasoning LLM call |
| `brainstorm` | Breadth-first idea exploration with structured rationale |
| `paper_search` | arXiv academic paper search |
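The AST-based safety guard mentioned for `code_execution` can be approximated with Python's standard `ast` module: parse the submitted source, walk the tree, and reject imports or calls that are not permitted. The allowlist, the blocked-call set, and the function name `find_violations` below are assumptions for illustration, not the project's actual configuration or API.

```python
import ast
from typing import FrozenSet, List

# Assumed example policy; the real allowlist is configurable and not shown here.
ALLOWED_IMPORTS: FrozenSet[str] = frozenset({"math", "statistics", "itertools"})
BLOCKED_CALLS: FrozenSet[str] = frozenset({"eval", "exec", "open", "compile", "__import__"})


def find_violations(
    source: str,
    allowed_imports: FrozenSet[str] = ALLOWED_IMPORTS,
    blocked_calls: FrozenSet[str] = BLOCKED_CALLS,
) -> List[str]:
    """Return human-readable policy violations found in the source."""
    violations: List[str] = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            for alias in node.names:
                # Compare the top-level package, so "os.path" is treated as "os".
                if alias.name.split(".")[0] not in allowed_imports:
                    violations.append(f"import {alias.name}")
        elif isinstance(node, ast.ImportFrom):
            root = (node.module or "").split(".")[0]
            # Relative imports (level > 0) are rejected outright.
            if node.level > 0 or root not in allowed_imports:
                violations.append(f"from {node.module or '.'} import ...")
        elif isinstance(node, ast.Call) and isinstance(node.func, ast.Name):
            if node.func.id in blocked_calls:
                violations.append(f"call {node.func.id}()")
    return violations
```

A static check like this only catches direct names; attribute tricks such as `getattr(builtins, "eval")` pass through it, so it is best treated as one layer in front of a real sandbox rather than a sandbox by itself.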
🤖 TutorBot — Multi-Channel Bot Agent
New autonomous bot system (deeptutor/tutorbot/) that brings DeepTutor to messaging platforms:
- 12 Channels: Telegram, Discord, Slack, WeChat Work (WeCom), Feishu, DingTalk, WhatsApp, Matrix, QQ, Email, MoChat
- Agent Loop: Tool-augmented LLM loop with memory, subagent spawning, and team collaboration
- Built-in Tools: Shell, filesystem, web, MCP, cron, and message tools
- Background Services: Heartbeat health checks and cron-based scheduled tasks
🌐 Web Application Restructure
Complete rebuild of the Next.js frontend with new route groups:
Workspace Routes (`(workspace)/`):
| Page | Description |
|---|---|
| Home (`/`) | Main chat interface with tool-augmented conversation |
| Guide (`/guide`) | Interactive learning guide with session history, progress tracking, and completion summaries |
| Playground (`/playground`) | Unified deep capability UI (`deep_solve`, `deep_question`, `deep_research`, `math_animator`) |
| Co-Writer (`/co-writer`) | AI-assisted collaborative writing with edit and narrator agents |
| Agents (`/agents`) | TutorBot management: create, configure, and chat with custom bots |
Utility Routes (`(utility)/`):
| Page | Description |
|---|---|
| Knowledge (`/knowledge`) | Knowledge base management with LlamaIndex pipeline |
| Memory (`/memory`) | User memory and preference management |
| Settings (`/settings`) | Unified configuration for LLM, Embedding, TTS, and Search services |
🏭 Service Infrastructure Rebuild
Refactored services into clearer domains:
```
deeptutor/services/
├── config/       # Environment store, model catalog, provider runtime
├── llm/          # Multi-provider LLM: factory, registry, traffic control, telemetry
├── embedding/    # Adapter-based: OpenAI-compatible, Cohere, Jina, Ollama
├── rag/          # LlamaIndex pipeline with component-based architecture
├── search/       # 10 web search providers with result consolidation
├── session/      # SQLite store, turn runtime, context builder
├── memory/       # User memory persistence
├── notebook/     # Notebook management
├── prompt/       # Bilingual prompt template manager (en/zh)
├── settings/     # Interface settings
├── setup/        # Application initialization
├── tutorbot/     # TutorBot management
└── path_service  # Centralized data path resolution
```
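For readers curious what an SQLite-backed session store might look like, here is a minimal sketch using the standard `sqlite3` module. The `SketchSessionStore` class, the `turns` table, and its columns are assumptions made for illustration; the actual schema under `session/` is not described in these notes.

```python
import sqlite3
from typing import List, Tuple


class SketchSessionStore:
    """Hypothetical store; table and column names are illustrative only."""

    def __init__(self, path: str = ":memory:") -> None:
        self._conn = sqlite3.connect(path)
        self._conn.execute(
            "CREATE TABLE IF NOT EXISTS turns ("
            " id INTEGER PRIMARY KEY AUTOINCREMENT,"
            " session_id TEXT NOT NULL,"
            " role TEXT NOT NULL,"
            " content TEXT NOT NULL)"
        )

    def append(self, session_id: str, role: str, content: str) -> int:
        # The connection context manager commits on success, rolls back on error.
        with self._conn:
            cur = self._conn.execute(
                "INSERT INTO turns (session_id, role, content) VALUES (?, ?, ?)",
                (session_id, role, content),
            )
        return cur.lastrowid

    def history(self, session_id: str) -> List[Tuple[str, str]]:
        # Ordering by the autoincrement id preserves insertion order.
        rows = self._conn.execute(
            "SELECT role, content FROM turns WHERE session_id = ? ORDER BY id",
            (session_id,),
        )
        return [(role, content) for role, content in rows]


store = SketchSessionStore()
store.append("s1", "user", "What is a derivative?")
store.append("s1", "assistant", "The instantaneous rate of change.")
store.append("s2", "user", "Unrelated session")
```

Passing a file path instead of `":memory:"` would persist the history across restarts, which is the property a resumable turn runtime needs.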
🔒 Security & Stability
- Code Execution Safety: AST-based import/call guards with configurable allowlists.
- LLM Traffic Control: Provider-level circuit breaker, error rate tracking, and retry mechanisms.
- Startup Validation: Capability-to-tool consistency checks at boot time.
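The provider-level circuit breaker in the list above follows a well-known pattern: after a run of consecutive failures, stop sending requests to the provider for a cooldown period, then allow a trial request. The sketch below is a generic version of that pattern; the class name, thresholds, and injectable clock are assumptions and do not reflect DeepTutor's actual implementation or defaults.

```python
import time
from typing import Callable, Optional


class SketchCircuitBreaker:
    """Generic consecutive-failure circuit breaker (illustrative only)."""

    def __init__(
        self,
        failure_threshold: int = 3,
        cooldown_seconds: float = 30.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.failure_threshold = failure_threshold
        self.cooldown_seconds = cooldown_seconds
        self._clock = clock
        self._failures = 0
        self._opened_at: Optional[float] = None

    def allow_request(self) -> bool:
        if self._opened_at is None:
            return True  # closed: traffic flows normally
        # Open: block until the cooldown elapses, then allow a trial request.
        return self._clock() - self._opened_at >= self.cooldown_seconds

    def record_success(self) -> None:
        # Any success closes the breaker and clears the failure streak.
        self._failures = 0
        self._opened_at = None

    def record_failure(self) -> None:
        self._failures += 1
        if self._failures >= self.failure_threshold:
            # Open (or re-open) the breaker and restart the cooldown.
            self._opened_at = self._clock()
```

A caller would check `allow_request()` before each provider call and report the outcome with `record_success()` or `record_failure()`; injecting the clock keeps the cooldown logic testable without sleeping.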
🧪 Test Coverage
53+ new test files across all major layers: runtime (tool/capability registry, orchestrator), services (LLM provider/factory/routing/telemetry, RAG pipeline, embedding, search, session, memory, notebook, config), agents (chat, solve, question, math_animator), API (knowledge, memory, solve, WebSocket turn runtime), CLI, and tools (code executor safety).
⚠️ Breaking Changes
- Package layout: `src/` → `deeptutor/` + `deeptutor_cli/`. Old `src/` directory fully removed (140 files).
- Package renamed: `ai-tutor` → `deeptutor`, version `1.0.0`.
- Runtime model: Capability-native orchestration. `chat` is the default; deep modes are selected explicitly via the `run` command or WebSocket.
- Web routes: All pages reorganized under `(workspace)/` and `(utility)/`. Legacy pages (`/solver`, `/question`, `/research`, `/ideagen`, `/notebook`, `/history`) removed.
- RAG pipeline: Only LlamaIndex available. LightRAG and RAG-Anything temporarily removed.
- Data layout: Runtime data centered under `data/user/workspace/...`.
- Dependencies: Split into layered requirements: `cli.txt`, `server.txt`, `dev.txt`, `math-animator.txt`, `tutorbot.txt`.
📦 What's Changed
- Complete codebase rewrite with agent-native architecture (DeepTutor 2.0).
- Two-layer plugin model (Tools + Capabilities) with `ChatOrchestrator` coordinator.
- Standalone CLI package (`deeptutor_cli/`) with 11 sub-commands via Typer.
- Unified WebSocket endpoint with turn lifecycle and session streaming.
- 5 built-in capabilities and 7 LLM-callable tools with bilingual prompt hints.
- TutorBot multi-channel bot agent with 12 platform integrations.
- Rebuilt web app with workspace/utility route groups and new Playground, Co-Writer, Agents, and Guide pages.
- Service infrastructure rebuild: LLM provider registry, embedding adapters, SQLite session store, memory, notebook, and search consolidation.
- AST-based code execution safety, LLM traffic control, and provider telemetry.
- 53+ test files across runtime, services, agents, API, CLI, and tools.
- Updated Docker configuration and layered dependency management.
- fix:CoWriterEditor scroll sync by @Frant1cc in #175
- Fix/i18n improvement by @Frant1cc in #176
- Feature/llm hardening core slim (#52) by @scrrlt in #183
- docs: update ru translate by @oshliaer in #184
- feat: Add OpenRouter search provider by @infstellar in #194
- Restrict code execution scope and enforce import whitelist by @RinZ27 in #196
- feat/vision_slover by @kms9 in #191
- Let's meet DeepTutor 1.0.0! by @pancacake in #238
New Contributors
- @Frant1cc made their first contribution in #175
- @infstellar made their first contribution in #194
- @kms9 made their first contribution in #191
Full Changelog: v0.6.0...v1.0.0-beta.1