Starred repositories
Multi-language BPE tokenizer implementation for Qwen3 models. Lightweight byte-pair encoding for C#/.NET
A modular active learning framework for Python
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX …
Define and run multi-container applications with Docker
Warp is an agentic development environment, born out of the terminal.
🚀✨ Help beginners to contribute to open source projects
SGLang is a high-performance serving framework for large language models and multimodal models.
FEMU: Accurate, Scalable and Extensible NVMe SSD Emulator (FAST'18)
OpenROAD's unified application implementing an RTL-to-GDS Flow. Documentation at https://openroad.readthedocs.io/en/latest/
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-re…
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
Unlock your displays on your Mac! Flexible HiDPI scaling, XDR/HDR extra brightness, virtual screens, DDC control, extra dimming, PIP/streaming, EDID override and lots more!
A high-throughput and memory-efficient inference and serving engine for LLMs
A professional cross-platform SSH/Sftp/Shell/Telnet/Tmux/Serial terminal.
vLLM Metal plugin powered by mlx-swift — high-performance LLM inference on Apple Silicon
🕵️♂️ All-in-one OSINT tool for analysing any website
OpenProject is the leading open source project management software.
Create and share 3D architectural projects.
Build your own high performance LLM inference engine in C++ and CUDA - a smaller version of vLLM
🤗 ml-intern: an open-source ML engineer that reads papers, trains models, and ships ML models
