Stars
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
A Foundation Model for Generalist Gaming Agents
Open-source release accompanying Gao et al. 2025
The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…
GenAI Agent Framework, the Pydantic way
⚡ TabPFN: Foundation Model for Tabular Data ⚡
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages
Pokee Deep Research Model Open Source Repo
Implementation of TabTransformer, attention network for tabular data, in Pytorch
Post-training with Tinker
Feature engineering package with sklearn like functionality
The simplest, fastest repository for training/finetuning medium-sized GPTs.
A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io
Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
Reference PyTorch implementation and models for DINOv3
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Next-generation AI Agent Optimization Platform: Cozeloop addresses challenges in AI agent development by providing full-lifecycle management capabilities from development, debugging, and evaluation…
An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)