Skip to content
View zhtsh's full-sized avatar
🎯
Focusing
🎯
Focusing
  • meetsocial
  • shanghai, china

Block or report zhtsh

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

C++ 980 137 Updated Dec 31, 2025

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 2,948 191 Updated Jan 1, 2026

A Foundation Model for Generalist Gaming Agents

Python 1,263 151 Updated Dec 26, 2025

Open-source release accompanying Gao et al. 2025

Python 477 48 Updated Dec 11, 2025

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 2,691 207 Updated Dec 30, 2025

GenAI Agent Framework, the Pydantic way

Python 14,082 1,513 Updated Jan 2, 2026

⚡ TabPFN: Foundation Model for Tabular Data ⚡

Jupyter Notebook 5,418 533 Updated Dec 31, 2025

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1,576 189 Updated Jul 12, 2024

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 2,299 295 Updated May 11, 2025

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,407 286 Updated Jul 17, 2025

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,543 214 Updated Dec 30, 2025

Pokee Deep Research Model Open Source Repo

Python 1,605 889 Updated Oct 22, 2025

Implementation of TabTransformer, attention network for tabular data, in Pytorch

Python 1,049 126 Updated Dec 18, 2025

Post-training with Tinker

Python 2,642 276 Updated Jan 2, 2026

Feature engineering package with sklearn like functionality

Python 2,177 333 Updated Jan 1, 2026

Contexts Optical Compression

Python 21,704 1,950 Updated Oct 25, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,599 8,645 Updated Nov 12, 2025

A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io

Python 1,338 217 Updated Jun 16, 2025

Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.

Jupyter Notebook 1,402 99 Updated Aug 30, 2023

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 12,493 1,685 Updated Apr 7, 2025

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 22,598 2,640 Updated Dec 30, 2025

Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini

Roff 24,653 3,782 Updated Jan 1, 2026
Python 854 45 Updated Sep 15, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,141 675 Updated Nov 20, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,519 2,003 Updated Nov 1, 2025

Next-generation AI Agent Optimization Platform: Cozeloop addresses challenges in AI agent development by providing full-lifecycle management capabilities from development, debugging, and evaluation…

Go 5,203 719 Updated Dec 31, 2025

An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.

TypeScript 19,257 2,732 Updated Dec 30, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 8,913 985 Updated Dec 13, 2025

Pythonic bindings for FFmpeg's libraries.

Python 3,078 417 Updated Dec 18, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 19,388 2,078 Updated Oct 21, 2025
Next