Skip to content
View DefTruth's full-sized avatar
🎯
#pragma unroll
🎯
#pragma unroll

Organizations

@vipshop @PaddlePaddle @xlite-dev

Block or report DefTruth

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
DefTruth/README.md

Pinned Loading

  1. xlite-dev/LeetCUDA xlite-dev/LeetCUDA Public

    📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA.🎉

    Cuda 5.2k 554

  2. xlite-dev/lite.ai.toolkit xlite-dev/lite.ai.toolkit Public

    🛠 A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉

    C++ 4.1k 746

  3. xlite-dev/Awesome-LLM-Inference xlite-dev/Awesome-LLM-Inference Public

    📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

    Python 4.2k 289

  4. vllm-project/vllm vllm-project/vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 51.2k 8.4k

  5. PaddlePaddle/FastDeploy PaddlePaddle/FastDeploy Public

    High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle

    C++ 3.3k 515

  6. vipshop/cache-dit vipshop/cache-dit Public

    🤗A Training-free and Easy-to-use Cache Acceleration Toolbox for DiTs: DBCache, DBPrune, TaylorSeer, FBCache, etc🔥

    Python 77 3