Ki6an 👾

Pinned

  1. fastT5 Public

    ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.

    Python · 588 stars · 75 forks

  2. vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python

  3. huggingface/transformers Public

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    Python · 162k stars · 33.7k forks

  4. triton Public

    Forked from triton-lang/triton

    Development repository for the Triton language and compiler

    C++