Skip to content
View dsikka's full-sized avatar
🐕
🐕

Block or report dsikka

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. vllm-project/llm-compressor vllm-project/llm-compressor Public

    Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

    Python 2.5k 346

  2. vllm-project/compressed-tensors vllm-project/compressed-tensors Public

    A safetensors extension to efficiently store sparse quantized tensors on disk

    Python 227 48

  3. vllm-project/speculators vllm-project/speculators Public

    A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

    Python 178 23