Skip to content
View LudovicoYIN's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report LudovicoYIN

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
LudovicoYIN/README.md

Hi 👋 I'm Ludovico (Yin Hanke)

🔧 AI Infra / Model Compression Engineer
📍 Chengdu, China
📧 hankeyin@gmail.com


About Me

  • Focused on model compression & efficient inference
    • Quantization · Sparsity · Low-bit tensor storage
    • Compiler- and runtime-level optimization
    • Interest in AI Agent

Pinned Loading

  1. apache/tvm apache/tvm Public

    Open Machine Learning Compiler Framework

    Python 13.2k 3.8k

  2. alibaba/MNN alibaba/MNN Public

    MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

    C++ 14.3k 2.2k

  3. vllm-project/vllm-omni vllm-project/vllm-omni Public

    A framework for efficient model inference with omni-modality models

    Python 2.9k 470

  4. vllm-project/compressed-tensors vllm-project/compressed-tensors Public

    A safetensors extension to efficiently store sparse quantized tensors on disk

    Python 256 61