Vinkle-hzt/README.md

[GitHub stats card]

  • 🔭 I’m currently working on LLM inference architecture development
  • 🌱 Specializing in multi-GPU parallelism and model acceleration
  • 💻 Proficient in C++, Python, and CUDA programming
  • 🚀 Experienced in optimizing large-scale model inference
  • 🧠 Supporting various model architectures for efficient deployment
  • 💡 Passionate about pushing the boundaries of AI efficiency
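The multi-GPU parallelism mentioned above can be illustrated with a toy sketch of tensor parallelism, where a weight matrix is split column-wise across devices so each computes a partial result in parallel. This is purely illustrative (plain-Python lists standing in for GPU shards, not code from rtp-llm):

```python
def matmul(a, b):
    # Naive dense matmul over nested lists.
    rows, inner, cols = len(a), len(b), len(b[0])
    return [[sum(a[i][k] * b[k][j] for k in range(inner))
             for j in range(cols)] for i in range(rows)]

def split_columns(w, num_devices):
    # Column-wise shard of the weight matrix: one shard per "device".
    chunk = (len(w[0]) + num_devices - 1) // num_devices
    return [[row[d * chunk:(d + 1) * chunk] for row in w]
            for d in range(num_devices)]

def tensor_parallel_matmul(x, w, num_devices=2):
    # Each device multiplies x by its own column shard of w...
    partials = [matmul(x, shard) for shard in split_columns(w, num_devices)]
    # ...then the partial outputs are concatenated column-wise (an all-gather
    # in a real multi-GPU setup).
    return [sum((p[i] for p in partials), []) for i in range(len(x))]

x = [[1, 2], [3, 4]]
w = [[5, 6, 7], [8, 9, 10]]
assert tensor_parallel_matmul(x, w) == matmul(x, w)
```

Splitting columns means no communication is needed until the final gather, which is why large linear layers in LLM inference parallelize this way.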

Pinned

  1. alibaba/rtp-llm — RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications. (CUDA, 1.1k stars, 179 forks)