JJJYmmm/README.md

👋 Hi, I’m JJJYmmm

🧠 Focus: Building Multimodal LLMs — turning pixels into words 🤖

🎓 In Pursuit:
M.S. @ VIPL, ICT, CAS | B.Eng. @ CSE, HUST

Pinned

  1. QwenLM/Qwen3-VL Public

    Qwen3-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.

    Jupyter Notebook · 16.8k stars · 1.4k forks

  2. Multimodal-RoPEs Public

    Official implementation of the paper "Revisiting Multimodal Positional Encoding in Vision–Language Models"

    Python · 34 stars · 1 fork

  3. huggingface/transformers Public

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    Python · 153k stars · 31.3k forks