Skip to content
View lixin4ever's full-sized avatar
🍉
I may be slow to respond before the due date of ACL.
🍉
I may be slow to respond before the due date of ACL.

Organizations

@dmlc @textmine

Block or report lixin4ever

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. DAMO-NLP-SG/VideoLLaMA3 DAMO-NLP-SG/VideoLLaMA3 Public

    Frontier Multimodal Foundation Models for Image and Video Understanding

    Jupyter Notebook 1.1k 76

  2. DAMO-NLP-SG/VideoLLaMA2 DAMO-NLP-SG/VideoLLaMA2 Public

    VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

    Python 1.3k 85

  3. alibaba-damo-academy/RynnVLA-001 alibaba-damo-academy/RynnVLA-001 Public

    RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

    Python 266 17

  4. alibaba-damo-academy/RynnVLA-002 alibaba-damo-academy/RynnVLA-002 Public

    RynnVLA-002: A Unified Vision-Language-Action and World Model

    Python 684 39

  5. alibaba-damo-academy/RynnEC alibaba-damo-academy/RynnEC Public

    RynnEC: Bringing MLLMs into Embodied World

    Jupyter Notebook 381 17

  6. alibaba-damo-academy/PixelRefer alibaba-damo-academy/PixelRefer Public

    The code for PixelRefer & VideoRefer

    Jupyter Notebook 325 18