🍉
I may be slow to respond before the due date of ACL.
PhD@CUHK, Research Engineer@Alibaba
- Shatin, N.T., HKSAR
- https://lixin4ever.github.io/
- @lixin4ever
Pinned Loading
-
DAMO-NLP-SG/VideoLLaMA3
DAMO-NLP-SG/VideoLLaMA3 PublicFrontier Multimodal Foundation Models for Image and Video Understanding
-
DAMO-NLP-SG/VideoLLaMA2
DAMO-NLP-SG/VideoLLaMA2 PublicVideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
-
alibaba-damo-academy/RynnVLA-001
alibaba-damo-academy/RynnVLA-001 PublicRynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation
-
alibaba-damo-academy/RynnVLA-002
alibaba-damo-academy/RynnVLA-002 PublicRynnVLA-002: A Unified Vision-Language-Action and World Model
-
alibaba-damo-academy/RynnEC
alibaba-damo-academy/RynnEC PublicRynnEC: Bringing MLLMs into Embodied World
-
alibaba-damo-academy/PixelRefer
alibaba-damo-academy/PixelRefer PublicThe code for PixelRefer & VideoRefer
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.


