Skip to content
View JosephPai's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report JosephPai

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. showlab/Show-o showlab/Show-o Public

    [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

    Python 1.8k 81

  2. showlab/Awesome-MLLM-Hallucination showlab/Awesome-MLLM-Hallucination Public

    📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

    944 39

  3. Awesome-Talking-Face Awesome-Talking-Face Public

    📖 A curated list of resources dedicated to talking face.

    1.5k 120

  4. showlab/VideoLISA showlab/VideoLISA Public

    [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos

    Python 143 4

  5. showlab/FQGAN showlab/FQGAN Public

    FQGAN: Factorized Visual Tokenization and Generation

    Python 57 3

  6. showlab/Awesome-Unified-Multimodal-Models showlab/Awesome-Unified-Multimodal-Models Public

    📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

    775 41