Stars
Official implementation of SIGGRAPH 2025 paper "Image-GS: Content-Adaptive Image Representation via 2D Gaussians"
All-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation.
A conference poster format with structure, content, creation, and presentation recommendations.
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
Evaluation of concept erasing diffusion models should include latent likelihood
Official implementation for "Stable Flow: Vital Layers for Training-Free Image Editing" [CVPR 2025]
✨✨Latest Advances on Multimodal Large Language Models
The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now". This work introduces one fast and e…
PyTorch code and models for the DINOv2 self-supervised learning method.
Emu Series: Generative Multimodal Models from BAAI
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Awesome resources on normalizing flows.
A beautiful, simple, clean, and responsive Jekyll theme for academics
Official Code for Dataset Distillation using Neural Feature Regression (NeurIPS 2022)
Repository dedicated to Fixel Courses (Education)
PyTorch implementation of adversarial attacks [torchattacks]
Library for training machine learning models with privacy for training data
Code and documentation to train Stanford's Alpaca models, and generate the data.
A curated list of awesome papers on dataset distillation and related applications.
Stable Diffusion web UI
Erasing Concepts from Diffusion Models
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
Official implementation of the paper GEFF: Improving Any Clothes-Changing Person ReID Model using Gallery Enrichment with Face Features.
Leveraging Out-of-domain Self-supervision for Multi-modal Video Deepfake Detection
