Skip to content

SkyworkAI/UniPic

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

98 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Skywork-UniPic

Unified multimodal models for image understanding, generation, and editing


Skywork UniPic2 Teaser

πŸ“ Overview

Welcome to the Skywork-UniPic repository!
This repository hosts the model weights and official implementations of our unified multimodal models, featuring two distinct modeling paradigms:

  • UniPic-1.0 β€” 1.5B parameters, Unified Autoregressive Modeling for joint visual understanding and generation, enabling a single transformer to handle both perception and synthesis tasks.
  • UniPic-2.0 Series β€” SD3.5M-Kontext and MetaQuery variants based on Efficient Architectures with Diffusion Post-Training, delivering state-of-the-art performance in text-to-image generation, fine-grained image editing, and multimodal reasoning.

πŸ”₯ Latest News

Date Update
2025-08-13 Released UniPic-2 β€” Unified Model Weights with Diffusion-based Post-Training
GitHub HuggingFace arXiv
2025-07-30 Released UniPic-1 β€” Autoregressive unified modeling from scratch
GitHub HuggingFace arXiv

✨ Key Features

  • 🎨 Text-to-Image Generation β€” High-fidelity synthesis from natural language prompts.
  • πŸ›  Image Editing β€” Seamless inpainting, outpainting, and object manipulation.
  • πŸ–Ό Image Understanding β€” Robust perception capabilities for various visual tasks.
  • ⚑ Efficient Architecture β€” Optimized for both accuracy and deployability.

πŸ“œ License

This project is licensed under the MIT License β€” see the LICENSE file for details.


Releases

No releases published

Packages

No packages published

Contributors 7

Languages