🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement.
Shawn
csfufu
AI & ML interests
None yet
Recent Activity
authored
a paper
about 2 months ago
ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level
Entropy Shaping
upvoted
a
paper
about 2 months ago
ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level
Entropy Shaping
upvoted
a
collection
about 2 months ago
ARES