Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment
Yuhao Dong PRO
THUdyh
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 9 hours ago
VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents
liked
a dataset
5 days ago
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b
upvoted
a
paper
13 days ago
BabyVision: Visual Reasoning Beyond Language