arxiv:2603.09877
Zhaokai Wang
wzk1015
AI & ML interests
Computer Vision
Music Generation
Multimodal Large Language Models
Recent Activity
liked
a dataset about 19 hours ago
opencompass/TextEdit authored
a paper
2 days ago
InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing