Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation Paper • 2603.12793 • Published 4 days ago • 26
ZzzHelloWorld/llava-uhd-qwen3-moonvit-so-400m-4-18-p12-anyres-256-1024-858k 5B • Updated Dec 15, 2025