Qwen/Qwen3-Omni-30B-A3B-Captioner
Any-to-Any • 32B • Updated
• 5.55k • 207
None defined yet.
CodePercept: Code-Grounded Visual STEM Perception for MLLMs
From Narrow to Panoramic Vision: Attention-Guided Cold-Start Reshapes Multimodal Reasoning