From 2D Grids to 1D Tokens: Reforming Shared Representations for Multimodal Image Fusion Paper • 2606.12303 • Published 3 days ago • 12
VIA-SD: Verification via Intra-Model Routing for Speculative Decoding Paper • 2606.12243 • Published 3 days ago • 14
CogFlow: Bridging Perception and Reasoning through Knowledge Internalization for Visual Mathematical Problem Solving Paper • 2601.01874 • Published Jan 5 • 19
Clear Nights Ahead: Towards Multi-Weather Nighttime Image Restoration Paper • 2505.16479 • Published May 22, 2025 • 11
MC-Bench: A Benchmark for Multi-Context Visual Grounding in the Era of MLLMs Paper • 2410.12332 • Published Oct 16, 2024 • 2
MathFlow: Enhancing the Perceptual Flow of MLLMs for Visual Mathematical Problems Paper • 2503.16549 • Published Mar 19, 2025 • 15