Q-Zoom: Query-Aware Adaptive Perception for Efficient Multimodal Large Language Models Paper • 2604.06912 • Published 2 days ago
Q-Zoom: Query-Aware Adaptive Perception for Efficient Multimodal Large Language Models Paper • 2604.06912 • Published 2 days ago
Catching the Details: Self-Distilled RoI Predictors for Fine-Grained MLLM Perception Paper • 2509.16944 • Published Sep 21, 2025 • 3