DUET-VLM: Dual stage Unified Efficient Token reduction for VLM Training and Inference Paper • 2602.18846 • Published 20 days ago • 4
MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling Paper • 2503.13440 • Published Mar 17, 2025 • 2
Training-Free Long-Context Scaling of Large Language Models Paper • 2402.17463 • Published Feb 27, 2024 • 24 • 4
The Mamba in the Llama: Distilling and Accelerating Hybrid Models Paper • 2408.15237 • Published Aug 27, 2024 • 42 • 6
An Empirical Study of Mamba-based Language Models Paper • 2406.07887 • Published Jun 12, 2024 • 1 • 2
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers Paper • 2503.11579 • Published Mar 14, 2025 • 21
Token-Efficient Long Video Understanding for Multimodal LLMs Paper • 2503.04130 • Published Mar 6, 2025 • 96
VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning Paper • 2503.13444 • Published Mar 17, 2025 • 18
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization Paper • 2503.12937 • Published Mar 17, 2025 • 30
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 38 items • Updated 11 days ago • 356