MLLM-CL: Continual Learning for Multimodal Large Language Models Paper • 2506.05453 • Published Jun 5, 2025 • 3
Local-Prompt: Extensible Local Prompts for Few-Shot Out-of-Distribution Detection Paper • 2409.04796 • Published Sep 7, 2024 • 1
ModalPrompt: Towards Efficient Multimodal Continual Instruction Tuning with Dual-Modality Guided Prompt Paper • 2410.05849 • Published Oct 8, 2024 • 1
HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model Paper • 2503.12941 • Published Mar 17, 2025 • 1
MambaIC: State Space Models for High-Performance Learned Image Compression Paper • 2503.12461 • Published Mar 16, 2025 • 2
Urban Socio-Semantic Segmentation with Vision-Language Reasoning Paper • 2601.10477 • Published 26 days ago • 155
Token Reduction Should Go Beyond Efficiency in Generative Models -- From Vision, Language to Multimodality Paper • 2505.18227 • Published May 23, 2025 • 15
VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression? Paper • 2512.15649 • Published Dec 17, 2025 • 7
On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models Paper • 2512.07783 • Published Dec 8, 2025 • 38
MCITlib: Multimodal Continual Instruction Tuning Library and Benchmark Paper • 2508.07307 • Published Aug 10, 2025 • 1
Parameter Efficient Merging for Multimodal Large Language Models with Complementary Parameter Adaptation Paper • 2502.17159 • Published Feb 24, 2025 • 2