MLLM-CL: Continual Learning for Multimodal Large Language Models Paper β’ 2506.05453 β’ Published Jun 5, 2025 β’ 4
MMEVOKE (ICLR26π₯) Collection MMEVOKE introduces the first comprehensive benchmark and systematic evaluation framework designed to investigate multimodal evolving knowledge injecti β’ 4 items β’ Updated Feb 5 β’ 1
iFSQ: Improving FSQ for Image Generation with 1 Line of Code Paper β’ 2601.17124 β’ Published Jan 23 β’ 33
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head Paper β’ 2601.07832 β’ Published Jan 12 β’ 52
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning Paper β’ 2601.06943 β’ Published Jan 11 β’ 215
AT^2PO: Agentic Turn-based Policy Optimization via Tree Search Paper β’ 2601.04767 β’ Published Jan 8 β’ 28
kailinjiang/llava_1.5_13b_covariance_matrices_from_onevision_pre_64_seed_rank233_new222 Updated Dec 30, 2025
kailinjiang/llava_1.5_13b_covariance_matrices_from_onevision_pre_64_seed_rank233_new Updated Dec 30, 2025
TongSIM: A General Platform for Simulating Intelligent Machines Paper β’ 2512.20206 β’ Published Dec 23, 2025 β’ 28
KORE Collection KORE uses knowledge-oriented control as its pivot to synergistically optimize the balance between knowledge adaptation and retention at different stag β’ 30 items β’ Updated Jan 27 β’ 1