Visual Para-Thinker: Divide-and-Conquer Reasoning for Visual Comprehension • Paper • 2602.13310 • Published Feb 10 • 8
TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding • Paper • 2511.16595 • Published Nov 20, 2025 • 10
REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding • Paper • 2511.13026 • Published Nov 17, 2025 • 26