Robobench: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models as Embodied Brain Paper • 2510.17801 • Published Oct 20, 2025 • 2
WoW: Towards a World omniscient World model Through Embodied Interaction Paper • 2509.22642 • Published Sep 26, 2025 • 15
VisionZip: Longer is Better but Not Necessary in Vision Language Models Paper • 2412.04467 • Published Dec 5, 2024 • 117