M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models • Paper • 2405.15638 • Published May 24, 2024
BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation • Paper • 2506.07530 • Published Jun 9, 2025