Can World Simulators Reason? Gen-ViRe: A Generative Visual Reasoning Benchmark Paper • 2511.13853 • Published Nov 17, 2025 • 34
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models Paper • 2410.10818 • Published Oct 14, 2024 • 16
Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning Paper • 2410.00255 • Published Sep 30, 2024 • 5