Collections of ICLR 2026 paper: "OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models"
Zekun Qi
qizekun
AI & ML interests
Embodied Intelligence, Large Langugae Model, 3D Computer Vision
Recent Activity
upvoted a collection 1 day ago
SenseNova-U1 authored a paper 2 days ago
ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing?