Collections of ICLR 2026 paper: "OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models"
Zekun Qi
qizekun
AI & ML interests
Embodied Intelligence, Large Langugae Model, 3D Computer Vision
Recent Activity
upvoted a collection about 8 hours ago
SenseNova-U1 authored a paper 1 day ago
ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing?