PROGRESSLM: Towards Progress Reasoning in Vision-Language Models Paper • 2601.15224 • Published 8 days ago • 12
PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing Paper • 2512.02589 • Published Dec 2, 2025 • 71
PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling Paper • 2512.04784 • Published Dec 2, 2025 • 25
Embodied Referring Expression Comprehension in Human-Robot Interaction Paper • 2512.06558 • Published Dec 6, 2025 • 4
X-Humanoid: Robotize Human Videos to Generate Humanoid Videos at Scale Paper • 2512.04537 • Published Dec 4, 2025 • 7
From Macro to Micro: Benchmarking Microscopic Spatial Intelligence on Molecules via Vision-Language Models Paper • 2512.10867 • Published Dec 11, 2025 • 16
OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models Paper • 2506.03135 • Published Jun 3, 2025 • 40
ENACT: Evaluating Embodied Cognition with World Modeling of Egocentric Interaction Paper • 2511.20937 • Published Nov 26, 2025 • 16
Cambrian-S: Towards Spatial Supersensing in Video Paper • 2511.04670 • Published Nov 6, 2025 • 38