Which Pretraining Paradigm Better Serves Spatial Intelligence? An Empirical Comparison of Vision-Language and Video Generation Models Paper • 2605.28132 • Published about 1 month ago • 25
Article Improving Object Detection through Reinforcement Learning with VLM-R1 omlab • Mar 25, 2025 • 3
Article Trials, Errors, and Breakthroughs: Our Rocky Road to OVD SOTA with Reinforcement Learning omlab • Mar 25, 2025 • 3
VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model Paper • 2504.07615 • Published Apr 10, 2025 • 36