What matters for Representation Alignment: Global Information or Spatial Structure? Paper • 2512.10794 • Published 15 days ago • 8
ReVSeg: Incentivizing the Reasoning Chain for Video Segmentation with Reinforcement Learning Paper • 2512.02835 • Published 24 days ago • 9
WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model Paper • 2308.15962 • Published Aug 30, 2023
Unified Lexical Representation for Interpretable Visual-Language Alignment Paper • 2407.17827 • Published Jul 25, 2024 • 1