What matters for Representation Alignment: Global Information or Spatial Structure? Paper • 2512.10794 • Published 14 days ago
ReVSeg: Incentivizing the Reasoning Chain for Video Segmentation with Reinforcement Learning Paper • 2512.02835 • Published 23 days ago
Unified Lexical Representation for Interpretable Visual-Language Alignment Paper • 2407.17827 • Published Jul 25, 2024