Same or Not? Enhancing Visual Perception in Vision-Language Models • Paper • 2512.23592 • Published 29 days ago • 1
No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers • Paper • 2512.08889 • Published Dec 9, 2025 • 1
TWIN • Collection • Datasets and models from the paper "Same or Not? Enhancing Visual Perception in Vision-Language Models" • 5 items • Updated about 9 hours ago • 2
VALOR • Collection • Models from the paper "No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers" • 3 items • Updated about 9 hours ago • 1
Aligning Text, Images, and 3D Structure Token-by-Token • Paper • 2506.08002 • Published Jun 9, 2025 • 21