Model Card for VALOR-GroundingDINO

This is the verified-tuned GroundingDINO model from the paper: No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers

For further information please refer to the project webpage, paper, and repository.

Citation

If you use VALOR in your research, please consider citing our work:

BibTeX:

@misc{marsili2025labelsproblemtrainingvisual,
      title={No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers}, 
      author={Damiano Marsili and Georgia Gkioxari},
      year={2025},
      eprint={2512.08889},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2512.08889}, 
}

Downloads last month: -; Downloads are not tracked for this model. How to track

Model tree for glab-caltech/VALOR-GroundingDINO

Base model

ShilongLiu/GroundingDINO

Finetuned

(1)

this model

Datasets used to train glab-caltech/VALOR-GroundingDINO

Collection including glab-caltech/VALOR-GroundingDINO

VALOR

Collection

[ICLR 2026] Models from the paper "No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers" • 3 items • Updated Feb 22 • 1

Paper for glab-caltech/VALOR-GroundingDINO

No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers

Paper • 2512.08889 • Published Dec 9, 2025 • 1