dmarsili commited on
Commit
b2cc72d
·
verified ·
1 Parent(s): 07cd2e7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -3
README.md CHANGED
@@ -1,3 +1,38 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - lmms-lab/GQA
5
+ - dmarsili/Omni3D-Bench
6
+ - cambridgeltl/vsr_random
7
+ - snowclipsed/TallyQA
8
+ language:
9
+ - en
10
+ base_model:
11
+ - ShilongLiu/GroundingDINO
12
+ pipeline_tag: object-detection
13
+ tags:
14
+ - object-detection
15
+ - computer-vision
16
+ ---
17
+ # Model Card for VALOR-GroundingDINO
18
+
19
+ This is the verified-tuned GroundingDINO model from the paper: [No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers](https://glab-caltech.github.io/valor/)
20
+
21
+ For further information please refer to the [project webpage](https://glab-caltech.github.io/valor/), [paper](https://arxiv.org/abs/2512.08889), and [repository](https://github.com/damianomarsili/VALOR).
22
+
23
+ ## Citation
24
+
25
+ If you use VALOR in your research, please consider citing our work:
26
+
27
+ **BibTeX:**
28
+ ```
29
+ @misc{marsili2025labelsproblemtrainingvisual,
30
+ title={No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers},
31
+ author={Damiano Marsili and Georgia Gkioxari},
32
+ year={2025},
33
+ eprint={2512.08889},
34
+ archivePrefix={arXiv},
35
+ primaryClass={cs.CV},
36
+ url={https://arxiv.org/abs/2512.08889},
37
+ }
38
+ ```