Hyperwjf commited on
Commit
12e9427
·
verified ·
1 Parent(s): 4bb1bb1

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +25 -0
README.md ADDED
@@ -0,0 +1,25 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ datasets:
3
+ - AntResearchNLP/ViLaSR-data
4
+ language:
5
+ - en
6
+ base_model:
7
+ - Qwen/Qwen2.5-VL-7B-Instruct
8
+ ---
9
+
10
+
11
+ This repository contains the ViLaSR-7B model as presented in [Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing](https://arxiv.org/abs/2506.09965).
12
+
13
+ Please refer to the code https://github.com/AntResearchNLP/ViLaSR.
14
+
15
+ ```
16
+ @misc{wu2025reinforcingspatialreasoningvisionlanguage,
17
+ title={Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing},
18
+ author={Junfei Wu and Jian Guan and Kaituo Feng and Qiang Liu and Shu Wu and Liang Wang and Wei Wu and Tieniu Tan},
19
+ year={2025},
20
+ eprint={2506.09965},
21
+ archivePrefix={arXiv},
22
+ primaryClass={cs.CV},
23
+ url={https://arxiv.org/abs/2506.09965},
24
+ }
25
+ ```