Add pipeline tag and library name, link to project page
#1
by
nielsr HF Staff - opened
README.md
CHANGED
|
@@ -1,6 +1,9 @@
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
|
|
|
|
|
|
| 3 |
---
|
|
|
|
| 4 |
# VGR: Visual Grounded Reasoning
|
| 5 |
|
| 6 |
## Overview
|
|
@@ -9,6 +12,7 @@ VGR (Visual Grounded Reasoning) is a novel multimodal large language model (MLLM
|
|
| 9 |
|
| 10 |
- [Arxiv Paper Link](https://arxiv.org/pdf/2506.11991)
|
| 11 |
- [Data Repository](https://huggingface.co/datasets/BytedanceDouyinContent/VGR)
|
|
|
|
| 12 |
|
| 13 |
## Key Features
|
| 14 |
|
|
@@ -33,9 +37,9 @@ Data have been make **Public Avaliable**: checkout at [VGR-SFT](https://huggingf
|
|
| 33 |
|
| 34 |
```bibtex
|
| 35 |
@article{wang2025vgr,
|
| 36 |
-
title={VGR: Visual Grounded Reasoning},
|
| 37 |
author={Jiacong Wang and Zijian Kang and Haochen Wang and Haiyong Jiang and Jiawen Li and Bohong Wu and Ya Wang and Jiao Ran and Xiao Liang and Chao Feng and Jun Xiao},
|
| 38 |
journal={arXiv preprint arXiv:2506.11991},
|
| 39 |
year={2025}
|
| 40 |
}
|
| 41 |
-
```
|
|
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
+
pipeline_tag: image-text-to-text
|
| 4 |
+
library_name: transformers
|
| 5 |
---
|
| 6 |
+
|
| 7 |
# VGR: Visual Grounded Reasoning
|
| 8 |
|
| 9 |
## Overview
|
|
|
|
| 12 |
|
| 13 |
- [Arxiv Paper Link](https://arxiv.org/pdf/2506.11991)
|
| 14 |
- [Data Repository](https://huggingface.co/datasets/BytedanceDouyinContent/VGR)
|
| 15 |
+
- [Project Page](https://huggingface.co/BytedanceDouyinContent/VGR)
|
| 16 |
|
| 17 |
## Key Features
|
| 18 |
|
|
|
|
| 37 |
|
| 38 |
```bibtex
|
| 39 |
@article{wang2025vgr,
|
| 40 |
+
title={VGR: Visual Grounded Reasoning},
|
| 41 |
author={Jiacong Wang and Zijian Kang and Haochen Wang and Haiyong Jiang and Jiawen Li and Bohong Wu and Ya Wang and Jiao Ran and Xiao Liang and Chao Feng and Jun Xiao},
|
| 42 |
journal={arXiv preprint arXiv:2506.11991},
|
| 43 |
year={2025}
|
| 44 |
}
|
| 45 |
+
```
|