Update README.md
README.md
CHANGED
@@ -15,7 +15,7 @@ library_name: transformers
 
 **VL-Rethinker-7B** achieves SoTA results on various multimodal reasoning benchmarks.
 
-It is trained using the **GRPO-SSR and Forced Rethinking** techniques, using meticulously curated training
+It is trained using the **GRPO-SSR and Forced Rethinking** techniques, using meticulously curated **training data**: [ViRL39K](https://huggingface.co/datasets/TIGER-Lab/ViRL39K).
 
 For details of our approach and performance comparison, please see our [paper](https://github.com/TIGER-AI-Lab/VL-Rethinker/blob/main/paper.pdf).
 