Improve model card for GRPO-CARE
#1
by
nielsr
HF Staff
- opened
README.md
CHANGED
|
@@ -1,7 +1,11 @@
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
|
|
|
|
|
|
| 3 |
---
|
| 4 |
|
|
|
|
|
|
|
| 5 |
Code released at [GRPO-CARE](https://github.com/TencentARC/GRPO-CARE).
|
| 6 |
|
| 7 |
## Citation
|
|
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
+
library_name: transformers
|
| 4 |
+
pipeline_tag: video-text-to-text
|
| 5 |
---
|
| 6 |
|
| 7 |
+
This repository contains the GRPO-CARE model, presented in the paper [GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning](https://huggingface.co/papers/2506.16141).
|
| 8 |
+
|
| 9 |
Code released at [GRPO-CARE](https://github.com/TencentARC/GRPO-CARE).
|
| 10 |
|
| 11 |
## Citation
|