| license: apache-2.0 | |
| library_name: transformers | |
| pipeline_tag: video-text-to-text | |
| This repository contains the GRPO-CARE model, presented in the paper [GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning](https://huggingface.co/papers/2506.16141). | |
| Code released at [GRPO-CARE](https://github.com/TencentARC/GRPO-CARE). | |
| ## Citation | |
| ```bibtex | |
| @misc{chen2025grpocareconsistencyawarereinforcementlearning, | |
| title={GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning}, | |
| author={Yi Chen and Yuying Ge and Rui Wang and Yixiao Ge and Junhao Cheng and Ying Shan and Xihui Liu}, | |
| year={2025}, | |
| eprint={2506.16141}, | |
| archivePrefix={arXiv}, | |
| primaryClass={cs.CV}, | |
| url={https://arxiv.org/abs/2506.16141}, | |
| } | |
| ``` |