 ---
 license: apache-2.0
+library_name: transformers
+pipeline_tag: video-text-to-text
 ---
+This repository contains the GRPO-CARE model, presented in the paper [GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning](https://huggingface.co/papers/2506.16141).
 Code released at [GRPO-CARE](https://github.com/TencentARC/GRPO-CARE).
 ## Citation