File size: 436 Bytes
b62de01 03bc4f1 b62de01 7887e4a |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 |
---
license: apache-2.0
pipeline_tag: video-text-to-text
library_name: transformers
datasets:
- Video-R1/Video-R1-data
language:
- en
metrics:
- accuracy
base_model:
- Qwen/Qwen2.5-VL-7B-Instruct
---
This repository contains the Video-R1-7B model as presented in [Video-R1: Reinforcing Video Reasoning in MLLMs](https://arxiv.org/pdf/2503.21776).
For training and inference, please refer to Code: https://github.com/tulerfeng/Video-R1 |