CSDDSFSFSAFSAF commited on
Commit
5c4c4a7
·
verified ·
1 Parent(s): 3e6e470

Add model card

Browse files
Files changed (1) hide show
  1. README.md +40 -0
README.md ADDED
@@ -0,0 +1,40 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ tags:
6
+ - video-language-model
7
+ - long-video-understanding
8
+ - reinforcement-learning
9
+ - self-correction
10
+ - reflection
11
+ - qwen2.5-vl
12
+ ---
13
+
14
+ # Reflect-R1
15
+
16
+ Model checkpoints for **Reflect-R1: Evidence-Driven Reflection for Self-Correction in Long Video Understanding**.
17
+
18
+ - Paper: https://arxiv.org/abs/2606.27922
19
+ - Code: https://github.com/ShuimuChen-hyq/Reflect-R1
20
+ - Data: https://huggingface.co/datasets/CSDDSFSFSAFSAF/Reflect-R1-data
21
+
22
+ ## Checkpoints
23
+
24
+ ```text
25
+ Reflect-R1-SFT-6000/ Cold-start SFT checkpoint.
26
+ Reflect-R1-GRPO-Final/ Final SD-GRPO checkpoint.
27
+ ```
28
+
29
+ Both checkpoints are based on Qwen2.5-VL-7B and include sharded `safetensors` weights together with the corresponding tokenizer and processor configuration files.
30
+
31
+ ## Citation
32
+
33
+ ```bibtex
34
+ @article{chen2026reflectr1,
35
+ title = {Reflect-R1: Evidence-Driven Reflection for Self-Correction in Long Video Understanding},
36
+ author = {Shuimu Chen and Yuteng Chen and Yuanshen Guan and Zebang Cheng and Zeyu Zhang and Shengqian Qin and Bin Xia and Jiaran Li and Wenming Yang and Fei Ma},
37
+ journal = {arXiv preprint arXiv:2606.27922},
38
+ year = {2026}
39
+ }
40
+ ```