LeonOverload
/

PRIMO-R1-7B

Video-Text-to-Text

robotic-manipulation

reinforcement-learning

chain-of-thought

Model card Files Files and versions

LeonOverload commited on Mar 17

Commit

f084cf7

·

verified ·

1 Parent(s): bfbb6e4

Update README.md

Files changed (1) hide show

README.md +18 -1

README.md CHANGED Viewed

@@ -6,4 +6,21 @@ metrics:
 base_model:
 - Qwen/Qwen2.5-VL-7B-Instruct
 pipeline_tag: video-text-to-text
----

 base_model:
 - Qwen/Qwen2.5-VL-7B-Instruct
 pipeline_tag: video-text-to-text
+---
+## Citations
+If you find our work helpful for your research, please consider citing our work.
+```
+@misc{liu2026passiveobserveractivecritic,
+      title={From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation},
+      author={Yibin Liu and Yaxing Lyu and Daqi Gao and Zhixuan Liang and Weiliang Tang and Shilong Mu and Xiaokang Yang and Yao Mu},
+      year={2026},
+      eprint={2603.15600},
+      archivePrefix={arXiv},
+      primaryClass={cs.RO},
+      url={https://arxiv.org/abs/2603.15600},
+}
+```