Add metadata and link to paper/code (#1)
Commit: 1b7b4babd4f66b040209b06089994e56576cdd78
Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>
README.md CHANGED

````diff
@@ -1,5 +1,13 @@
 ---
 license: apache-2.0
+library_name: transformers
+pipeline_tag: image-text-to-text
+tags:
+- science
+- physics
+- vision-language
+- reasoning
+- olympiad
 ---
 
 <div align="center">
@@ -7,12 +15,12 @@ license: apache-2.0
 </div>
 
 <p align="center">
-<a href="https://
-<a href="https://
+<a href="https://arxiv.org/abs/2602.09443"><b>📄 Paper</b></a> |
+<a href="https://github.com/PRIME-RL/P1-VL"><b>💻 Code</b></a> |
+<a href="https://prime-rl.github.io/P1-VL/"><b>🌐 Project Page</b></a> |
+<a href="https://phyarena.github.io/"><b>🏆 Leaderboard</b></a>
 </p>
 
-
-
 <p align="center">
 <img src="hipho.png" style="width: 800px" align=center>
 </p>
@@ -23,7 +31,7 @@ license: apache-2.0
 
 ## Model Description
 
-**P1-VL-30B-A3B** is the mid-size variant of the P1-VL series, a high-performance open-source vision-language model specialized in physics reasoning.
+**P1-VL-30B-A3B** is the mid-size variant of the P1-VL series, a high-performance open-source vision-language model specialized in physics reasoning. Introduced in [P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads](https://huggingface.co/papers/2602.09443), it is built on *Qwen3-VL-30B-A3B-Thinking* and refined through multi-stage reinforcement learning on curated physics competition data. P1-VL-30B-A3B achieves impressive results while maintaining reasonable computational requirements, making it accessible for researchers working with physics problems that require visual understanding.
 
 ### Key Highlights
 
@@ -155,8 +163,8 @@ We are grateful to the open-source community for their invaluable contributions.
 ```bibtex
 @misc{p1vl2025,
 title={P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads},
-author={
-year={
-url={https://
+author={Yun Luo and Futing Wang and Qianjia Cheng and Fangchen Yu and Haodi Lei and Jianhao Yan and Chenxi Li and Jiacheng Chen and Yufeng Zhao and Haiyuan Wan and Yuchen Zhang and Shenghe Zheng and Junchi Yao and Qingyang Zhang and Haonan He and Wenxuan Zeng and Li Sheng and Chengxing Xie and Yuxin Zuo and Yizhuo Li and Yulun Wu and Rui Huang and Dongzhan Zhou and Kai Chen and Yu Qiao and Lei Bai and Yu Cheng and Ning Ding and Bowen Zhou and Peng Ye and Ganqu Cui},
+year={2026},
+url={https://arxiv.org/abs/2602.09443}
 }
-```
+```
````