Add library_name and pipeline_tag

#2
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +16 -10
README.md CHANGED
@@ -1,11 +1,13 @@
1
  ---
2
- license: apache-2.0
 
3
  datasets:
4
  - BytedTsinghua-SIA/DAPO-Math-17k
5
  language:
6
  - en
7
- base_model:
8
- - Qwen/Qwen2.5-32B
 
9
  ---
10
 
11
  # FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization
@@ -70,10 +72,14 @@ The FIPO objective yields longer responses and a stronger AIME 2024 peak than th
70
  ## 🎈 Citation
71
 
72
  ```bibtex
73
- @misc{FIPO,
74
- title = {FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization},
75
- url = {[https://qwen-pilot.notion.site/fipo](https://qwen-pilot.notion.site/fipo)},
76
- author = {Chiyu Ma and Shuo Yang and Kexin Huang and Jinda Lu and Haoming Meng and Shangshang Wang and Bolin Ding and Soroush Vosoughi and Guoyin Wang and Jingren Zhou},
77
- year = {2026},
78
- month = {March},
79
- }
 
 
 
 
 
1
  ---
2
+ base_model:
3
+ - Qwen/Qwen2.5-32B
4
  datasets:
5
  - BytedTsinghua-SIA/DAPO-Math-17k
6
  language:
7
  - en
8
+ license: apache-2.0
9
+ library_name: transformers
10
+ pipeline_tag: text-generation
11
  ---
12
 
13
  # FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization
 
72
  ## 🎈 Citation
73
 
74
  ```bibtex
75
+ @article{ma2026fipo,
76
+ title={FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization},
77
+ author={Ma, Chiyu and Yang, Shuo and Huang, Kexin and Lu, Jinda and Meng, Haoming and Shangshang Wang and Bolin Ding and Soroush Vosoughi and Guoyin Wang and Jingren Zhou},
78
+ journal={arXiv preprint arXiv:2603.19835},
79
+ year={2026}
80
+ }
81
+ ```
82
+
83
+ ## 🌻 Acknowledgement
84
+
85
+ This project builds on top of the [VeRL](https://github.com/volcengine/verl) training framework and follows the practical recipe structure introduced by [DAPO](https://github.com/Bytedance-Research/DAPO).