Update README.md

#1
by chiyum609 - opened
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -10,7 +10,7 @@ base_model:
10
 
11
  # FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization
12
 
13
- 🏠 [Homepage](https://qwen-pilot.notion.site/fipo) | πŸ“ [Paper PDF](https://github.com/qwenpilot/FIPO/blob/main/assets/FIPO_Eliciting_Deep_Reasoning_with_Future_KL_Influenced_Policy_Optimization.pdf) | πŸ€— [Hugging Face](https://huggingface.co/QwenPilot/FIPO_32B) | πŸ€– [ModelScope](https://modelscope.cn/models/chiyum609/FIPO_32B) | 🐱 [GitHub](https://github.com/qwenpilot/FIPO)
14
 
15
  **Qwen Pilot, Alibaba Group | Published on March 20, 2026**
16
 
 
10
 
11
  # FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization
12
 
13
+ 🏠 [Homepage](https://qwen-pilot.notion.site/fipo) | πŸ“ [Paper PDF](https://arxiv.org/abs/2603.19835) | πŸ€— [Hugging Face](https://huggingface.co/QwenPilot/FIPO_32B) | πŸ€– [ModelScope](https://modelscope.cn/models/chiyum609/FIPO_32B) | 🐱 [GitHub](https://github.com/qwenpilot/FIPO)
14
 
15
  **Qwen Pilot, Alibaba Group | Published on March 20, 2026**
16