Update README.md
#1
by chiyum609 - opened
README.md
CHANGED
|
@@ -10,7 +10,7 @@ base_model:
|
|
| 10 |
|
| 11 |
# FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization
|
| 12 |
|
| 13 |
-
π [Homepage](https://qwen-pilot.notion.site/fipo) | π [Paper PDF](https://
|
| 14 |
|
| 15 |
**Qwen Pilot, Alibaba Group | Published on March 20, 2026**
|
| 16 |
|
|
|
|
| 10 |
|
| 11 |
# FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization
|
| 12 |
|
| 13 |
+
π [Homepage](https://qwen-pilot.notion.site/fipo) | π [Paper PDF](https://arxiv.org/abs/2603.19835) | π€ [Hugging Face](https://huggingface.co/QwenPilot/FIPO_32B) | π€ [ModelScope](https://modelscope.cn/models/chiyum609/FIPO_32B) | π± [GitHub](https://github.com/qwenpilot/FIPO)
|
| 14 |
|
| 15 |
**Qwen Pilot, Alibaba Group | Published on March 20, 2026**
|
| 16 |
|