Yuqian-Fu
/

SRFT-Qwen2.5-Math-7B

Text Generation

text-generation-inference

Model card Files Files and versions

Yuqian-Fu commited on Jun 25, 2025

Commit

025154f

·

verified ·

1 Parent(s): 30d6824

Update README.md

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -6,6 +6,7 @@ base_model:
 - open-r1/Qwen2.5-Math-7B-RoPE-300k
 - Qwen/Qwen2.5-Math-7B
 pipeline_tag: reinforcement-learning
 ---
 # 📄 Introduction

 - open-r1/Qwen2.5-Math-7B-RoPE-300k
 - Qwen/Qwen2.5-Math-7B
 pipeline_tag: reinforcement-learning
+arxiv: 2506.19767
 ---
 # 📄 Introduction