Update README.md
Browse files
README.md
CHANGED
|
@@ -6,6 +6,7 @@ base_model:
|
|
| 6 |
- open-r1/Qwen2.5-Math-7B-RoPE-300k
|
| 7 |
- Qwen/Qwen2.5-Math-7B
|
| 8 |
pipeline_tag: reinforcement-learning
|
|
|
|
| 9 |
---
|
| 10 |
|
| 11 |
# 📄 Introduction
|
|
|
|
| 6 |
- open-r1/Qwen2.5-Math-7B-RoPE-300k
|
| 7 |
- Qwen/Qwen2.5-Math-7B
|
| 8 |
pipeline_tag: reinforcement-learning
|
| 9 |
+
arxiv: 2506.19767
|
| 10 |
---
|
| 11 |
|
| 12 |
# 📄 Introduction
|