Update README.md
Browse files
README.md
CHANGED
|
@@ -5,7 +5,7 @@ language:
|
|
| 5 |
metrics:
|
| 6 |
- accuracy
|
| 7 |
base_model:
|
| 8 |
-
- deepseek-ai/DeepSeek-R1-Distill-Qwen-
|
| 9 |
---
|
| 10 |
|
| 11 |
# 🚀 GRPO-LEAD: Efficient Reasoning Enhancement for Mathematical Tasks
|
|
@@ -77,4 +77,4 @@ If you find our work useful, please cite it as:
|
|
| 77 |
url={https://arxiv.org/abs/2504.09696},
|
| 78 |
}
|
| 79 |
```
|
| 80 |
-
Enjoy exploring GRPO-LEAD! 🚀✨
|
|
|
|
| 5 |
metrics:
|
| 6 |
- accuracy
|
| 7 |
base_model:
|
| 8 |
+
- deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
|
| 9 |
---
|
| 10 |
|
| 11 |
# 🚀 GRPO-LEAD: Efficient Reasoning Enhancement for Mathematical Tasks
|
|
|
|
| 77 |
url={https://arxiv.org/abs/2504.09696},
|
| 78 |
}
|
| 79 |
```
|
| 80 |
+
Enjoy exploring GRPO-LEAD! 🚀✨
|