PlanePaper commited on
Commit
c663e9b
·
verified ·
1 Parent(s): 0834676

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -5,7 +5,7 @@ language:
5
  metrics:
6
  - accuracy
7
  base_model:
8
- - deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
9
  ---
10
 
11
  # 🚀 GRPO-LEAD: Efficient Reasoning Enhancement for Mathematical Tasks
@@ -77,4 +77,4 @@ If you find our work useful, please cite it as:
77
  url={https://arxiv.org/abs/2504.09696},
78
  }
79
  ```
80
- Enjoy exploring GRPO-LEAD! 🚀✨
 
5
  metrics:
6
  - accuracy
7
  base_model:
8
+ - deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
9
  ---
10
 
11
  # 🚀 GRPO-LEAD: Efficient Reasoning Enhancement for Mathematical Tasks
 
77
  url={https://arxiv.org/abs/2504.09696},
78
  }
79
  ```
80
+ Enjoy exploring GRPO-LEAD! 🚀✨