GD-ML
/

Open-RS1

@@ -1,16 +1,20 @@
 ---
-license: apache-2.0
 datasets:
 - knoveleng/open-rs
 language:
 - en
 metrics:
 - accuracy
-base_model:
-- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
 ---
-GPG: A Simple and Strong Reinforcement Learning
-Baseline for Model Reasoning
 https://arxiv.org/abs/2504.02546
-The RL model trained on the Open-r1 dataset based on GPG, using DeepSeek-R1-Distill-Qwen-1.5B as the baseline model.

 ---
+base_model:
+- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
 datasets:
 - knoveleng/open-rs
 language:
 - en
+license: apache-2.0
 metrics:
 - accuracy
+library_name: transformers
+pipeline_tag: text-generation
 ---
+GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning
 https://arxiv.org/abs/2504.02546
+The RL model trained on the Open-r1 dataset based on GPG, using DeepSeek-R1-Distill-Qwen-1.5B as the baseline model.
+Code: https://github.com/AMAP-ML/GPG