Commit 3b0e2c5 (verified) · committed by nielsr (HF Staff) · 1 parent: 5d3772d

Add library name, pipeline tag and link to code


This PR adds the `library_name` and `pipeline_tag` metadata to the model card, specifying that the model uses the Hugging Face Transformers library and is for text generation. It also adds a direct link to the GitHub repository for easier access to the code.
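
As a rough illustration of what the new metadata looks like to a consumer, the sketch below reads the front matter of the updated README and prints the two added fields. The `parse_front_matter` helper is hypothetical and written only for this example; it handles just the flat `key: value` pairs and `- item` lists that appear in this model card and is not a general YAML parser. The Hub's own metadata handling is not shown here.

```python
# Minimal sketch: read the flat YAML front matter of a model card.
# Assumption: only "key: value" pairs and "- item" lists occur, which is
# all this README uses. This is not a general YAML parser.

README = """\
---
base_model:
- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
datasets:
- knoveleng/open-rs
language:
- en
license: apache-2.0
metrics:
- accuracy
library_name: transformers
pipeline_tag: text-generation
---

GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning
"""


def parse_front_matter(text):
    """Return the metadata between the two leading '---' lines as a dict."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}
    meta = {}
    current_key = None  # key whose list value is currently being filled
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # closing delimiter ends the front matter
            break
        if not stripped:
            continue
        if stripped.startswith("- "):
            # List item: attach it to the most recent key with no inline value.
            if current_key is not None:
                meta[current_key].append(stripped[2:].strip())
            continue
        key, _, value = stripped.partition(":")
        key, value = key.strip(), value.strip()
        if value:
            meta[key] = value
            current_key = None
        else:
            meta[key] = []
            current_key = key
    return meta


metadata = parse_front_matter(README)
print(metadata["library_name"], metadata["pipeline_tag"])
```

With the front matter from this PR, the two fields read back as `transformers` and `text-generation`, matching the description above.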

Files changed (1)
  1. README.md +10 -6
README.md CHANGED
@@ -1,16 +1,20 @@
 ---
-license: apache-2.0
+base_model:
+- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
 datasets:
 - knoveleng/open-rs
 language:
 - en
+license: apache-2.0
 metrics:
 - accuracy
-base_model:
-- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
+library_name: transformers
+pipeline_tag: text-generation
 ---
-GPG: A Simple and Strong Reinforcement Learning
-Baseline for Model Reasoning
+
+GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning
 https://arxiv.org/abs/2504.02546
 
-The RL model trained on the Open-r1 dataset based on GPG, using DeepSeek-R1-Distill-Qwen-1.5B as the baseline model.
+The RL model trained on the Open-r1 dataset based on GPG, using DeepSeek-R1-Distill-Qwen-1.5B as the baseline model.
+
+Code: https://github.com/AMAP-ML/GPG