nielsr HF Staff committed on
Commit 9182f46 · verified · 1 Parent(s): 2e2054a

Add metadata

This PR ensures a "Use this model" button appears at the top right, and links the base model.

Files changed (1)
  1. README.md +4 -0
README.md CHANGED
@@ -1,5 +1,9 @@
 ---
 license: cc-by-4.0
+library_name: transformers
+pipeline_tag: text-generation
+base_model:
+- Qwen/Qwen2.5-1.5B-Instruct
 ---
 
 This model is described in the paper [DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models](https://arxiv.org/abs/2505.09655).
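
To sanity-check the metadata this change adds, the front matter can be read back and inspected. The following is a minimal sketch, assuming the model card uses only flat `key: value` pairs and simple `- item` lists, as in this diff. `parse_front_matter` and `README_AFTER` are hypothetical names introduced for illustration; they are not part of any Hub tooling, and the README body line is a placeholder.

```python
def parse_front_matter(readme_text):
    """Parse flat YAML front matter from a model card into a dict.

    Handles only `key: value` pairs and simple `- item` lists, which is
    all the metadata in this diff uses. It is not a general YAML parser.
    """
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter block at the top of the file

    metadata = {}
    current_key = None  # key whose list items are being collected
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # closing delimiter ends the front matter
            break
        if not stripped:
            continue
        if stripped.startswith("- ") and current_key is not None:
            # List item belonging to the most recent key with no inline value
            metadata[current_key].append(stripped[2:].strip())
            continue
        key, _, value = stripped.partition(":")
        key, value = key.strip(), value.strip()
        if value:
            metadata[key] = value
            current_key = None
        else:
            metadata[key] = []
            current_key = key
    return metadata


# README.md front matter as it stands after this commit (body is a placeholder)
README_AFTER = """---
license: cc-by-4.0
library_name: transformers
pipeline_tag: text-generation
base_model:
- Qwen/Qwen2.5-1.5B-Instruct
---

Placeholder for the model card body.
"""

metadata = parse_front_matter(README_AFTER)
print(metadata)
```

In practice a full YAML parser would be the more robust choice; the hand-rolled version here only keeps the sketch free of third-party dependencies.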