nielsr HF Staff commited on
Commit
6e75896
·
verified ·
1 Parent(s): 0cc7dca

Add model card

Browse files

This PR:
- ensures the model is linked to (and shows up at) https://huggingface.co/papers/2504.20571
- has a linked base model (Qwen/Qwen2.5-Math-7B)
- has a "Use this model" button at the top right, by adding the `library_name` metadata.

Files changed (1) hide show
  1. README.md +11 -3
README.md CHANGED
@@ -1,3 +1,11 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ library_name: transformers
4
+ pipeline_tag: text-generation
5
+ base_model:
6
+ - Qwen/Qwen2.5-Math-7B
7
+ ---
8
+
9
+ This repository contains the model presented in [Reinforcement Learning for Reasoning in Large Language Models with One Training Example](https://huggingface.co/papers/2504.20571).
10
+
11
+ Code: https://github.com/ypwang61/One-Shot-RLVR