ypwang61/One-Shot-RLVR-Qwen2.5-Math-7B-pi1

Add model card #1

by nielsr (HF Staff), opened May 19, 2025

base: refs/heads/main ← from: refs/pr/1

README.md +11 -3

@@ -1,3 +1,11 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+library_name: transformers
+pipeline_tag: text-generation
+base_model:
+- Qwen/Qwen2.5-Math-7B
+---
+This repository contains the model presented in [Reinforcement Learning for Reasoning in Large Language Models with One Training Example](https://huggingface.co/papers/2504.20571).
+Code: https://github.com/ypwang61/One-Shot-RLVR
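
The added metadata (`library_name: transformers`, `pipeline_tag: text-generation`) suggests the checkpoint can be loaded as a causal language model with the `transformers` library. The following is a minimal sketch under that assumption, not a snippet from the model card. The repository ID is taken from this page; the helper names `build_prompt` and `generate_answer`, the prompt wording, and the generation settings are hypothetical choices made for illustration.

```python
# Minimal sketch: load the checkpoint with transformers and generate greedily.
# Assumes `transformers` and `torch` are installed and enough memory is
# available for a 7B-parameter model.

MODEL_ID = "ypwang61/One-Shot-RLVR-Qwen2.5-Math-7B-pi1"


def build_prompt(question: str) -> str:
    """Wrap a question in an illustrative prompt (assumed, not from the model card)."""
    return (
        question.strip()
        + "\nPlease reason step by step, and put your final answer within \\boxed{}."
    )


def generate_answer(question: str, max_new_tokens: int = 512) -> str:
    """Download the model on first use and return the decoded completion."""
    # Imported lazily so build_prompt works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype="auto")

    inputs = tokenizer(build_prompt(question), return_tensors="pt")
    output_ids = model.generate(
        **inputs, max_new_tokens=max_new_tokens, do_sample=False
    )

    # Drop the prompt tokens so only the newly generated text is decoded.
    prompt_length = inputs["input_ids"].shape[1]
    return tokenizer.decode(output_ids[0][prompt_length:], skip_special_tokens=True)
```

Calling `generate_answer("What is 12 * 13?")` would download the weights on first use. The model runs on CPU by default here, which is slow at this size; moving the model and inputs to a GPU is a straightforward extension.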