| library_name: transformers | |
| pipeline_tag: text-generation | |
| base_model: | |
| - Qwen/Qwen2.5-3B-Instruct | |
| This model was trained for our Reasoning Gym paper (https://arxiv.org/abs/2505.24760) using our Reasoning Gym repo (https://github.com/open-thought/reasoning-gym) |