This model was trained for our Reasoning Gym paper (https://arxiv.org/abs/2505.24760) using our Reasoning Gym repo (https://github.com/open-thought/reasoning-gym)

Downloads last month
67
Safetensors
Model size
3B params
Tensor type
BF16
·
Inference Providers NEW
Input a message to start chatting with zafstojano/Qwen2.5-3B-Instruct-RG-Math.

Model tree for zafstojano/Qwen2.5-3B-Instruct-RG-Math

Base model

Qwen/Qwen2.5-3B
Finetuned
(1263)
this model

Paper for zafstojano/Qwen2.5-3B-Instruct-RG-Math