The model for mathematical reasoning task training from MATH training set by DERL.

Downloads last month
10
Safetensors
Model size
3B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for DifferentiableEvolutionaryRL/DERL-MATH-Qwen-2.5-3B

Base model

Qwen/Qwen2.5-3B
Finetuned
(287)
this model
Quantizations
1 model