Qwen2.5-0.5B-Instruct Math SFT 100K 2 Epochs

Fine-tuned from Qwen/Qwen2.5-0.5B-Instruct on a 100K math reasoning SFT mixture for 2 epochs with learning rate 1e-5.

Prompt format used during training:

System: Please reason step by step, and put your final answer within \boxed{}.
User: {problem}
Assistant: {solution}

Training mixture:

Source Count
nvidia/OpenMathReasoning CoT 40,000
AI-MO/NuminaMath-1.5 filtered, no AMC/AIME source 25,000
meta-math/MetaMathQA 15,000
MATH train, especially levels 4-5 15,000
GSM8K train 5,000

Training summary:

Setting Value
Base model Qwen/Qwen2.5-0.5B-Instruct
Training examples 100,000
Epochs 2
Learning rate 1e-5
Max sequence length 4096
Effective batch size 32
Final train loss 0.6923
Final token accuracy about 0.803

Evaluation results are not included in this model card yet.

Downloads last month
18
Safetensors
Model size
0.6B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for zbeeb/Qwen2.5-0.5B-Instruct-Math-SFT-100K-2ep

Finetuned
(796)
this model

Datasets used to train zbeeb/Qwen2.5-0.5B-Instruct-Math-SFT-100K-2ep