This model was fine-tuned from the base model Qwen/Qwen2.5-Math-1.5B-Instruct on the HuggingFaceH4/Bespoke-Stratos-17k dataset using the Hugging Face SFT trainer on a single H100 GPU.
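
For readers who want to reproduce a comparable run, the following is a minimal sketch of what such a setup might look like. It assumes the trainer is TRL's `SFTTrainer`, that the dataset exposes a `train` split in a chat format the trainer can consume directly, and that bf16 is used on the H100. The function name `run_sft`, the output directory, and all settings are illustrative assumptions; the actual hyperparameters of this model are not stated here.

```python
# Minimal sketch, NOT the exact training script used for this model.
# Assumptions: TRL's SFTTrainer, a "train" split, bf16 precision, default hyperparameters.

BASE_MODEL = "Qwen/Qwen2.5-Math-1.5B-Instruct"
DATASET = "HuggingFaceH4/Bespoke-Stratos-17k"


def run_sft(output_dir: str = "qwen2.5-math-1.5b-sft") -> None:
    """Fine-tune BASE_MODEL on DATASET with supervised fine-tuning (hypothetical helper)."""
    # Heavy dependencies are imported lazily so the module can be inspected without them.
    from datasets import load_dataset
    from trl import SFTConfig, SFTTrainer

    # Assumption: the dataset provides a "train" split.
    train_dataset = load_dataset(DATASET, split="train")

    # Assumption: bf16 on a single H100; every other setting is left at the TRL default.
    config = SFTConfig(output_dir=output_dir, bf16=True)

    # Passing the model ID as a string lets the trainer load the model itself.
    trainer = SFTTrainer(model=BASE_MODEL, args=config, train_dataset=train_dataset)
    trainer.train()
    trainer.save_model(output_dir)
```

Calling `run_sft()` downloads the model and dataset and starts training, so it needs a GPU with enough memory and the `trl` and `datasets` packages installed.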