Curated Numinamath
Collection
3 items • Updated
How to use collinear-ai/math_reasoning_phi_c2 with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("microsoft/Phi-3.5-mini-instruct")
model = PeftModel.from_pretrained(base_model, "collinear-ai/math_reasoning_phi_c2")This is an open-source fine-tuned reasoning adapter of microsoft/Phi-3.5-mini-instruct, transformed into a math reasoning model using data curated from collinear-ai/R1-Distill-SFT-Curated.
axolotl version: 0.5.0
Math-Reasoning
Training data curated from collinear-ai/R1-Distill-SFT-Curated Evaluation data: HuggingFaceH4/MATH-500
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| No log | 0.0003 | 1 | 0.6646 |
| 0.3174 | 0.3335 | 1247 | 0.3329 |
| 0.307 | 0.6670 | 2494 | 0.3169 |
Base model
microsoft/Phi-3.5-mini-instruct
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("microsoft/Phi-3.5-mini-instruct") model = PeftModel.from_pretrained(base_model, "collinear-ai/math_reasoning_phi_c2")