koreankiwi99
/
M2_dpo_model_base_Math-Step-DPO-10K

Model card Files Files and versions
xet
Metrics Training metrics Community