koreankiwi99/M2_dpo_model_base_Math-Step-DPO-10K