dpo_math_sft_5e6_2ep / optimizer.pt

Commit History

Upload training checkpoint
97b8705

Jennny committed on