Shekswess
/

tiny-think-dpo-math-stem-apo_zero-beta0_3-lr3e-6-e1-bs8

Commit History

Training in progress, step 358

ad78a7b
verified

Shekswess commited on 18 days ago