Shekswess
/

tiny-think-dpo-math-stem-dpo-beta1-lr3e-6-e1-bs8

Text Generation

Generated from Trainer

Model card Files Files and versions

tiny-think-dpo-math-stem-dpo-beta1-lr3e-6-e1-bs8

Commit History

Update README.md

6bef1af
verified

Shekswess commited on Jan 28

Training in progress, step 358

ab1f3a5
verified

Shekswess commited on Jan 18

initial commit

cca4239
verified

Shekswess commited on Jan 18