math_model / generation_config.json

Commit History

Update math model
72927ef
verified

jdecim committed on

Push exp8 GRPO best (step 750), gen temp=0.7 for pass@8
ab2047d
verified

jdecim committed on

Push DPO checkpoint with T=0.3 (optimal for pass@8 on CI gate)
ac3517e
verified

jdecim committed on

Upload math SFT checkpoint (smoke test)
2b172d7
verified

jdecim committed on