Spaces: yashvyasop / DesignGym

DesignGym / training

115 kB

1 contributor

History: 34 commits

yashvyasop

grpo: add --eval_best_of (best-of-N eval for SFT and GRPO)

8a2235b verified 2 months ago