grpo-test-better / generation_config.json

Commit History

Trained with Unsloth
d8eb7cf
verified

duxx commited on