grpo-llama-8b-math / generation_config.json

Commit History

Trained with Unsloth
82f83b6
verified

ness15 committed on