ppo_trained_model_gsm8k_ppo_500examples / generation_config.json

Commit History

Trained with Unsloth
d75b973
verified

regulus4869 commited on