grpo_qwen_coder_3B_16bit / generation_config.json

Commit History

Trained with Unsloth
1b89650
verified

chien commited on