grpo_trained_model / README.md

Commit History

Trained with Unsloth
faf5996
verified

regulus4869 committed on