countdown-grpo-qwen2 / training_args.bin

Commit History

Upload GRPO fine-tuned model
2ac4c5d
verified

Dat1710 commited on