grpo_math / training_args.bin

Commit History

Backup current best GRPO math checkpoint
7909e36
verified

mmm128 commited on