Qwen2.5-3B-Knowledge-R1-GRPO / training_args.bin

Commit History

Model save
4b48b2b

hzy committed on