OPENR1_REINFORCE_MOD_Q2.5-7B / training_args.bin

Commit History

Upload Qwen2.5-7B OpenR1 verified REINFORCE-Mod LoRA adapter
72fc998
verified

saaduddinM committed on