MATH_ALL_REINFORCE_MOD_L3.2-3B / training_args.bin

Commit History

Upload Llama3.2-3B MATH train+test REINFORCE-Mod TB LoRA adapter
82cdc6d
verified

saaduddinM commited on