MATH_ALL_REINFORCE_L3.2-3B / training_args.bin

Commit History

Upload Llama3.2-3B MATH train+test REINFORCE-Mod LoRA adapter
acf0380
verified

saaduddinM committed on