MATH_ALL_GRPO_L3.2-3B_K8 / tokenizer.json

Commit History

Upload Llama3.2-3B MATH train+test GRPO-OS K8 LoRA adapter
a4ffeb1
verified

saaduddinM committed on