gemma-3-1b-it-Math-GRPO / train_rs_sft.py

Commit History

Add train_rs_sft.py
12dd0e7
verified

NotoriousH2 commited on