qmd-training-scripts / train_4B_grpo.py

Commit History

Add 4B GRPO training script
5775965
verified

tobil commited on