MMR-GRPO-lambda-0.6 / README.md

Commit History

End of training
45f1017
verified

kangdawei commited on

Model save
d62811f
verified

kangdawei commited on