MMR-DR_GRPO-lambda-0.8 / config.json

Commit History

End of training
87f6dab
verified

kangdawei commited on

Training in progress, step 100
7bb1e7b
verified

kangdawei commited on