MMR-GRPO-lambda-0.8 / generation_config.json

Commit History

Model save
3bef7e7
verified

kangdawei commited on