MoM-python-slm-grpo / generation_config.json
GRPO (RLVR) on MoM-python-slm, 500 steps
6768ee3 verified
{
  "do_sample": true,
  "eos_token_id": [
    151645,
    151643
  ],
  "pad_token_id": 151643,
  "repetition_penalty": 1.1,
  "temperature": 0.7,
  "top_k": 20,
  "top_p": 0.8,
  "transformers_version": "5.12.1"
}