EtashGuha's picture
Upload global_step_10 policy weights (best checkpoint, reward 0.648, pass@8 0.781)
2d0ed64 verified
raw
history blame contribute delete
188 Bytes
{
"do_sample": true,
"eos_token_id": [
151645,
151643
],
"pad_token_id": 151643,
"temperature": 0.6,
"top_k": 20,
"top_p": 0.95,
"transformers_version": "4.57.6"
}