32b_codeelo-v2_Qwen3-32B_step5 / generation_config.json
Best train reward at step 4 (reward=0.453, pass@8=0.678). Uploading nearest HF export at step 5 (reward=0.398). Base model: Qwen/Qwen3-32B, dataset: exp_rpt_codeelo-v2.
6e7a327 verified
{
  "bos_token_id": 151643,
  "do_sample": true,
  "eos_token_id": [
    151645,
    151643
  ],
  "pad_token_id": 151643,
  "temperature": 0.6,
  "top_k": 20,
  "top_p": 0.95,
  "transformers_version": "4.57.6"
}
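
To make the sampling fields above concrete, here is a minimal standard-library sketch of how `temperature`, `top_k`, and `top_p` could narrow a next-token distribution, and how the two `eos_token_id` values could act as stop tokens. It is a simplified illustration, not the transformers implementation: the function names `filtered_distribution` and `is_stop_token` and the toy logits are hypothetical, and the temperature, then top-k, then top-p ordering is an assumption made for this sketch.

```python
import json
import math

# Sampling fields copied from the generation_config.json above.
CONFIG = json.loads("""
{
  "do_sample": true,
  "eos_token_id": [151645, 151643],
  "pad_token_id": 151643,
  "temperature": 0.6,
  "top_k": 20,
  "top_p": 0.95
}
""")


def filtered_distribution(logits, temperature, top_k, top_p):
    """Hypothetical helper: return {token_id: probability} after
    temperature scaling, top-k truncation, and top-p (nucleus) truncation."""
    # Temperature below 1.0 sharpens the distribution.
    scaled = [x / temperature for x in logits]
    # Keep only the top_k highest-scoring token ids, best first.
    ranked = sorted(range(len(scaled)), key=lambda i: scaled[i], reverse=True)
    ranked = ranked[:top_k]
    # Softmax over the survivors (subtract the max for numerical stability).
    peak = scaled[ranked[0]]
    weights = [math.exp(scaled[i] - peak) for i in ranked]
    total = sum(weights)
    probs = [w / total for w in weights]
    # Nucleus step: keep the shortest prefix whose cumulative mass reaches top_p.
    kept = []
    cumulative = 0.0
    for token_id, p in zip(ranked, probs):
        kept.append((token_id, p))
        cumulative += p
        if cumulative >= top_p:
            break
    # Renormalize so the kept probabilities sum to 1.
    norm = sum(p for _, p in kept)
    return {token_id: p / norm for token_id, p in kept}


def is_stop_token(token_id, config=CONFIG):
    """Hypothetical helper: True if token_id is one of the configured EOS ids."""
    eos = config["eos_token_id"]
    eos_ids = eos if isinstance(eos, list) else [eos]
    return token_id in eos_ids


# Toy logits for a 30-token vocabulary (made-up values, for illustration only).
toy_logits = [float(i) for i in range(30)]
dist = filtered_distribution(
    toy_logits, CONFIG["temperature"], CONFIG["top_k"], CONFIG["top_p"]
)
```

With `do_sample` set to true, a generation loop would draw the next token from a distribution like `dist` (for example with `random.choices`) and stop once `is_stop_token` returns true.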