tinystories / generation_config.json
jacksuuuu's picture
Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46)
08dbe3c verified
{
"_from_model_config": true,
"bos_token_id": 50256,
"eos_token_id": 50256,
"transformers_version": "4.51.3"
}