zubenelgenubi-124m / generation_config.json
Upload 124M GPT trained from scratch with SmolLM distillation
{
  "_from_model_config": true,
  "bos_token_id": 1,
  "eos_token_id": 2,
  "output_attentions": false,
  "output_hidden_states": false,
  "transformers_version": "5.0.0",
  "use_cache": true
}
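
A minimal sketch of how the fields in the file above could be read, using only Python's standard `json` module. The helper name `load_generation_settings` and the choice of which keys to return are assumptions made for illustration; they are not part of this repository or of the transformers library.

```python
import json

# Contents of generation_config.json, copied from the file above.
GENERATION_CONFIG = """
{
  "_from_model_config": true,
  "bos_token_id": 1,
  "eos_token_id": 2,
  "output_attentions": false,
  "output_hidden_states": false,
  "transformers_version": "5.0.0",
  "use_cache": true
}
"""


def load_generation_settings(text):
    """Hypothetical helper: parse the JSON text and keep the token ids and cache flag."""
    config = json.loads(text)
    return {
        "bos_token_id": config["bos_token_id"],
        "eos_token_id": config["eos_token_id"],
        "use_cache": config["use_cache"],
    }


settings = load_generation_settings(GENERATION_CONFIG)
print(settings)
```

The sketch parses the text directly so that it runs without downloading the model; in practice the same keys would be read from the file on disk.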