SmolLM2-135M-Chat / generation_config.json
gnokit: 5000-step SFT (batch size 4) of HuggingFaceTB/SmolLM2-135M on the HuggingFaceTB/smoltalk/everyday-conversations dataset
b1b36e3 verified
139 Bytes
{
  "_from_model_config": true,
  "bos_token_id": 1,
  "eos_token_id": 2,
  "pad_token_id": 2,
  "transformers_version": "4.49.0"
}
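
One detail in the config above is easy to miss: `pad_token_id` and `eos_token_id` are both 2, so padding reuses the end-of-sequence token. The sketch below checks this using only the standard library. The embedded `CONFIG_TEXT` is a copy of the file's contents, and the helper name `summarize_generation_config` is hypothetical, introduced only for illustration.

```python
import json

# Copy of the generation_config.json contents shown above.
CONFIG_TEXT = """
{
  "_from_model_config": true,
  "bos_token_id": 1,
  "eos_token_id": 2,
  "pad_token_id": 2,
  "transformers_version": "4.49.0"
}
"""


def summarize_generation_config(text: str) -> dict:
    """Parse the config and report the special-token ids.

    Hypothetical helper for illustration; not part of any library.
    """
    cfg = json.loads(text)
    return {
        "bos_token_id": cfg["bos_token_id"],
        "eos_token_id": cfg["eos_token_id"],
        "pad_token_id": cfg["pad_token_id"],
        # True when padding and end-of-sequence share one token id.
        "pad_reuses_eos": cfg["pad_token_id"] == cfg["eos_token_id"],
    }


summary = summarize_generation_config(CONFIG_TEXT)
print(summary)
```

When the `transformers` library is installed, the same file is normally read with `GenerationConfig.from_pretrained`, given the model's repository id or a local directory. Because the pad and EOS ids match, padded positions cannot be told apart from a real end-of-sequence token by id alone, so batched generation should rely on the attention mask.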