filter-2b-w8a8 / generation_config.json
alphaXiv's picture
Add W8A8 INT8 quantized filter-2b (SmoothQuant + GPTQ W8A8)
844fc11 verified
raw
history blame contribute delete
115 Bytes
{
"_from_model_config": true,
"eos_token_id": 248044,
"transformers_version": "5.5.0",
"use_cache": true
}