Mistral-Small-Instruct-2409-NVFP4 / generation_config.json
Add NVFP4 quantized model (llmcompressor oneshot).
ef523f5 verified
{
  "_from_model_config": true,
  "bos_token_id": 1,
  "eos_token_id": 2,
  "transformers_version": "4.55.0"
}
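
The file above only sets the beginning-of-sequence and end-of-sequence token IDs. As a minimal sketch of how those fields can be read, the snippet below parses the same JSON with Python's standard library. The helper name `load_special_token_ids` and the inlined `GENERATION_CONFIG_TEXT` string are assumptions made for illustration and are not part of this repository.

```python
import json

# Contents of generation_config.json, copied here so the sketch is self-contained.
GENERATION_CONFIG_TEXT = """
{
  "_from_model_config": true,
  "bos_token_id": 1,
  "eos_token_id": 2,
  "transformers_version": "4.55.0"
}
"""


def load_special_token_ids(text: str) -> dict:
    """Hypothetical helper: parse the JSON and keep only the *_token_id fields."""
    config = json.loads(text)
    return {key: value for key, value in config.items() if key.endswith("_token_id")}


special_ids = load_special_token_ids(GENERATION_CONFIG_TEXT)
print(special_ids)
```

In practice the `transformers` library typically picks this file up automatically when the model is loaded, so manual parsing like this is mainly useful for inspecting or validating the values.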