llmat
/

Mistral-Small-Instruct-2409-NVFP4

Text Generation

8-bit precision

compressed-tensors

Model card Files Files and versions

Mistral-Small-Instruct-2409-NVFP4 / generation_config.json

llmat's picture

Add NVFP4 quantized model (llmcompressor oneshot).

ef523f5 verified 6 months ago

history blame contribute delete

111 Bytes

	{
	"_from_model_config": true,
	"bos_token_id": 1,
	"eos_token_id": 2,
	"transformers_version": "4.55.0"
	}