CarlOwOs
/

MNLP_M2_mcqa_model-W4A8

8-bit precision

compressed-tensors

Model card Files Files and versions

MNLP_M2_mcqa_model-W4A8

1.09 GB

Ctrl+K

Ctrl+K

1 contributor

History: 2 commits

CarlOwOs's picture

Add FP8 dynamically quantized MNLP_M2_mcqa_model model using llm-compressor

cc8a543 verified 12 months ago

.gitattributes

1.57 kB
Add FP8 dynamically quantized MNLP_M2_mcqa_model model using llm-compressor 12 months ago
added_tokens.json

707 Bytes
Add FP8 dynamically quantized MNLP_M2_mcqa_model model using llm-compressor 12 months ago
config.json

1.84 kB
Add FP8 dynamically quantized MNLP_M2_mcqa_model model using llm-compressor 12 months ago
generation_config.json

166 Bytes
Add FP8 dynamically quantized MNLP_M2_mcqa_model model using llm-compressor 12 months ago
merges.txt

1.67 MB
Add FP8 dynamically quantized MNLP_M2_mcqa_model model using llm-compressor 12 months ago
model.safetensors

1.07 GB
xet

Add FP8 dynamically quantized MNLP_M2_mcqa_model model using llm-compressor 12 months ago
recipe.yaml

170 Bytes
Add FP8 dynamically quantized MNLP_M2_mcqa_model model using llm-compressor 12 months ago
special_tokens_map.json

617 Bytes
Add FP8 dynamically quantized MNLP_M2_mcqa_model model using llm-compressor 12 months ago
tokenizer.json

11.4 MB
xet

Add FP8 dynamically quantized MNLP_M2_mcqa_model model using llm-compressor 12 months ago
tokenizer_config.json

5.6 kB
Add FP8 dynamically quantized MNLP_M2_mcqa_model model using llm-compressor 12 months ago
vocab.json

2.78 MB
Add FP8 dynamically quantized MNLP_M2_mcqa_model model using llm-compressor 12 months ago