RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8 at 9811037b3bdd0e39bd4c5e8ff039e97bb4da453d

Meta-Llama-3.1-8B-Instruct-quantized.w8a8

9.09 GB

Ctrl+K

Ctrl+K

4 contributors

History: 22 commits

jennyyyi's picture

Update README.md

9811037 verified 12 months ago

.gitattributes

1.52 kB
initial commit almost 2 years ago
README.md

20.9 kB
Update README.md 12 months ago
config.json

2.15 kB
Updated compression_config to quantization_config over 1 year ago
generation_config.json

184 Bytes
Upload folder using huggingface_hub almost 2 years ago
model-00001-of-00002.safetensors

5 GB
xet

Upload folder using huggingface_hub almost 2 years ago
model-00002-of-00002.safetensors

4.08 GB
xet

Upload folder using huggingface_hub almost 2 years ago
model.safetensors.index.json

43.5 kB
Upload folder using huggingface_hub almost 2 years ago
recipe.yaml

173 Bytes
Upload folder using huggingface_hub almost 2 years ago
special_tokens_map.json

325 Bytes
Upload folder using huggingface_hub almost 2 years ago
tokenizer.json

9.09 MB
Upload tokenizer.json with huggingface_hub over 1 year ago
tokenizer_config.json

55.4 kB
Upload tokenizer_config.json with huggingface_hub over 1 year ago