Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
lsm0729
/
Meta-Llama-3.1-8B-Instruct-quantized.w8a8
like
0
Text Generation
Transformers
Safetensors
llama
quantized
int8
w8a8
llmcompressor
llama-3.1
conversational
text-generation-inference
8-bit precision
compressed-tensors
arxiv:
2407.21783
License:
llama3.1
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Meta-Llama-3.1-8B-Instruct-quantized.w8a8
9.1 GB
1 contributor
History:
2 commits
lsm0729
Upload W8A8 quantized Llama 3.1 8B Instruct model
e5a9602
verified
24 days ago
.gitattributes
1.57 kB
Upload W8A8 quantized Llama 3.1 8B Instruct model
24 days ago
LICENSE
314 Bytes
Upload W8A8 quantized Llama 3.1 8B Instruct model
24 days ago
README.md
2.6 kB
Upload W8A8 quantized Llama 3.1 8B Instruct model
24 days ago
chat_template.jinja
4.61 kB
Upload W8A8 quantized Llama 3.1 8B Instruct model
24 days ago
config.json
2.16 kB
Upload W8A8 quantized Llama 3.1 8B Instruct model
24 days ago
generation_config.json
184 Bytes
Upload W8A8 quantized Llama 3.1 8B Instruct model
24 days ago
model-00001-of-00002.safetensors
5 GB
xet
Upload W8A8 quantized Llama 3.1 8B Instruct model
24 days ago
model-00002-of-00002.safetensors
4.08 GB
xet
Upload W8A8 quantized Llama 3.1 8B Instruct model
24 days ago
model.safetensors.index.json
43.5 kB
Upload W8A8 quantized Llama 3.1 8B Instruct model
24 days ago
recipe.yaml
223 Bytes
Upload W8A8 quantized Llama 3.1 8B Instruct model
24 days ago
special_tokens_map.json
296 Bytes
Upload W8A8 quantized Llama 3.1 8B Instruct model
24 days ago
tokenizer.json
17.2 MB
xet
Upload W8A8 quantized Llama 3.1 8B Instruct model
24 days ago
tokenizer_config.json
50.5 kB
Upload W8A8 quantized Llama 3.1 8B Instruct model
24 days ago