Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
lsm0729
/
Meta-Llama-3.1-8B-Instruct-quantized.w8a8
like
0
Text Generation
Transformers
Safetensors
llama
quantized
int8
w8a8
llmcompressor
llama-3.1
conversational
text-generation-inference
8-bit precision
compressed-tensors
arxiv:
2407.21783
License:
llama3.1
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Meta-Llama-3.1-8B-Instruct-quantized.w8a8
/
tokenizer.json
Commit History
Upload W8A8 quantized Llama 3.1 8B Instruct model
e5a9602
verified
lsm0729
commited on
Jan 21