tokenlabsdotrun/Llama-3.1-8B-Quanto-Int8
Text Generation
Transformers
Safetensors
llama
quantized
quanto
int8
conversational
text-generation-inference
8-bit precision
License: llama3.1
main
Llama-3.1-8B-Quanto-Int8/tokenizer.json
Commit History
Upload Llama-3.1-8B quantized with quanto int8
c6aa61d
verified
genai2eliza
committed on
9 days ago