RedHatAI/Llama-3.2-3B-Instruct-quantized.w8a8
Text Generation
Safetensors
8 languages
llama
llama-3
neuralmagic
llmcompressor
conversational
8-bit precision
compressed-tensors
arxiv:2211.10438
arxiv:2210.17323
License: llama3.2
refs/pr/1
Commit History
Update config.json with the correct state
ffdab28 (verified) dsikka committed on Jul 10, 2025

Update README.md
fb5da44 (verified) alexmarques committed on Oct 16, 2024

Updated compression_config to quantization_config
59e165b (verified) mgoin committed on Oct 9, 2024

Update README.md
0f32e45 (verified) alexmarques committed on Sep 26, 2024

Create README.md
518b068 (verified) alexmarques committed on Sep 26, 2024

Upload folder using huggingface_hub
1c42cac (verified) alexmarques committed on Sep 25, 2024

initial commit
664ba2b (verified) alexmarques committed on Sep 25, 2024