RedHatAI/Llama-3.2-3B-Instruct-FP8-dynamic
Tags: Text Generation, Safetensors, 8 languages, llama, fp8, vllm, conversational, compressed-tensors
License: llama3.2
Branch: main
Size: 4.41 GB
4 contributors
History: 5 commits
Latest commit: c308a86 (verified) by mgoin, over 1 year ago: "Updated compression_config to quantization_config"
File | Scan | Size | Last commit | Updated
.gitattributes | Safe | 1.57 kB | add model | almost 2 years ago
README.md | Safe | 8.78 kB | Update README.md | almost 2 years ago
config.json | Safe | 2.11 kB | Updated compression_config to quantization_config | over 1 year ago
generation_config.json | Safe | 184 Bytes | add model | almost 2 years ago
model.safetensors | Safe | 4.4 GB (xet) | add model | almost 2 years ago
recipe.yaml | Safe | 351 Bytes | add model | almost 2 years ago
special_tokens_map.json | Safe | 296 Bytes | add model | almost 2 years ago
tokenizer.json | Safe | 17.2 MB (xet) | add model | almost 2 years ago
tokenizer_config.json | Safe | 54.5 kB | add model | almost 2 years ago
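
The most recent commit to config.json renames the key `compression_config` to `quantization_config`. The listing does not show the file's contents, so the following is only a minimal sketch of how a script might read the quantization settings from either key. The helper name `read_quantization_settings` and the sample config values (including `quant_method`) are hypothetical, chosen for illustration rather than taken from the repository.

```python
import json
from typing import Optional


def read_quantization_settings(config: dict) -> Optional[dict]:
    """Return the quantization block, preferring the newer key name.

    Hypothetical helper: checks `quantization_config` first, then falls
    back to the older `compression_config` key named in the commit message.
    """
    for key in ("quantization_config", "compression_config"):
        if key in config:
            return config[key]
    return None


# Hypothetical minimal configs; the real config.json is not shown in the listing.
new_style = json.loads(
    '{"model_type": "llama", "quantization_config": {"quant_method": "compressed-tensors"}}'
)
old_style = json.loads(
    '{"model_type": "llama", "compression_config": {"quant_method": "compressed-tensors"}}'
)

print(read_quantization_settings(new_style))
print(read_quantization_settings(old_style))
```

Checking the newer key first means a file that somehow carries both keys resolves to the current name, while older revisions of the file still load.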