Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
fraQtl
/
Llama-3.2-3B-optimized
like
0
Follow
fraQtl AI Research
2
Safetensors
llama
fraqtl
kv-cache-optimized
inference
arxiv:
2604.11501
License:
other
Model card
Files
Files and versions
xet
Community
main
Llama-3.2-3B-optimized
/
tokenizer.json
Commit History
fraQtl compressed: k=16 INT3, delta=+0.7151
667ffd6
verified
Zenalyze
commited on
6 days ago