fraQtl/Mistral-7B-optimized
Tags: Safetensors, mistral, fraqtl, kv-cache-optimized, inference, arxiv:2604.11501
License: other
Commit History: Mistral-7B-optimized/README.md (branch: main)
c367f9b (verified): Add arXiv paper link. Zenalyze committed 2 days ago.
9a7a97c (verified): Clarify: KV cache optimized, not smaller file. Zenalyze committed 3 days ago.
dd72361 (verified): Update: k=64, fix generation samples, correct PPL. Zenalyze committed 4 days ago.
b881508 (verified): fraQtl compressed: k=64 INT3, delta=+0.2218. Zenalyze committed 4 days ago.
6fe65fa (verified): fraQtl compressed: k=48 INT3, delta=+0.2146. Zenalyze committed 6 days ago.
80efb64 (verified): Update model card with runtime comparison. Zenalyze committed 6 days ago.
dc721d2 (verified): fraQtl compressed: k=32 INT3, delta=+0.1989. Zenalyze committed 6 days ago.