atomicmilkshake/llama-cpp-turboquant-binaries
Tags: llama-cpp, turboquant, triattention, kv-cache, windows, cuda
arxiv: 2604.04921
License: mit
main
llama-cpp-turboquant-binaries
187 MB
1 contributor
History: 3 commits
Latest commit: atomicmilkshake, "Add README" (402c910, verified), 1 day ago
.gitattributes (1.52 kB): initial commit, 1 day ago
README.md (2.14 kB): Add README, 1 day ago
llama-turboquant-triattention-win-cu13-x64.zip (187 MB): Add Windows x64 CUDA 13 Release build (TurboQuant + TriAttention), 1 day ago