XXXXyu's Collections
vlut.cpp
Updated Jan 1
SOTA ternary-packed versions of 1.58-bit LLMs for efficient on-device inference with vlut.cpp.
XXXXyu/Llama3-8B-1.58-100B-tokens-vlut-gguf • Text Generation • 8B • Updated Jan 1 • 28
XXXXyu/bitnet_b1_58-3B-vlut-gguf • Text Generation • 3B • Updated Jan 1 • 41
XXXXyu/Falcon3-1B-Instruct-1.58bit-vlut-gguf • Text Generation • 2B • Updated Jan 1 • 48