Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
parrishcorcoran
/
MedusaBitNet-2B-4T
like
0
Text Generation
GGUF
English
bitnet
speculative-decoding
medusa
ternary-weights
efficient-inference
cpu-inference
arxiv:
2401.10774
License:
mit
Model card
Files
Files and versions
xet
Community
main
MedusaBitNet-2B-4T
1.31 GB
Ctrl+K
Ctrl+K
2 contributors
History:
8 commits
Parrish Corcoran
Add trained Medusa heads and merged GGUF model
734ed40
3 days ago
figures
Upload folder using huggingface_hub
3 days ago
.gitattributes
2.29 kB
Add trained Medusa heads and merged GGUF model
3 days ago
README.md
Safe
5.25 kB
Upload README.md with huggingface_hub
3 days ago
benchmark_headtohead.json
Safe
6.64 kB
Upload benchmark_headtohead.json with huggingface_hub
3 days ago
benchmark_medusa_real.json
Safe
504 Bytes
Upload benchmark_medusa_real.json with huggingface_hub
3 days ago
benchmark_results.json
Safe
3.08 kB
Upload benchmark_results.json with huggingface_hub
3 days ago
ggml-model-i2_s-medusa.gguf
1.2 GB
xet
Add trained Medusa heads and merged GGUF model
3 days ago
medusa_heads_step2000.pt
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.BFloat16Storage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
105 MB
xet
Add trained Medusa heads and merged GGUF model
3 days ago