Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
datatab
/
YugoGPT-Quantized-GGUF
like
2
Transformers
GGUF
Serbian
mistral
text-generation-inference
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
9ee0070
YugoGPT-Quantized-GGUF
57.6 GB
Ctrl+K
Ctrl+K
1 contributor
History:
53 commits
datatab
q4_0: Original quant method, 4-bit.
9ee0070
verified
about 2 years ago
.gitattributes
3.79 kB
q4_0: Original quant method, 4-bit.
about 2 years ago
README.md
Safe
3.42 kB
Update README.md
about 2 years ago
YugoGPT-Quantized-GGUF.Q3_K_XS.gguf
3 GB
xet
q3_k_xs" : "3-bit extra small quantization
about 2 years ago
YugoGPT-Quantized-GGUF.Q4_K_M.gguf
4.37 GB
xet
Rename YugoGPT-Quantize-GGUF.Q4_K_M.gguf to YugoGPT-Quantized-GGUF.Q4_K_M.gguf
about 2 years ago
YugoGPT-Quantized-GGUF.Q4_K_S.gguf
Safe
4.14 GB
xet
Rename YugoGPT-Quantized-GGUF-unsloth.Q4_K_S.gguf to YugoGPT-Quantized-GGUF.Q4_K_S.gguf
about 2 years ago
YugoGPT-Quantized-GGUF.Q5_0.gguf
Safe
5 GB
xet
Rename YugoGPT-Quantized-GGUF-unsloth.Q5_0.gguf to YugoGPT-Quantized-GGUF.Q5_0.gguf
about 2 years ago
YugoGPT-Quantized-GGUF.Q5_K_M.gguf
5.13 GB
xet
Rename YugoGPT-Quantized-GGUF-unsloth.Q5_K_M.gguf to YugoGPT-Quantized-GGUF.Q5_K_M.gguf
about 2 years ago
YugoGPT-Quantized-GGUF.Q5_K_S.gguf
5 GB
xet
Rename YugoGPT-Quantized-GGUF-unsloth.Q5_K_S.gguf to YugoGPT-Quantized-GGUF.Q5_K_S.gguf
about 2 years ago
YugoGPT-Quantized-GGUF.Q6_K.gguf
5.94 GB
xet
Rename YugoGPT-Quantized-GGUF-unsloth.Q6_K.gguf to YugoGPT-Quantized-GGUF.Q6_K.gguf
about 2 years ago
YugoGPT-Quantized-GGUF.Q8_0.gguf
Safe
7.7 GB
xet
Rename YugoGPT-Quantized-GGUF-unsloth.Q8_0.gguf to YugoGPT-Quantized-GGUF.Q8_0.gguf
about 2 years ago
YugoGPT-Quantized.GGUF.Q2_K.gguf
Safe
2.72 GB
xet
q2_k: Uses Q4_K for the attention.vw and feed_forward.w2 tensors, Q2_K for the other tensors.
about 2 years ago
YugoGPT-Quantized.GGUF.Q3_K_L.gguf
Safe
3.82 GB
xet
q3_k_l: Uses Q5_K for the attention.wv, attention.wo, and feed_forward.w2 tensors, else Q3_K
about 2 years ago
YugoGPT-Quantized.GGUF.Q3_K_M.gguf
Safe
3.52 GB
xet
q3_k_m: Uses Q4_K for the attention.wv, attention.wo, and feed_forward.w2 tensors, else Q3_K
about 2 years ago
YugoGPT-Quantized.GGUF.Q3_K_S.gguf
3.16 GB
xet
q3_k_s: Uses Q3_K for all tensors
about 2 years ago
YugoGPT-Quantized.GGUF.Q4_0.gguf
Safe
4.11 GB
xet
q4_0: Original quant method, 4-bit.
about 2 years ago
config.json
Safe
31 Bytes
Create config.json
about 2 years ago