Update README.md
README.md
CHANGED
|
@@ -11,4 +11,28 @@ tags:
|
|
| 11 |
language:
|
| 12 |
- en
|
| 13 |
library_name: transformers
|
| 14 |
-
---
|
| 11 |
language:
|
| 12 |
- en
|
| 13 |
library_name: transformers
|
| 14 |
+
---
|
| 15 |
+
|
| 16 |
+
# **Draco-CoderMini-3B-GGUF**
|
| 17 |
+
|
| 18 |
+
> **Draco-CoderMini-3B** is a compact, coding-optimized language model built on the **Qwen2 architecture**, tailored for high-accuracy **code generation**, **debugging**, and **technical reasoning**. With **3 billion parameters**, it strikes a balance between power and deployability, making it an ideal assistant for developers, educators, and engineers working in constrained environments or requiring fast inference.
|
| 19 |
+
|
| 20 |
+
## Model Files
|
| 21 |
+
|
| 22 |
+
| File Name | Size | Format |
|
| 23 |
+
|----------------------------------------|---------|--------|
|
| 24 |
+
| Draco-CoderMini-3B.BF16.gguf | 6.18 GB | BF16 |
|
| 25 |
+
| Draco-CoderMini-3B.F16.gguf | 6.18 GB | F16 |
|
| 26 |
+
| Draco-CoderMini-3B.F32.gguf | 12.3 GB | F32 |
|
| 27 |
+
| .gitattributes | 1.75 kB | - |
|
| 28 |
+
| README.md | 210 B | - |
|
| 29 |
+
| config.json | 31 B | JSON |
|
| 30 |
+
|
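As a rough sanity check on the sizes in the table, the sketch below estimates the parameter count each weight file implies. It assumes the listed sizes are decimal gigabytes, that BF16 and F16 store 2 bytes per weight and F32 stores 4, and it ignores GGUF metadata and tokenizer overhead. The names `BYTES_PER_WEIGHT`, `FILE_SIZES_GB`, and `implied_params_billions` are illustrative and not part of the model repository.

```python
# Bytes stored per weight for the unquantized formats listed in the table.
BYTES_PER_WEIGHT = {"BF16": 2, "F16": 2, "F32": 4}

# File sizes copied from the table; assumed to be decimal gigabytes (1 GB = 1e9 bytes).
FILE_SIZES_GB = {"BF16": 6.18, "F16": 6.18, "F32": 12.3}


def implied_params_billions(size_gb: float, fmt: str) -> float:
    """Estimate the parameter count (in billions) implied by a file size.

    Ignores metadata overhead, so the result is only an approximation.
    """
    total_bytes = size_gb * 1e9
    return total_bytes / BYTES_PER_WEIGHT[fmt] / 1e9


estimates = {
    fmt: implied_params_billions(size, fmt) for fmt, size in FILE_SIZES_GB.items()
}

for fmt, est in estimates.items():
    print(f"{fmt}: roughly {est:.2f}B parameters")
```

Under these assumptions every file implies a count close to the stated 3 billion parameters, which also explains why the F32 file is about twice the size of the 16-bit files.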
| 31 |
+
## Quants Usage
|
| 32 |
+
|
| 33 |
+
(Sorted by size, not necessarily quality. IQ-quants are often preferable to similarly sized non-IQ quants.)
|
| 34 |
+
|
| 35 |
+
Here is a handy graph by ikawrakow comparing some lower-quality quant
|
| 36 |
+
types (lower is better):
|
| 37 |
+
|
| 38 |
+

|