Update README.md
Browse files
README.md
CHANGED
|
@@ -2,4 +2,14 @@
|
|
| 2 |
license: apache-2.0
|
| 3 |
base_model:
|
| 4 |
- Pinkstackorg/Qwen3-Coder-pruned-20B-A3B
|
| 5 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 |
license: apache-2.0
|
| 3 |
base_model:
|
| 4 |
- Pinkstackorg/Qwen3-Coder-pruned-20B-A3B
|
| 5 |
+
---
|
| 6 |
+
This is a 4bit bnb quant of the original model.
|
| 7 |
+
|
| 8 |
+
This is a layer-wise pruned variant of cerebras/Qwen3-Coder-REAP-25B-A3B resulting in a ~20B model with ~3B active parameters. It has not been fine-tuned yet.
|
| 9 |
+
|
| 10 |
+
Prune info:
|
| 11 |
+
Original model: 48 layers
|
| 12 |
+
**New model: 38 Layers**
|
| 13 |
+
|
| 14 |
+
result:
|
| 15 |
+
Similar model, it MUST be fine-tuned before usage as current performance is non-ideal. While it can absolutely be used as is, fine-tuning is needed.
|