Cross-link: DistilQwen collection spotlight — 2026-03-29
Browse files
README.md
CHANGED
|
@@ -219,3 +219,19 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
|
|
| 219 |
|
| 220 |
|
| 221 |
*Last updated: 2026-03-28 12:58 UTC*
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 219 |
|
| 220 |
|
| 221 |
*Last updated: 2026-03-28 12:58 UTC*
|
| 222 |
+
|
| 223 |
+
<!-- CIX-CROSSLINK-START -->
|
| 224 |
+
|
| 225 |
+
---
|
| 226 |
+
|
| 227 |
+
## From the Convergent Intelligence Portfolio
|
| 228 |
+
|
| 229 |
+
**[DistilQwen Collection](https://huggingface.co/collections/reaperdoesntknow/distilqwen-69bf40ec669117e3f069ef1c)** — Proof-weighted distillation from Qwen3-30B-A3B → 1.7B and 0.6B. Three teacher variants (Instruct, Thinking, Coder), nine models, 2,788 combined downloads. Structure beats scale.
|
| 230 |
+
|
| 231 |
+
Top model: [Qwen3-1.7B-Coder-Distilled-SFT](https://huggingface.co/reaperdoesntknow/Qwen3-1.7B-Coder-Distilled-SFT) — 508 downloads
|
| 232 |
+
|
| 233 |
+
Full methodology: [Structure Over Scale (DOI: 10.57967/hf/8165)](https://doi.org/10.57967/hf/8165)
|
| 234 |
+
|
| 235 |
+
*Convergent Intelligence LLC: Research Division*
|
| 236 |
+
|
| 237 |
+
<!-- CIX-CROSSLINK-END -->
|