Cross-link: DistilQwen collection spotlight — 2026-03-29
Browse files
README.md
CHANGED
|
@@ -228,3 +228,19 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
|
|
| 228 |
|
| 229 |
|
| 230 |
*Last updated: 2026-03-28 12:56 UTC*
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 228 |
|
| 229 |
|
| 230 |
*Last updated: 2026-03-28 12:56 UTC*
|
| 231 |
+
|
| 232 |
+
<!-- CIX-CROSSLINK-START -->
|
| 233 |
+
|
| 234 |
+
---
|
| 235 |
+
|
| 236 |
+
## From the Convergent Intelligence Portfolio
|
| 237 |
+
|
| 238 |
+
**[DistilQwen Collection](https://huggingface.co/collections/reaperdoesntknow/distilqwen-69bf40ec669117e3f069ef1c)** — Proof-weighted distillation from Qwen3-30B-A3B → 1.7B and 0.6B. Three teacher variants (Instruct, Thinking, Coder), nine models, 2,788 combined downloads. Structure beats scale.
|
| 239 |
+
|
| 240 |
+
Top model: [Qwen3-1.7B-Coder-Distilled-SFT](https://huggingface.co/reaperdoesntknow/Qwen3-1.7B-Coder-Distilled-SFT) — 508 downloads
|
| 241 |
+
|
| 242 |
+
Full methodology: [Structure Over Scale (DOI: 10.57967/hf/8165)](https://doi.org/10.57967/hf/8165)
|
| 243 |
+
|
| 244 |
+
*Convergent Intelligence LLC: Research Division*
|
| 245 |
+
|
| 246 |
+
<!-- CIX-CROSSLINK-END -->
|