Cross-link: DistilQwen collection spotlight — 2026-03-29
README.md
CHANGED
@@ -108,3 +108,19 @@ This model was designed and built from Discrepancy Analysis, paper to be publish
 
 
 *Last updated: 2026-03-28 12:57 UTC*
+
+<!-- CIX-CROSSLINK-START -->
+
+---
+
+## From the Convergent Intelligence Portfolio
+
+**[DistilQwen Collection](https://huggingface.co/collections/reaperdoesntknow/distilqwen-69bf40ec669117e3f069ef1c)** — Proof-weighted distillation from Qwen3-30B-A3B → 1.7B and 0.6B. Three teacher variants (Instruct, Thinking, Coder), nine models, 2,788 combined downloads. Structure beats scale.
+
+Top model: [Qwen3-1.7B-Coder-Distilled-SFT](https://huggingface.co/reaperdoesntknow/Qwen3-1.7B-Coder-Distilled-SFT) — 508 downloads
+
+Full methodology: [Structure Over Scale (DOI: 10.57967/hf/8165)](https://doi.org/10.57967/hf/8165)
+
+*Convergent Intelligence LLC: Research Division*
+
+<!-- CIX-CROSSLINK-END -->