Update portfolio metrics — 2026-03-28
Browse files
README.md
CHANGED
|
@@ -239,4 +239,34 @@ Citation
|
|
| 239 |
|
| 240 |
Acknowledgements
|
| 241 |
|
| 242 |
-
Built with 🤗 Transformers and a metric-first rethinking of attention. BlackHoleRoPE draws inspiration from symplectic/rotational encodings and bounded-energy dynamics.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 239 |
|
| 240 |
Acknowledgements
|
| 241 |
|
| 242 |
+
Built with 🤗 Transformers and a metric-first rethinking of attention. BlackHoleRoPE draws inspiration from symplectic/rotational encodings and bounded-energy dynamics.
|
| 243 |
+
|
| 244 |
+
---
|
| 245 |
+
|
| 246 |
+
## Convergent Intelligence Portfolio
|
| 247 |
+
|
| 248 |
+
*Part of the [Mixture of Attention Series](https://huggingface.co/reaperdoesntknow) by [Convergent Intelligence LLC: Research Division](https://huggingface.co/reaperdoesntknow)*
|
| 249 |
+
|
| 250 |
+
|
| 251 |
+
### Related Models
|
| 252 |
+
|
| 253 |
+
| Model | Downloads | Format |
|
| 254 |
+
|-------|-----------|--------|
|
| 255 |
+
| [MoA-100M](https://huggingface.co/reaperdoesntknow/MoA-100M) | 14 | HF |
|
| 256 |
+
| [MoA-155M](https://huggingface.co/reaperdoesntknow/MoA-155M) | 2 | HF |
|
| 257 |
+
| [MoA-400M](https://huggingface.co/reaperdoesntknow/MoA-400M) | 3 | HF |
|
| 258 |
+
|
| 259 |
+
### Top Models from Our Lab
|
| 260 |
+
|
| 261 |
+
| Model | Downloads |
|
| 262 |
+
|-------|-----------|
|
| 263 |
+
| [Qwen3-1.7B-Thinking-Distil](https://huggingface.co/reaperdoesntknow/Qwen3-1.7B-Thinking-Distil) | 501 |
|
| 264 |
+
| [LFM2.5-1.2B-Distilled-SFT](https://huggingface.co/reaperdoesntknow/LFM2.5-1.2B-Distilled-SFT) | 342 |
|
| 265 |
+
| [Qwen3-1.7B-Coder-Distilled-SFT](https://huggingface.co/reaperdoesntknow/Qwen3-1.7B-Coder-Distilled-SFT) | 302 |
|
| 266 |
+
| [Qwen3-0.6B-Distilled-30B-A3B-Thinking-SFT-GGUF](https://huggingface.co/reaperdoesntknow/Qwen3-0.6B-Distilled-30B-A3B-Thinking-SFT-GGUF) | 203 |
|
| 267 |
+
| [Qwen3-1.7B-Coder-Distilled-SFT-GGUF](https://huggingface.co/reaperdoesntknow/Qwen3-1.7B-Coder-Distilled-SFT-GGUF) | 194 |
|
| 268 |
+
|
| 269 |
+
**Total Portfolio: 41 models | 2,781 total downloads**
|
| 270 |
+
|
| 271 |
+
|
| 272 |
+
*Last updated: 2026-03-28 12:57 UTC*
|