Yixin Song
commited on
Update README.md
Browse files
README.md
CHANGED
|
@@ -28,7 +28,7 @@ SmallThinker is a family of **on-device native** Mixture-of-Experts (MoE) langua
|
|
| 28 |
| **Activated Parameters** | 0.6B |
|
| 29 |
| **Number of Layers** | 32 |
|
| 30 |
| **Attention Hidden Dimension** | 1536 |
|
| 31 |
-
| **MoE Hidden Dimension** (per Expert) |
|
| 32 |
| **Number of Attention Heads** | 12 |
|
| 33 |
| **Number of Experts** | 32 |
|
| 34 |
| **Selected Experts per Token** | 4 |
|
|
|
|
| 28 |
| **Activated Parameters** | 0.6B |
|
| 29 |
| **Number of Layers** | 32 |
|
| 30 |
| **Attention Hidden Dimension** | 1536 |
|
| 31 |
+
| **MoE Hidden Dimension** (per Expert) | 768 |
|
| 32 |
| **Number of Attention Heads** | 12 |
|
| 33 |
| **Number of Experts** | 32 |
|
| 34 |
| **Selected Experts per Token** | 4 |
|