Update README.md
README.md
CHANGED
@@ -210,7 +210,7 @@ Implemented in Python/PyTorch. No custom CUDA required except BitLinear kernels
 | Risk | Mitigation |
 |------|-----------|
 | **Cannibalization signal too noisy** | Dual-EMA smoothing validated by GCond [6]; cooldown timer; min age |
-| **Merge collapse** still no solution want to avoid replay buffer|
+| **Merge collapse** |still no solution want to avoid replay buffer|
 | **Router instability** | Embedding continuity on split; cooldown between events |
 | **Expert starvation at 100K** | Death mechanism; pressure system|
 | **Split/merge oscillation** | min age before merge; hysteresis; cooldown |