Update README.md
Browse files
README.md
CHANGED
|
@@ -30,7 +30,7 @@ HRWKV7-Reka-Flash3-Preview is an experimental hybrid architecture model that com
|
|
| 30 |
- 6 GQA layers (No Rope, No Position Embeddings)
|
| 31 |
- **Hidden Dimension:** 6144
|
| 32 |
- **Training Context Window:** 4096 tokens
|
| 33 |
-
- **Inference Context Window** 32768
|
| 34 |
- **Training Strategy** Following RADLADS method based knowledge distillation
|
| 35 |
|
| 36 |
## Technical Innovation
|
|
|
|
| 30 |
- 6 GQA layers (No Rope, No Position Embeddings)
|
| 31 |
- **Hidden Dimension:** 6144
|
| 32 |
- **Training Context Window:** 4096 tokens
|
| 33 |
+
- **Inference Context Window** 32768+
|
| 34 |
- **Training Strategy** Following RADLADS method based knowledge distillation
|
| 35 |
|
| 36 |
## Technical Innovation
|