Update README.md
Browse files
README.md
CHANGED
|
@@ -1,5 +1,5 @@
|
|
| 1 |
---
|
| 2 |
-
license:
|
| 3 |
tags:
|
| 4 |
- pytorch
|
| 5 |
- addressed-state-attention
|
|
@@ -38,4 +38,4 @@ FineWeb, 187M params: 3.73 val loss / 41.6 PPL (75k steps)
|
|
| 38 |
Architecture: 21 layers, 768d, 12 heads, 16 slots
|
| 39 |
Links
|
| 40 |
Code: https://github.com/DigitalDaimyo/AddressedStateAttention
|
| 41 |
-
Paper: https://github.com/DigitalDaimyo/AddressedStateAttention/tree/main/paper_drafts
|
|
|
|
| 1 |
---
|
| 2 |
+
license: mit
|
| 3 |
tags:
|
| 4 |
- pytorch
|
| 5 |
- addressed-state-attention
|
|
|
|
| 38 |
Architecture: 21 layers, 768d, 12 heads, 16 slots
|
| 39 |
Links
|
| 40 |
Code: https://github.com/DigitalDaimyo/AddressedStateAttention
|
| 41 |
+
Paper: https://github.com/DigitalDaimyo/AddressedStateAttention/tree/main/paper_drafts
|