Update README.md
README.md
CHANGED
@@ -13,8 +13,10 @@ The code to run this custom model can be found [here](https://github.com/tdooms/
 ## Model Details
 - 30 million parameters
 - 6 layers
+- 8 attention heads
 - model dimension 512
 - bilinear MLP with expansion factor 4
 - context length of 256
+- trained for 1 epoch (~2.5B tokens)
 - rotary positional embedding
 - custom tinystories [tokenizer](https://huggingface.co/tdooms/ts-tokenizer-4096)
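As a rough sanity check on the listed details, the sketch below collects them in a config object and estimates the parameter count. `ModelConfig` and `estimate_params` are hypothetical names, not part of the linked code. The estimate assumes a vocabulary of 4096 (inferred only from the tokenizer name), untied input and output embeddings, no bias terms, negligible normalization parameters, and a bilinear MLP with two input projections and one output projection. None of these assumptions are confirmed by the README.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelConfig:
    """Hypothetical container for the values listed under Model Details."""
    n_layer: int = 6
    n_head: int = 8
    d_model: int = 512
    mlp_expansion: int = 4
    n_ctx: int = 256
    vocab_size: int = 4096         # assumption: inferred from the tokenizer name
    tied_embeddings: bool = False  # assumption: separate embed/unembed matrices

    @property
    def d_head(self) -> int:
        # Per-head dimension when d_model is split evenly across heads.
        return self.d_model // self.n_head

    @property
    def d_hidden(self) -> int:
        # Hidden width of the MLP.
        return self.d_model * self.mlp_expansion


def estimate_params(cfg: ModelConfig) -> int:
    """Estimate weight count, ignoring biases and normalization parameters."""
    # Attention: query, key, value and output projections, each d_model x d_model.
    attn = 4 * cfg.d_model * cfg.d_model
    # Bilinear MLP (assumed form): two input projections and one output projection.
    mlp = 3 * cfg.d_model * cfg.d_hidden
    embed = cfg.vocab_size * cfg.d_model
    unembed = 0 if cfg.tied_embeddings else embed
    return cfg.n_layer * (attn + mlp) + embed + unembed


if __name__ == "__main__":
    cfg = ModelConfig()
    print(f"head dimension: {cfg.d_head}")
    print(f"estimated parameters: {estimate_params(cfg):,}")
```

Under these assumptions the estimate comes to about 29.4 million, which is consistent with the "30 million parameters" figure. Rotary positional embeddings add no learned weights, so they do not appear in the count.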