Commit e5e17f9 by tdooms · verified · 1 parent: 23c11d3

Update README.md

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -13,8 +13,10 @@ The code to run this custom model can be found [here](https://github.com/tdooms/
 ## Model Details
 - 30 million parameters
 - 6 layers
+- 8 attention heads
 - model dimension 512
 - bilinear MLP with expansion factor 4
 - context length of 256
+- trained for 1 epoch (~2.5B tokens)
 - rotary positional embedding
 - custom tinystories [tokenizer](https://huggingface.co/tdooms/ts-tokenizer-4096)
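
As a rough sanity check on the listed model details, the sketch below collects the hyperparameters in a small config object and estimates the parameter count. It is not the repository's code: `ModelConfig` and `estimate_params` are hypothetical names, the vocabulary size of 4096 is inferred from the tokenizer's name, and the estimate assumes untied input and output embeddings, no biases, no normalization parameters, and a bilinear MLP with two up-projections and one down-projection.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelConfig:
    """Hyperparameters from the model card (names are illustrative)."""
    n_layer: int = 6
    n_head: int = 8
    d_model: int = 512
    mlp_expansion: int = 4
    n_ctx: int = 256
    vocab_size: int = 4096  # assumption: inferred from "ts-tokenizer-4096"

    @property
    def d_head(self) -> int:
        # Per-head dimension when d_model is split evenly across heads.
        return self.d_model // self.n_head


def estimate_params(cfg: ModelConfig) -> int:
    """Approximate weight count under the assumptions stated above."""
    d = cfg.d_model
    d_hidden = cfg.mlp_expansion * d
    attn = 4 * d * d            # Q, K, V and output projections
    mlp = 3 * d * d_hidden      # assumption: two up-projections, one down-projection
    embed = 2 * cfg.vocab_size * d  # assumption: untied embedding and unembedding
    return cfg.n_layer * (attn + mlp) + embed


cfg = ModelConfig()
total = estimate_params(cfg)
print(f"head dimension: {cfg.d_head}")
print(f"estimated parameters: {total:,}")
```

Under these assumptions the estimate lands close to the stated 30 million parameters, and 8 heads over a model dimension of 512 gives a head dimension of 64. Rotary positional embeddings add no learned weights, so they do not appear in the count.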