tdooms/ts-medium

tdooms committed on Oct 15, 2024

Commit e5e17f9 (verified), 1 parent: 23c11d3

Update README.md

Files changed (1): README.md

@@ -13,8 +13,10 @@ The code to run this custom model can be found [here](https://github.com/tdooms/
 ## Model Details
 - 30 million parameters
 - 6 layers
+- 8 attention heads
 - model dimension 512
 - bilinear MLP with expansion factor 4
 - context length of 256
+- trained for 1 epoch (~2.5B tokens)
 - rotary positional embedding
 - custom tinystories [tokenizer](https://huggingface.co/tdooms/ts-tokenizer-4096)
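
As a rough sanity check on the figures in the updated list, the sketch below estimates the parameter count and per-head dimension from the listed hyperparameters. Several inputs are assumptions rather than facts from the README: a vocabulary of 4096 tokens (inferred only from the tokenizer name), untied input and output embeddings, no bias terms, and a bilinear MLP made of two up-projections and one down-projection. Normalization parameters are ignored, and rotary embeddings add no learned weights.

```python
# Back-of-the-envelope parameter estimate for the listed configuration.
# ASSUMPTIONS (not stated in the README): vocab size 4096 (guessed from the
# tokenizer name), untied embeddings, no biases, and a bilinear MLP with two
# up-projections and one down-projection. Norm parameters are ignored.

D_MODEL = 512      # model dimension (from the README)
N_LAYERS = 6       # number of layers (from the README)
N_HEADS = 8        # attention heads (from the README)
EXPANSION = 4      # MLP expansion factor (from the README)
VOCAB_SIZE = 4096  # assumption, see above


def estimate_parameters() -> int:
    """Return an approximate weight count under the stated assumptions."""
    d_hidden = EXPANSION * D_MODEL
    attention = 4 * D_MODEL * D_MODEL     # Q, K, V and output projections
    mlp = 3 * D_MODEL * d_hidden          # two up-projections, one down-projection
    embeddings = 2 * VOCAB_SIZE * D_MODEL  # input embedding plus unembedding
    return N_LAYERS * (attention + mlp) + embeddings


head_dim = D_MODEL // N_HEADS
total = estimate_parameters()
print(f"head dimension: {head_dim}")
print(f"estimated parameters: {total / 1e6:.1f}M")
```

Under these assumptions the estimate lands close to the stated 30 million parameters. A different MLP layout or tied embeddings would shift the total.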