tiny-shakespeare / README.md
leafora's picture
Update README.md
55c830f verified
Model specification
- Params: 21 million
- Architecture: Decoder-only transformer
- Training data: 1.1 million tokens from Shakespeare text
- Context length: 256