# einygpt

Here are the models I've trained using the transformer I wrote in [einygpt](https://github.com/clankur/einygpt). For reference, they are:

- [a multihead attention model](./model_weights_mha.pth) replicating the model discussed in the [TinyStories paper](https://arxiv.org/abs/2305.07759) using the GPT2Tokenizer
- [a multiquery attention model](./model_weights_mqa.pth) using the GPT2Tokenizer
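A minimal sketch of inspecting one of these checkpoints with `torch.load`. The helper name is hypothetical, and I'm assuming each `.pth` file stores a plain `state_dict` (the usual `torch.save(model.state_dict(), path)` convention); to actually run inference you'd instantiate the matching model class from einygpt and call `load_state_dict` on it. The demo below round-trips a stand-in dict so the snippet runs without the checkpoint files.

```python
import torch

def load_checkpoint(path: str) -> dict:
    # map_location="cpu" lets the weights load on machines without a GPU
    return torch.load(path, map_location="cpu")

# Stand-in state_dict so this is runnable without the real .pth files
dummy = {"wte.weight": torch.zeros(4, 8)}
torch.save(dummy, "demo.pth")

loaded = load_checkpoint("demo.pth")
print(sorted(loaded.keys()))
```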