SauravP97
/

tiny-stories-3M

Text Generation

Model card Files Files and versions

SauravP97 commited on 20 days ago

Commit

e7b122d

·

verified ·

1 Parent(s): 4933754

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -50,7 +50,7 @@ The model was trained on the TinyStories dataset, which consist of synthetic sho
 ### Training Procedure
-The model was trained from scratch on a **NVIDIA T4** GPU for around 3 hours to achieve a loss of ~`2.17`. The model was trained for `0.22` epochs estimating around `55K` steps.
 We used **EleutherAI/gpt-neo-125M** tokenizer model training and inference.
 #### Training Hyperparameters

 ### Training Procedure
+The model was trained from scratch on a **NVIDIA T4** GPU for around 3 hours to achieve a loss of `2.17`. The model was trained for `0.22` epochs estimating around `55K` steps.
 We used **EleutherAI/gpt-neo-125M** tokenizer model training and inference.
 #### Training Hyperparameters