99e9892
1
2
3
4
5
6
7
8
--- license: mit --- This is a custom model with 123.5M parameter. - A modified version of GPT-2 - More data will be added soon - Pretraining was done on Fineweb Edu dataset - Finetuning not done