A collection of small language models pretrained from scratch (using only PyTorch) on the TinyStories dataset, using a single Tesla T4 GPU with 16 GB of memory.
Namrata Thakur
NamrataThakur
AI & ML interests
Small Language Model, Fine-Tuning, From Scratch