A collection of Small Language Models pretrained from scratch (using only PyTorch) on the TinyStories dataset, on a single Tesla T4 16GB GPU.
Namrata Thakur
NamrataThakur
AI & ML interests
Small Language Model, Fine-Tuning, From Scratch
Recent Activity
updated a model 24 days ago
NamrataThakur/Small_Language_Model_MOE_127M_Pretrained
updated a model 24 days ago
NamrataThakur/Small_Language_Model_GQA_48M_Pretrained
updated a model 24 days ago
NamrataThakur/Small_Language_Model_MHA_53M_Pretrained
Organizations
None yet