NamrataThakur/Small_Language_Model_MOE_127M_Pretrained • Text Generation • Updated 26 days ago • 2.62k • 1
NamrataThakur/Small_Language_Model_GQA_48M_Pretrained • Text Generation • Updated 26 days ago • 2.68k • 1
NamrataThakur/Small_Language_Model_MHA_53M_Pretrained • Text Generation • Updated 26 days ago • 2.69k • 1