NamrataThakur/Small_Language_Model_GQA_48M_Pretrained • Text Generation • Updated 23 days ago • 2.64k • 1
NamrataThakur/Small_Language_Model_MOE_127M_Pretrained • Text Generation • Updated 23 days ago • 2.64k • 1