llm-jp/optimal-sparsity-code-d512-E16-k16-520M-A520M Text Generation • 0.5B • Updated 17 days ago • 6
llm-jp/optimal-sparsity-code-d512-E32-k16-920M-A520M Text Generation • 0.9B • Updated 17 days ago • 7
llm-jp/optimal-sparsity-code-d512-E128-k16-3.3B-A520M Text Generation • 3B • Updated 17 days ago • 10
llm-jp/optimal-sparsity-code-d1024-E128-k2-13.2B-A470M Text Generation • 13B • Updated 17 days ago • 8
llm-jp/optimal-sparsity-code-d1024-E256-k2-26.0B-A470M Text Generation • 26B • Updated 17 days ago • 8
llm-jp/optimal-sparsity-code-d1024-E128-k4-13.2B-A670M Text Generation • 13B • Updated 17 days ago • 15
llm-jp/optimal-sparsity-code-d1024-E256-k4-26.0B-A670M Text Generation • 26B • Updated 17 days ago • 9
llm-jp/optimal-sparsity-code-d1024-E128-k8-13.2B-A1.1B Text Generation • 13B • Updated 17 days ago • 7
llm-jp/optimal-sparsity-code-d1024-E256-k8-26.0B-A1.1B Text Generation • 26B • Updated 17 days ago • 8
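
The repository names above appear to follow a fixed pattern: `d` (model width), `E` (number of experts), `k` (experts routed per token), then the total parameter count and the active parameter count prefixed with `A`. That reading is an assumption inferred from the names themselves (for example, the `E16-k16` entry lists 520M for both totals); the listing does not state it. Under that assumption, a minimal sketch for splitting a model ID into its fields might look like this. `MoEConfig` and `parse_model_id` are hypothetical helper names, not part of any library.

```python
import re
from typing import NamedTuple


class MoEConfig(NamedTuple):
    """Fields assumed to be encoded in the repository name (hypothetical)."""
    width: int            # value after "d" (assumed: hidden size)
    experts: int          # value after "E" (assumed: experts per MoE layer)
    top_k: int            # value after "k" (assumed: experts routed per token)
    total_params: float   # e.g. "13.2B" -> 13.2e9
    active_params: float  # e.g. "A470M" -> 470e6


_UNITS = {"M": 1e6, "B": 1e9}

# Matches the trailing "-d<int>-E<int>-k<int>-<total><M|B>-A<active><M|B>" part.
_PATTERN = re.compile(
    r"-d(?P<d>\d+)-E(?P<E>\d+)-k(?P<k>\d+)"
    r"-(?P<total>[\d.]+)(?P<tu>[MB])"
    r"-A(?P<active>[\d.]+)(?P<au>[MB])$"
)


def parse_model_id(model_id: str) -> MoEConfig:
    """Parse a model ID under the assumed naming scheme."""
    m = _PATTERN.search(model_id)
    if m is None:
        raise ValueError(f"unrecognized model ID: {model_id!r}")
    return MoEConfig(
        width=int(m["d"]),
        experts=int(m["E"]),
        top_k=int(m["k"]),
        total_params=float(m["total"]) * _UNITS[m["tu"]],
        active_params=float(m["active"]) * _UNITS[m["au"]],
    )


cfg = parse_model_id("llm-jp/optimal-sparsity-code-d1024-E128-k2-13.2B-A470M")
# Fraction of experts used per token, if k and E mean what is assumed above.
routed_fraction = cfg.top_k / cfg.experts
```

Sorting or grouping the parsed entries by `top_k / experts` is one way to compare the variants by sparsity, again only to the extent that the assumed meaning of the name fields holds.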