metadata
license: apache-2.0

switch-large-128_qmoe

This is the google/switch-large-128 model, quantized to ternary precision with the QMoE framework and stored in the custom QMoE format, which compresses the quantized weights further.
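
For readers unfamiliar with the term, "ternary precision" means each weight is stored as one of three values. The sketch below illustrates the idea with plain round-to-nearest quantization. It is not the QMoE procedure and does not cover the compressed storage format; the grid `{w_min, 0, w_max}` and the helper names `ternary_quantize` and `dequantize` are assumptions made for illustration only.

```python
def ternary_quantize(weights):
    """Map each weight to the nearest of three grid values.

    Assumed grid for illustration: (w_min, 0, w_max), clamped so that
    the low value is <= 0 and the high value is >= 0.
    Returns the per-weight codes (0, 1 or 2) and the grid itself.
    """
    grid = (min(min(weights), 0.0), 0.0, max(max(weights), 0.0))
    codes = [min(range(3), key=lambda i: abs(w - grid[i])) for w in weights]
    return codes, grid


def dequantize(codes, grid):
    """Recover approximate weights by looking up each code in the grid."""
    return [grid[c] for c in codes]


row = [-0.9, -0.1, 0.05, 0.4, 1.2]   # hypothetical weight row
codes, grid = ternary_quantize(row)
print(codes)                         # one of three codes per weight
print(dequantize(codes, grid))       # weights snapped to the grid
```

Only the codes and the three grid values need to be stored, which is why a ternary model is much smaller than its floating-point original.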

Please see the QMoE repository for how to use this model.