Add INT8 and INT4 quantized weights

Files changed (2) hide show

llada_int4_quantized.pt ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:b42f17257baa051badbedd1d4e577fe868a8fa9fc834efb3c7a54ffca9538685
+size 4788526525

llada_int8_quantized.pt ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:d5c9b729256e750902a8a91033deecacd091d2b8779a325828580aa9476eb3be
+size 8537189053