Commit History

Re-upload with properly quantized decoder (encoder/adaptor bf16)
f818655
verified

majentik commited on

Add MLX quantized model with KV cache compression
ba8ef4c
verified

majentik commited on

initial commit
dd6c004
verified

majentik commited on