Add MLX 8-bit quantized model with KV cache compression (commit 21d6a54, verified). majentik committed 3 days ago