Add MLX 8-bit quantized model with KV cache compression cfb84ae verified majentik commited on 3 days ago