Add MLX 8-bit quantized model with KV cache compression (commit 24a37dc, verified), committed by majentik 3 days ago