majentik
Add MLX 8-bit quantized model with KV cache compression
cfb84ae verified