majentik's picture
Add MLX 8-bit quantized model with KV cache compression
21d6a54 verified