majentik's picture
Add MLX quantized model with KV cache compression
1785a33 verified