Leanstral-RotorQuant-MLX-8bit / generation_config.json

Commit History

Add MLX 8-bit quantized model with KV cache compression
cfb84ae
verified

majentik commited on