gemma-4-E4B-RotorQuant-MLX-2bit / generation_config.json

Commit History

Add MLX quantized model with KV cache compression
303f12f
verified

majentik committed on