gemma-4-E2B-RotorQuant-MLX-8bit / generation_config.json

Commit History

Add MLX quantized model with KV cache compression
ca1fe84
verified

majentik committed on