gemma-4-E4B-RotorQuant-MLX-4bit / generation_config.json

Commit History

Add MLX quantized model with KV cache compression
f1e16a4
verified

majentik commited on