Commit History

Add Gemma 4-26B-A4B support: 4.15 tok/s on M4 Mac Mini
3f56a7b

Nico Claude Opus 4.6 (1M context) commited on

Add Gemma 4 MLX model class + preprocess
cc1d5e2
verified

waltgrace commited on

Add Gemma 4 MLX model class + preprocess
4a30158
verified

waltgrace commited on

Add Gemma 4 MLX model class + preprocess
337dcd8
verified

waltgrace commited on

Add Gemma 4 MLX model class + preprocess
e2bb666
verified

waltgrace commited on

Add Gemma 4 MLX model class + preprocess
7409761
verified

waltgrace commited on

Add Coder model + multi-model fixes
46f93d5
verified

waltgrace commited on

Add Coder model + multi-model fixes
3a81687
verified

waltgrace commited on

Add Coder model + multi-model fixes
62e0ebd
verified

waltgrace commited on

Add Coder model + multi-model fixes
7878baa
verified

waltgrace commited on

Add Coder model + multi-model fixes
bded42e
verified

waltgrace commited on

Add chat command + shared generate module
3e350f6
verified

waltgrace commited on

Add chat command + shared generate module
8f6ad09
verified

waltgrace commited on

Add 30B support + bias sweep results
517be8a
verified

waltgrace commited on

Add 30B support + bias sweep results
8829376
verified

waltgrace commited on

Add 30B support + bias sweep results
22d3dc7
verified

waltgrace commited on

Add Ollama-compatible serve command
f7efda4
verified

waltgrace commited on

Add Ollama-compatible serve command
55fe21b
verified

waltgrace commited on

Add download command: mlx-sniper download qwen3.5-35b
e1498f9
verified

waltgrace commited on

Add download command: mlx-sniper download qwen3.5-35b
d694ca1
verified

waltgrace commited on

Update src/mlx_expert_sniper/preprocess.py
05209e2
verified

waltgrace commited on

Update src/mlx_expert_sniper/expert_io.py
bc2c2b1
verified

waltgrace commited on

Update src/mlx_expert_sniper/engine.py
7e0ed9b
verified

waltgrace commited on

Update src/mlx_expert_sniper/coactivation.py
482f969
verified

waltgrace commited on

Update src/mlx_expert_sniper/calibrate.py
9110344
verified

waltgrace commited on

Update src/mlx_expert_sniper/cli.py
1a7f86e
verified

waltgrace commited on

Update src/mlx_expert_sniper/__init__.py
0535634
verified

waltgrace commited on

v0.2.0: Add Qwen3.5-35B-A3B support (5.78 tok/s, 19.5 GB on 16 GB RAM)
d14a3c2
verified

waltgrace commited on

v0.1.0: MoE expert sniping for MLX — run models larger than your RAM
2d8a9bb
verified

waltgrace commited on