waltgrace
/

mlx-expert-sniper

Image-Text-to-Text

Mixture of Experts

mixture-of-experts

vision-language

falcon-perception

Model card Files Files and versions

mlx-expert-sniper / src

Commit History

Add Gemma 4-26B-A4B support: 4.15 tok/s on M4 Mac Mini

3f56a7b

Nico Claude Opus 4.6 (1M context) commited on Apr 6

Add Gemma 4 MLX model class + preprocess

cc1d5e2
verified

waltgrace commited on Apr 2

Add Gemma 4 MLX model class + preprocess

4a30158
verified

waltgrace commited on Apr 2

Add Gemma 4 MLX model class + preprocess

337dcd8
verified

waltgrace commited on Apr 2

Add Gemma 4 MLX model class + preprocess

e2bb666
verified

waltgrace commited on Apr 2

Add Gemma 4 MLX model class + preprocess

7409761
verified

waltgrace commited on Apr 2

Add Coder model + multi-model fixes

46f93d5
verified

waltgrace commited on Apr 2

Add Coder model + multi-model fixes

3a81687
verified

waltgrace commited on Apr 2

Add Coder model + multi-model fixes

62e0ebd
verified

waltgrace commited on Apr 2

Add Coder model + multi-model fixes

7878baa
verified

waltgrace commited on Apr 2

Add Coder model + multi-model fixes

bded42e
verified

waltgrace commited on Apr 2

Add chat command + shared generate module

3e350f6
verified

waltgrace commited on Apr 2

Add chat command + shared generate module

8f6ad09
verified

waltgrace commited on Apr 2

Add 30B support + bias sweep results

517be8a
verified

waltgrace commited on Apr 2

Add 30B support + bias sweep results

8829376
verified

waltgrace commited on Apr 2

Add 30B support + bias sweep results

22d3dc7
verified

waltgrace commited on Apr 2

Add Ollama-compatible serve command

f7efda4
verified

waltgrace commited on Apr 2

Add Ollama-compatible serve command

55fe21b
verified

waltgrace commited on Apr 2

Add download command: mlx-sniper download qwen3.5-35b

e1498f9
verified

waltgrace commited on Apr 2

Add download command: mlx-sniper download qwen3.5-35b

d694ca1
verified

waltgrace commited on Apr 2

Update src/mlx_expert_sniper/preprocess.py

05209e2
verified

waltgrace commited on Apr 1

Update src/mlx_expert_sniper/expert_io.py

bc2c2b1
verified

waltgrace commited on Apr 1

Update src/mlx_expert_sniper/engine.py

7e0ed9b
verified

waltgrace commited on Apr 1

Update src/mlx_expert_sniper/coactivation.py

482f969
verified

waltgrace commited on Apr 1

Update src/mlx_expert_sniper/calibrate.py

9110344
verified

waltgrace commited on Apr 1

Update src/mlx_expert_sniper/cli.py

1a7f86e
verified

waltgrace commited on Apr 1

Update src/mlx_expert_sniper/init.py

0535634
verified

waltgrace commited on Apr 1

v0.2.0: Add Qwen3.5-35B-A3B support (5.78 tok/s, 19.5 GB on 16 GB RAM)

d14a3c2
verified

waltgrace commited on Mar 31

v0.1.0: MoE expert sniping for MLX — run models larger than your RAM

2d8a9bb
verified

waltgrace commited on Mar 30