VISTA-4B-MLX-5bit

MLX (Apple Silicon) conversion of inclusionAI/VISTA-4B, quantized to 5-bit. First MLX build of this model. Text-only build of the backbone.

Quantizations

Part of the VISTA MLX collection.

Variant
8-bit
6-bit
5-bit (this repo)
4-bit

Use with mlx-lm

pip install mlx-lm
python -m mlx_lm generate --model pipenetwork/VISTA-4B-MLX-5bit --prompt "Hello" -m 200

Validation

Smoke-tested locally: loads and generates coherent text.

License

apache-2.0 (inherited from base). Quantization config: {"group_size": 64, "bits": 5, "mode": "affine"}.

Downloads last month
3
Safetensors
Model size
0.8B params
Tensor type
BF16
·
U32
·
MLX
Hardware compatibility
Log In to add your hardware

5-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for pipenetwork/VISTA-4B-MLX-5bit

Quantized
(10)
this model

Collection including pipenetwork/VISTA-4B-MLX-5bit