VISTA-4B-MLX-4bit

MLX (Apple Silicon) conversion of inclusionAI/VISTA-4B, quantized to 4-bit. First MLX build of this model. Text-only build of the backbone.

Quantizations

Part of the VISTA MLX collection.

Variant
8-bit
6-bit
5-bit
4-bit (this repo)

Use with mlx-lm

pip install mlx-lm
python -m mlx_lm generate --model pipenetwork/VISTA-4B-MLX-4bit --prompt "Hello" -m 200

Validation

Smoke-tested locally: loads and generates coherent text.

License

apache-2.0 (inherited from base). Quantization config: {"group_size": 64, "bits": 4, "mode": "affine"}.

Downloads last month
19
Safetensors
Model size
0.7B params
Tensor type
BF16
·
U32
·
MLX
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for pipenetwork/VISTA-4B-MLX-4bit

Quantized
(10)
this model

Collection including pipenetwork/VISTA-4B-MLX-4bit