ekryski
/

FastVLM-0.5B-4bit

4-bit precision

Model card Files Files and versions

FastVLM-0.5B-4bit / README.md

ekryski's picture

Upload README.md with huggingface_hub

5d8317c verified 4 days ago

|

history blame contribute delete

914 Bytes

	---
	license: apache-2.0
	base_model: mlx-community/FastVLM-0.5B-bf16
	language:
	- en
	tags:
	- mlx
	- ffai
	- quantized
	- 4bit
	- affine
	---

	# FastVLM-0.5B-4bit

	4-bit affine quantization of [mlx-community/FastVLM-0.5B-bf16](https://huggingface.co/mlx-community/FastVLM-0.5B-bf16), produced with [FFAI](https://github.com/thewafflehaus/FFAI) 0.1.0's `ffai convert` (mlx-affine format, `group_size=64`).

	## Conversion

	```bash
	ffai convert mlx-community/FastVLM-0.5B-bf16 --bits 4 \
	--upload-repo ekryski/FastVLM-0.5B-4bit
	```

	## See also

	- [FFAI](https://github.com/thewafflehaus/FFAI) — fast Apple Silicon LLM inference. `Model.load("ekryski/FastVLM-0.5B-4bit")` runs this checkpoint end-to-end.
	- [FFAI quickstart](https://github.com/thewafflehaus/FFAI/blob/main/documentation/quickstart.md)
	- [FFAI quantization docs](https://github.com/thewafflehaus/FFAI/blob/main/documentation/quantization.md)