---
license: apache-2.0
base_model: HuggingFaceTB/SmolLM2-360M-Instruct
language:
  - en
tags:
  - mlx
  - ffai
  - quantized
  - 4bit
  - affine
---

# SmolLM2-360M-Instruct-4bit

A 4-bit affine quantization of [HuggingFaceTB/SmolLM2-360M-Instruct](https://huggingface.co/HuggingFaceTB/SmolLM2-360M-Instruct), produced with the `ffai convert` command from [FFAI](https://github.com/thewafflehaus/FFAI) 0.1.0. Weights are stored in the mlx-affine format with `group_size=64`.
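
For readers unfamiliar with the terms, the following is a rough sketch of what group-wise affine quantization generally means: each group of 64 weights shares one scale and one bias, and each weight is stored as a 4-bit code. This is an illustration in plain Python, not FFAI's actual implementation. The function names are hypothetical, and the 16-bit storage for the per-group scale and bias is an assumption used only to estimate the storage cost.

```python
# Illustrative sketch of group-wise affine quantization.
# NOT FFAI's code: all function names here are hypothetical.

def quantize_group(weights, bits=4):
    """Map one group of floats to unsigned integer codes plus (scale, bias)."""
    levels = (1 << bits) - 1              # 15 distinct steps for 4-bit codes
    w_min, w_max = min(weights), max(weights)
    span = w_max - w_min
    scale = span / levels if span > 0 else 1.0   # avoid a zero scale
    codes = []
    for w in weights:
        q = round((w - w_min) / scale)
        codes.append(min(levels, max(0, q)))     # clamp into [0, levels]
    return codes, scale, w_min


def dequantize_group(codes, scale, bias):
    """Reconstruct approximate floats from codes: w ~= q * scale + bias."""
    return [q * scale + bias for q in codes]


def effective_bits_per_weight(bits=4, group_size=64, meta_bits=16):
    """Storage per weight, ASSUMING one scale and one bias of `meta_bits`
    bits each are stored per group."""
    return bits + 2 * meta_bits / group_size


if __name__ == "__main__":
    group = [i / 63 - 0.5 for i in range(64)]    # one group of 64 weights
    codes, scale, bias = quantize_group(group)
    restored = dequantize_group(codes, scale, bias)
    worst = max(abs(a - b) for a, b in zip(group, restored))
    print(f"scale={scale:.5f} bias={bias:.2f} worst abs error={worst:.5f}")
    print(f"effective bits per weight: {effective_bits_per_weight()}")
```

Under that storage assumption, the per-group metadata adds 32 bits per 64 weights, so the effective cost is about 4.5 bits per weight rather than exactly 4.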

## Conversion

```bash
ffai convert HuggingFaceTB/SmolLM2-360M-Instruct --bits 4 \
    --upload-repo ekryski/SmolLM2-360M-Instruct-4bit
```

## See also

- [FFAI](https://github.com/thewafflehaus/FFAI) — fast Apple Silicon LLM inference. `Model.load("ekryski/SmolLM2-360M-Instruct-4bit")` runs this checkpoint end-to-end.
- [FFAI quickstart](https://github.com/thewafflehaus/FFAI/blob/main/documentation/quickstart.md)
- [FFAI quantization docs](https://github.com/thewafflehaus/FFAI/blob/main/documentation/quantization.md)