---
language:
- en
license: llama3.2
tags:
- mobile
- edge-ai
- quantized
- gguf
- 3b
pipeline_tag: text-generation
---

# Llama 3.2 3B Instruct - Mobile (GGUF)

The sweet spot between size and capability. When 1B isn't enough but you still need mobile compatibility.

| Property | Value |
|----------|-------|
| **Parameters** | 3.2 billion |
| **Size** | ~2.1 GB |
| **Speed** | ~16 tok/s (S20 FE CPU) |
| **Quality Retention** | ~96% |

## Best For

- Complex reasoning on mobile (better than 1B)
- Long-form content generation
- Multi-turn conversations with context
- Advanced RAG pipelines
- Research assistant applications