--- language: - en license: llama3.2 tags: - mobile - edge-ai - quantized - gguf - 3b pipeline_tag: text-generation --- # Llama 3.2 3B Instruct - Mobile (GGUF) The sweet spot between size and capability. When 1B isn't enough but you still need mobile compatibility. | Property | Value | |----------|-------| | **Parameters** | 3.2 billion | | **Size** | ~2.1 GB | | **Speed** | ~16 tok/s (S20 FE CPU) | | **Quality Retention** | ~96% | ## Best For - Complex reasoning on mobile (better than 1B) - Long-form content generation - Multi-turn conversations with context - Advanced RAG pipelines - Research assistant applications