3morixd's picture
Professional model card upgrade: benchmarks, code examples, usage guide
56eb540 verified
|
Raw
History Blame Contribute Delete
630 Bytes
---
language:
- en
license: llama3.2
tags:
- mobile
- edge-ai
- quantized
- gguf
- 3b
pipeline_tag: text-generation
---
# Llama 3.2 3B Instruct - Mobile (GGUF)
The sweet spot between size and capability. When 1B isn't enough but you still need mobile compatibility.
| Property | Value |
|----------|-------|
| **Parameters** | 3.2 billion |
| **Size** | ~2.1 GB |
| **Speed** | ~16 tok/s (S20 FE CPU) |
| **Quality Retention** | ~96% |
## Best For
- Complex reasoning on mobile (better than 1B)
- Long-form content generation
- Multi-turn conversations with context
- Advanced RAG pipelines
- Research assistant applications