3morixd's picture
Professional model card upgrade: benchmarks, code examples, usage guide
56eb540 verified
|
Raw
History Blame Contribute Delete
630 Bytes
metadata
language:
  - en
license: llama3.2
tags:
  - mobile
  - edge-ai
  - quantized
  - gguf
  - 3b
pipeline_tag: text-generation

Llama 3.2 3B Instruct - Mobile (GGUF)

The sweet spot between size and capability. When 1B isn't enough but you still need mobile compatibility.

Property Value
Parameters 3.2 billion
Size ~2.1 GB
Speed ~16 tok/s (S20 FE CPU)
Quality Retention ~96%

Best For

  • Complex reasoning on mobile (better than 1B)
  • Long-form content generation
  • Multi-turn conversations with context
  • Advanced RAG pipelines
  • Research assistant applications