File size: 1,731 Bytes
609a8f0
 
c1501c6
 
 
97c2636
c1501c6
 
 
 
 
609a8f0
 
c1501c6
609a8f0
97c2636
ed0cfb4
c1501c6
ed0cfb4
c1501c6
ed0cfb4
 
 
 
c1501c6
cb2ab87
ed0cfb4
c1501c6
 
97c2636
ed0cfb4
97c2636
c1501c6
97c2636
cb2ab87
c1501c6
 
 
97c2636
c1501c6
97c2636
 
c1501c6
97c2636
c1501c6
 
ed0cfb4
 
 
 
 
 
 
 
97c2636
c1501c6
ed0cfb4
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
---
license: apache-2.0
language:
  - en
library_name: transformers
tags:
  - mobile
  - on-device
  - quantized
  - gguf
  - dispatchai
pipeline_tag: text-generation
---

# SmolLM2-360M-Instruct-mobile**Verified on real phone hardware** — Snapdragon 865, June 2026.

## Phone Benchmark (Samsung S20 FE, Snapdragon 865)

| Metric | Value |
|--------|-------|
| **Phone Speed** | **21.5 tokens/sec** |
| **CPU Speed** | 29.1 tokens/sec |
| **File Size** | 258 MB |
| **Chat Format** | chatml |
| **Test Output** | "Paris" ✅ (correct) |

## Usage

### Python (llama-cpp-python)
```python
from llama_cpp import Llama

llm = Llama(model_path="model.gguf", chat_format="chatml", n_ctx=512, n_threads=4, verbose=False)
response = llm.create_chat_completion(
    messages=[{"role": "user", "content": "What is the capital of France?"}],
    max_tokens=50,
)
print(response["choices"][0]["message"]["content"])
```

### dispatchAI SDK
```python
from dispatchai import load_model
model = load_model("SmolLM2-360M-Instruct-mobile", backend="gguf")
print(model.chat("What is the capital of France?"))
```

### On Android (via ADB)
```bash
hf download dispatchAI/SmolLM2-360M-Instruct-mobile model.gguf
MSYS_NO_PATHCONV=1 adb push model.gguf /data/local/tmp/
MSYS_NO_PATHCONV=1 adb shell "cd /data/local/tmp && LD_LIBRARY_PATH=/data/local/tmp ./llama-cli -m model.gguf -p 'Hello' -n 30 -t 4 -st"
```

## Model Details

| Attribute | Value |
|-----------|-------|
| **Base Model** | HuggingFaceTB/SmolLM2-360M-Instruct |
| **File Size** | 258 MB |
| **Format** | GGUF |
| **Chat Format** | chatml |
| **License** | apache-2.0 |

## About dispatchAI

[dispatchAI](https://huggingface.co/dispatchAI) — Small. Mobile. Free. UAE-built.