Commit d1631dc · committed by 3morixd · verified · 1 Parent(s): 2607a7a

Add model card with phone farm benchmark results

Files changed (1)
  1. README.md +48 -0
README.md ADDED
@@ -0,0 +1,48 @@
+ ---
+ license: apache-2.0
+ base_model: microsoft/Phi-3.5-mini-instruct
+ tags:
+ - dispatch-ai
+ - mobile
+ - quantized
+ - gguf
+ - phone-farm-tested
+ pipeline_tag: text-generation
+ language:
+ - en
+ ---
+
+ # Phi-3.5-mini-Instruct-mobile
+
+ **Dispatch AI**: Built for mobile. Tested on real phones.
+
+ ## Category
+
+ Text generation, based on Microsoft's compact Phi-3.5-mini instruct model
+
+ ## Model
+
+ Derived from [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct).
+ Quantized to Q4_K_M GGUF for on-device inference via llama.cpp.
+ File size: 2282 MB.
+
+ ## Phone Farm Test Results
+
+ Tested on **Samsung Galaxy S20 FE 5G** (Snapdragon 865, 8 GB RAM). Speeds are in tokens per second (t/s):
+
+ | Phone ID | Generation (t/s) | Prompt (t/s) |
+ |----------|------------------|--------------|
+ | R3CN30WHS2Z | 6.9 | 12.1 |
+ | R3CN509PLHA | 7.9 | 18.2 |
+
+ - **Average generation speed: 7.4 t/s** (mean of the two phones above)
+ - **40-phone aggregate: ~296 t/s** (40 × 7.4 t/s)
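
The two summary figures follow from the table by simple arithmetic. The sketch below reproduces them from the listed generation speeds; it assumes the aggregate is the per-phone mean multiplied by 40, which matches the stated value but is not spelled out in the card. The variable names are illustrative only.

```shell
# Generation speeds (t/s) copied from the table above.
speeds="6.9 7.9"
# Fleet size taken from the "40-phone aggregate" bullet.
phones=40

# Mean of the listed speeds, to one decimal place.
avg=$(printf '%s\n' $speeds | awk '{ sum += $1; n++ } END { printf "%.1f", sum / n }')

# Assumption: every phone sustains the mean rate, so the aggregate is mean * phones.
agg=$(awk -v a="$avg" -v n="$phones" 'BEGIN { printf "%.0f", a * n }')

echo "average: $avg t/s"
echo "aggregate: ~$agg t/s"
```

Since only two phones are listed, the aggregate is best read as an estimate of fleet throughput rather than a measured total.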
+
+ ## Usage
+
+ ```bash
+ ./llama-cli -m model.gguf -p "Hello" -n 100 -t 4 -c 512
+ ```
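
The card does not say how the t/s figures were measured. One plausible way to collect comparable prompt and generation speeds is llama.cpp's `llama-bench` tool, sketched below. The paths, the prompt length (512), and the generation length (128) are assumptions, not the settings used for the table; the thread count mirrors the `-t 4` in the usage command.

```shell
# Paths are assumptions: adjust to where llama.cpp and the GGUF file live.
MODEL="${MODEL:-model.gguf}"
BENCH="${BENCH:-./llama-bench}"

if [ -x "$BENCH" ] && [ -f "$MODEL" ]; then
  # -p: prompt tokens to process, -n: tokens to generate, -t: CPU threads.
  "$BENCH" -m "$MODEL" -p 512 -n 128 -t 4
  ran=1
else
  # Nothing to benchmark here; report it and carry on.
  echo "llama-bench or $MODEL not found; skipping benchmark" >&2
  ran=0
fi
```

`llama-bench` reports prompt processing and token generation rates separately, which corresponds to the two speed columns in the results table.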
+
+ 🌐 [dispatchAI on Hugging Face](https://huggingface.co/dispatchAI)