3morixd committed on
Commit 394694a · verified · 1 Parent(s): f384001

Add model card with phone farm benchmark results

  1. README.md +48 -0
README.md ADDED
@@ -0,0 +1,48 @@
+ ---
+ license: apache-2.0
+ base_model: openbmb/MiniCPM5-1B
+ tags:
+ - dispatch-ai
+ - mobile
+ - quantized
+ - gguf
+ - phone-farm-tested
+ pipeline_tag: text-generation
+ language:
+ - en
+ ---
+
+ # MiniCPM5-1B-mobile
+
+ **Dispatch AI**: Built for mobile. Tested on real phones.
+
+ ## Category
+
+ Text Generation: OpenBMB's multilingual model
+
+ ## Model
+
+ Re-engineered from [openbmb/MiniCPM5-1B](https://huggingface.co/openbmb/MiniCPM5-1B).
+ Quantized to Q4_K_M GGUF for on-device inference via llama.cpp.
+ Size: 656 MB.
+
+ ## Phone Farm Test Results
+
+ Tested on two **Samsung Galaxy S20 FE 5G** units (Snapdragon 865, 8 GB RAM):
+
+ | Phone ID | Generation (tokens/s) | Prompt processing (tokens/s) |
+ |----------|-----------------------|------------------------------|
+ | R3CN30WHS2Z | 21.7 | 67.5 |
+ | R3CN509PLHA | 22.3 | 95.3 |
+
+ - **Average generation speed: 22.0 t/s**
+ - **40-phone aggregate: ~880 t/s** (40 × 22.0 t/s)
+
+
+ ## Usage
+
+ ```bash
+ ./llama-cli -m model.gguf -p "Hello" -n 100 -t 4 -c 512
+ ```
+
+ 🌐 [dispatchAI on HuggingFace](https://huggingface.co/dispatchAI)
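
A quick sanity check of the summary figures in the added README: the sketch below recomputes the average generation speed from the two table rows and scales it to 40 phones. It assumes the ~880 t/s aggregate is a linear projection from the two measured devices, not a 40-device measurement; the README does not say which. The variable names are illustrative only.

```bash
# Generation speeds (tokens/s) copied from the two rows of the results table.
gen_speeds="21.7 22.3"

# Assumption: the aggregate is the per-phone mean multiplied by the farm size.
phones=40

# Mean generation speed across the tested phones, to one decimal place.
avg=$(echo "$gen_speeds" | awk '{ for (i = 1; i <= NF; i++) s += $i; printf "%.1f", s / NF }')

# Projected aggregate throughput if every phone matched the mean.
aggregate=$(awk -v a="$avg" -v n="$phones" 'BEGIN { printf "%.0f", a * n }')

echo "average generation speed: $avg t/s"
echo "projected $phones-phone aggregate: $aggregate t/s"
```

Under that assumption the recomputed values agree with the README's 22.0 t/s and ~880 t/s. Prompt-processing speed varies much more between the two phones (67.5 vs 95.3 t/s), so the same projection would be less reliable for that column.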