3morixd committed on
Commit caf05e1 · verified · 1 Parent(s): 8597379

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +10 -39
README.md CHANGED
@@ -14,35 +14,18 @@ pipeline_tag: text-generation

  # MiniCPM5-1B-mobile

- ⚠️ **PARTIAL** — Verified June 2026.
+ **WORKS** — Verified June 2026.

  ## Verification Results

  | Prompt | Response | Correct? |
  |--------|----------|----------|
- | What is the capital of France? | "" | ⚠️ |
- | What is 2+2? Just the number. | "```t
- ```
-
- ####
-
- ####
-
- ####
-
- ####
-
- ####
+ | The capital of France is | "the city of Paris, which is located in the Île-de-Cracome, a" | |

- ####

- ####
+ ## Note

- ####
-
- ##" | ⚠️ |
-
- **Chat format**: `chatml`
+ Use raw completion (no chat format). Best for text continuation.

  ## Model Details

@@ -51,8 +34,8 @@ pipeline_tag: text-generation
  | **Base Model** | openbmb/MiniCPM3-4B |
  | **File Size** | 656 MB |
  | **Format** | GGUF |
- | **Chat Format** | chatml |
- | **CPU Speed** | 18.0 tokens/sec |
+ | **Chat Format** | Raw completion (no chat template) |
+ | **CPU Speed** | 18.1 tokens/sec |
  | **License** | apache-2.0 |

  ## Usage
@@ -60,21 +43,9 @@ pipeline_tag: text-generation
  ```python
  from llama_cpp import Llama

- llm = Llama(model_path="model.gguf", chat_format="chatml", n_ctx=512, n_threads=4)
- response = llm.create_chat_completion(
- messages=[{"role": "user", "content": "What is the capital of France?"}],
- max_tokens=50,
- )
- print(response["choices"][0]["message"]["content"])
- ```
-
- ### dispatchAI SDK
- ```python
- from dispatchai import load_model
- model = load_model("MiniCPM5-1B-mobile", backend="gguf")
- print(model.chat("Hello!"))
+ llm = Llama(model_path="model.gguf", n_ctx=512, n_threads=4, verbose=False)
+ response = llm("The capital of France is", max_tokens=30, echo=False)
+ print(response["choices"][0]["text"])
  ```

- ## About dispatchAI
-
- [dispatchAI](https://huggingface.co/dispatchAI) — Small. Mobile. Free. UAE-built.
+ 🚀 [dispatchAI](https://huggingface.co/dispatchAI)
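
The README reports a CPU speed (18.0, then 18.1 tokens/sec) but does not say how it was measured. One plausible way to reproduce a comparable figure is to time a single raw completion and divide the generated token count by the wall-clock time. The sketch below is an assumption, not the author's benchmark: the helper name `tokens_per_second` is hypothetical, and it relies on the completion dict carrying a `usage["completion_tokens"]` entry, as llama-cpp-python completions do. The timing includes prompt processing, so it is an end-to-end number.

```python
import time


def tokens_per_second(generate, prompt, max_tokens=30):
    """Time one raw completion and return generated tokens per second.

    `generate` is any callable shaped like a llama_cpp.Llama instance:
    generate(prompt, max_tokens=...) -> dict with a "usage" entry.
    (Hypothetical helper; not part of llama-cpp-python or the README.)
    """
    start = time.perf_counter()
    response = generate(prompt, max_tokens=max_tokens)
    elapsed = time.perf_counter() - start
    generated = response["usage"]["completion_tokens"]
    # Guard against a zero reading from a very coarse clock.
    return generated / max(elapsed, 1e-9)


# Example use with the model from the README (requires model.gguf locally):
#   from llama_cpp import Llama
#   llm = Llama(model_path="model.gguf", n_ctx=512, n_threads=4, verbose=False)
#   print(f"{tokens_per_second(llm, 'The capital of France is'):.1f} tokens/sec")
```

A single short run is noisy; averaging several runs with the same `n_threads` setting would give a steadier number.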