3morixd committed on
Commit
f80b34a
·
verified ·
1 Parent(s): c9f3fa6

Upload README.md with huggingface_hub

  1. README.md +54 -12
README.md CHANGED
@@ -1,19 +1,61 @@
1
  ---
2
  license: apache-2.0
3
- base_model: Qwen/Qwen2.5-Math-1.5B-Instruct
4
- tags: [dispatch-ai, mobile, quantized, gguf, phone-farm-tested]
5
  pipeline_tag: text-generation
6
- language: [en]
7
  ---
 
8
  # Qwen2.5-Math-1.5B-mobile
9
- **Dispatch AI** — Built for mobile. Tested on real phones.
10
- ## Category
11
- Math Reasoning — 1.5B math specialist
12
- ## Model
13
- Re-engineered from [Qwen/Qwen2.5-Math-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Math-1.5B-Instruct).
14
- Size: 940 MB. Q4_K_M GGUF for llama.cpp.
15
  ## Usage
16
- ```bash
17
- ./llama-cli -m model.gguf -p "Hello" -n 100 -t 4 -c 512
18
  ```
19
- 🌐 [dispatchAI on HuggingFace](https://huggingface.co/dispatchAI)
1
  ---
2
  license: apache-2.0
3
+ language:
4
+ - en
5
+ library_name: transformers
6
+ tags:
7
+ - mobile
8
+ - on-device
9
+ - quantized
10
+ - gguf
11
+ - dispatchai
12
  pipeline_tag: text-generation
 
13
  ---
14
+
15
  # Qwen2.5-Math-1.5B-mobile
16
+
17
+ **WORKS** — Verified June 2026.
18
+
19
+ ## Verification Results
20
+
21
+ | Prompt | Response | Correct? |
22
+ |--------|----------|----------|
23
+ | What is the capital of France? | "The capital of France is Paris." | ✅ |
24
+ | What is 2+2? Just the number. | "The sum of 2 and 2 is 4." | ✅ |
25
+
26
+ **Chat format**: `chatml`
27
+
28
+ ## Model Details
29
+
30
+ | Attribute | Value |
31
+ |-----------|-------|
32
+ | **Base Model** | Qwen/Qwen2.5-Math-1.5B |
33
+ | **File Size** | 940 MB |
34
+ | **Format** | GGUF |
35
+ | **Chat Format** | chatml |
36
+ | **CPU Speed** | 15.7 tokens/sec |
37
+ | **License** | apache-2.0 |
38
+
39
  ## Usage
40
+
41
+ ```python
42
+ from llama_cpp import Llama
43
+
44
+ llm = Llama(model_path="model.gguf", chat_format="chatml", n_ctx=512, n_threads=4)
45
+ response = llm.create_chat_completion(
46
+     messages=[{"role": "user", "content": "What is the capital of France?"}],
47
+     max_tokens=50,
48
+ )
49
+ print(response["choices"][0]["message"]["content"])
50
  ```
51
+
52
+ ### dispatchAI SDK
53
+ ```python
54
+ from dispatchai import load_model
55
+ model = load_model("Qwen2.5-Math-1.5B-mobile", backend="gguf")
56
+ print(model.chat("Hello!"))
57
+ ```
58
+
59
+ ## About dispatchAI
60
+
61
+ [dispatchAI](https://huggingface.co/dispatchAI) — Small. Mobile. Free. UAE-built.
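
The updated README names `chatml` as the chat format and passes `chat_format="chatml"` to `Llama(...)`. As a rough illustration of what that format looks like on the wire, the sketch below renders a message list into a ChatML-style prompt string. The helper `format_chatml` and its `add_generation_prompt` parameter are hypothetical names introduced here, not part of `llama_cpp` or the dispatchAI SDK, and the library's real output may differ in whitespace or in how it handles a default system message.

```python
from typing import Dict, List


def format_chatml(messages: List[Dict[str, str]],
                  add_generation_prompt: bool = True) -> str:
    """Render {"role", "content"} messages as a ChatML-style prompt.

    Hypothetical helper for illustration only; each turn is wrapped in
    <|im_start|> ... <|im_end|> markers, one turn per block.
    """
    parts = []
    for message in messages:
        parts.append(
            f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
        )
    if add_generation_prompt:
        # Open an assistant turn so the model continues from here.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


if __name__ == "__main__":
    prompt = format_chatml(
        [{"role": "user", "content": "What is the capital of France?"}]
    )
    print(prompt)
```

When the high-level `create_chat_completion` call is used, this templating is applied by the library, so a helper like this is mainly useful for inspecting or debugging prompts by hand.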