3morixd committed on
Commit cb2ab87 · verified · 1 Parent(s): c1501c6

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +7 -12
README.md CHANGED
@@ -14,18 +14,15 @@ pipeline_tag: text-generation
 
  # SmolLM2-360M-Instruct-mobile
 
- ⚠️ **PARTIAL** — Verified June 2026.
 
  ## Verification Results
 
  | Prompt | Response | Correct? |
  |--------|----------|----------|
- | What is the capital of France? | "What is the capital of France?" | ⚠️ |
- | What is 2+2? Just the number. | "2+2=4
 
- So, 2+2 equals 4." | ✅ |
-
- **Chat format**: `llama-3`
 
  ## Model Details
 
@@ -34,8 +31,8 @@ So, 2+2 equals 4." | ✅ |
  | **Base Model** | HuggingFaceTB/SmolLM2-360M-Instruct |
  | **File Size** | 258 MB |
  | **Format** | GGUF |
- | **Chat Format** | llama-3 |
- | **CPU Speed** | 26.4 tokens/sec |
  | **License** | apache-2.0 |
 
  ## Usage
@@ -43,7 +40,7 @@ So, 2+2 equals 4." | ✅ |
  ```python
  from llama_cpp import Llama
 
- llm = Llama(model_path="model.gguf", chat_format="llama-3", n_ctx=512, n_threads=4)
  response = llm.create_chat_completion(
  messages=[{"role": "user", "content": "What is the capital of France?"}],
  max_tokens=50,
@@ -58,6 +55,4 @@ model = load_model("SmolLM2-360M-Instruct-mobile", backend="gguf")
  print(model.chat("Hello!"))
  ```
 
- ## About dispatchAI
-
- [dispatchAI](https://huggingface.co/dispatchAI) — Small. Mobile. Free. UAE-built.
 
 
  # SmolLM2-360M-Instruct-mobile
 
+ **WORKS** — Verified June 2026.
 
  ## Verification Results
 
  | Prompt | Response | Correct? |
  |--------|----------|----------|
+ | What is the capital of France? | "The capital of France is Paris. Paris is a city known for it" | |
+ | Say hello in one sentence. | "Hello!" | ✅ |
 
 
  ## Model Details
 
 
  | **Base Model** | HuggingFaceTB/SmolLM2-360M-Instruct |
  | **File Size** | 258 MB |
  | **Format** | GGUF |
+ | **Chat Format** | chatml |
+ | **CPU Speed** | 29.1 tokens/sec |
  | **License** | apache-2.0 |
 
  ## Usage
 
  ```python
  from llama_cpp import Llama
 
+ llm = Llama(model_path="model.gguf", chat_format="chatml", n_ctx=512, n_threads=4, verbose=False)
  response = llm.create_chat_completion(
  messages=[{"role": "user", "content": "What is the capital of France?"}],
  max_tokens=50,
 
  print(model.chat("Hello!"))
  ```
 
+ 🚀 [dispatchAI](https://huggingface.co/dispatchAI)
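
For context on the `chat_format` change in this commit: ChatML wraps each turn in `<|im_start|>` and `<|im_end|>` markers, which is a different prompt layout from `llama-3`. The sketch below renders a message list into that general shape in plain Python. The helper name `render_chatml` and its `add_generation_prompt` parameter are hypothetical, introduced only for illustration; the exact string that llama-cpp-python or the model's own template builds may differ, for example by adding a default system message.

```python
def render_chatml(messages, add_generation_prompt=True):
    """Render a list of {"role", "content"} dicts as a ChatML-style prompt.

    Hypothetical helper for illustration; not part of llama-cpp-python.
    """
    parts = []
    for message in messages:
        # Each turn: start marker + role, newline, content, end marker.
        parts.append(
            f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
        )
    if add_generation_prompt:
        # Leave an open assistant turn for the model to complete.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


prompt = render_chatml(
    [{"role": "user", "content": "What is the capital of France?"}]
)
print(prompt)
```

A prompt built with the wrong template can leave the model without a recognizable assistant turn to complete, which may be why the earlier `llama-3` run echoed the question back; that reading is an inference from the verification tables, not something the commit states.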