3morixd commited on
Commit
d66bbfd
·
verified ·
1 Parent(s): ff74664

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +9 -9
README.md CHANGED
@@ -14,23 +14,25 @@ pipeline_tag: text-generation
14
 
15
  # Llama-3.2-1B-FunctionCall-mobile
16
 
17
- ⚠️ **PARTIAL** — Verified June 2026.
18
 
19
  ## Verification Results
20
 
21
- Phone-verified only. See verification report.
 
 
 
22
 
23
- **Chat format**: `chatml`
24
 
25
  ## Model Details
26
 
27
  | Attribute | Value |
28
  |-----------|-------|
29
  | **Base Model** | meta-llama/Llama-3.2-1B-Instruct |
30
- | **File Size** | 0 MB |
31
  | **Format** | GGUF |
32
  | **Chat Format** | chatml |
33
- | **CPU Speed** | 6.0 tokens/sec |
34
  | **License** | llama3.2 |
35
 
36
  ## Usage
@@ -38,7 +40,7 @@ Phone-verified only. See verification report.
38
  ```python
39
  from llama_cpp import Llama
40
 
41
- llm = Llama(model_path="model.gguf", chat_format="chatml", n_ctx=512, n_threads=4)
42
  response = llm.create_chat_completion(
43
  messages=[{"role": "user", "content": "What is the capital of France?"}],
44
  max_tokens=50,
@@ -53,6 +55,4 @@ model = load_model("Llama-3.2-1B-FunctionCall-mobile", backend="gguf")
53
  print(model.chat("Hello!"))
54
  ```
55
 
56
- ## About dispatchAI
57
-
58
- [dispatchAI](https://huggingface.co/dispatchAI) — Small. Mobile. Free. UAE-built.
 
14
 
15
  # Llama-3.2-1B-FunctionCall-mobile
16
 
17
+ **WORKS** — Verified June 2026.
18
 
19
  ## Verification Results
20
 
21
+ | Prompt | Response | Correct? |
22
+ |--------|----------|----------|
23
+ | What is the capital of France? | "The capital of France is Paris." | ✅ |
24
+ | Say hello in one sentence. | "I'm happy to help you with your question. <|endoftext|>" | ✅ |
25
 
 
26
 
27
  ## Model Details
28
 
29
  | Attribute | Value |
30
  |-----------|-------|
31
  | **Base Model** | meta-llama/Llama-3.2-1B-Instruct |
32
+ | **File Size** | 1926 MB |
33
  | **Format** | GGUF |
34
  | **Chat Format** | chatml |
35
+ | **CPU Speed** | 8.9 tokens/sec |
36
  | **License** | llama3.2 |
37
 
38
  ## Usage
 
40
  ```python
41
  from llama_cpp import Llama
42
 
43
+ llm = Llama(model_path="model.gguf", chat_format="chatml", n_ctx=512, n_threads=4, verbose=False)
44
  response = llm.create_chat_completion(
45
  messages=[{"role": "user", "content": "What is the capital of France?"}],
46
  max_tokens=50,
 
55
  print(model.chat("Hello!"))
56
  ```
57
 
58
+ 🚀 [dispatchAI](https://huggingface.co/dispatchAI)