cmac86 commited on
Commit
a5965ce
·
verified ·
1 Parent(s): e4850c7

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +66 -0
README.md ADDED
@@ -0,0 +1,66 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ license_name: caal-model-license-v1
4
+ license_link: LICENSE
5
+ language:
6
+ - en
7
+ base_model: Qwen/Qwen3.5-2B
8
+ tags:
9
+ - tool-calling
10
+ - voice-assistant
11
+ - fine-tuned
12
+ - gguf
13
+ - qwen3.5
14
+ - caal
15
+ ---
16
+
17
+ # CAAL Qwen3.5 2B — Fine-Tuned for Tool Calling
18
+
19
+ A fine-tuned [Qwen3.5 2B](https://huggingface.co/Qwen/Qwen3.5-2B) model optimized for tool calling in voice assistant workflows. Built for [CAAL](https://github.com/CoreWorxLab/CAAL) (CoreWorxLab Ambient Assistant for Linux).
20
+
21
+ ## Performance
22
+
23
+ **82/85 tests passed (96%)** on the CAAL 85-test evaluation suite:
24
+
25
+ | Category | Score |
26
+ |----------|-------|
27
+ | Single tool calls | 27/27 |
28
+ | Conversational (no tool) | 10/10 |
29
+ | Multi-turn chains | 28/29 |
30
+ | Argument formatting | 17/19 |
31
+
32
+ ## Model Details
33
+
34
+ - **Base model**: Qwen3.5 2B
35
+ - **Training method**: SFT with BF16 LoRA (last-turn-only — previous turns as context, only final response trained)
36
+ - **LoRA config**: r=32, alpha=32
37
+ - **Quantization**: Q4_K_M (GGUF)
38
+ - **File size**: ~1.2 GB
39
+ - **VRAM usage**: ~2.6 GB at 16384 context
40
+
41
+ ## Usage with Ollama
42
+
43
+ ```bash
44
+ # Download the GGUF and create a Modelfile:
45
+ # Modelfile contents:
46
+ # FROM caal-qwen3.5-2b-q4.gguf
47
+ # RENDERER qwen3.5
48
+ # PARSER qwen3.5
49
+ # PARAMETER temperature 0.1
50
+ # PARAMETER num_ctx 16384
51
+
52
+ ollama create caal-qwen35-2b -f Modelfile
53
+ ```
54
+
55
+ ## Designed For
56
+
57
+ - Edge deployment on consumer GPUs (fits on 5GB+ VRAM alongside TTS)
58
+ - Local voice assistants with tool calling
59
+ - Smart home control, email, calendar, and service management
60
+ - Multi-step tool chains (e.g., search → lookup contact → send email)
61
+
62
+ ## License
63
+
64
+ See [LICENSE](LICENSE) for the CAAL Model License v1.0. This model is free for personal, non-commercial use with attribution to CoreWorxLab. Commercial use requires written permission.
65
+
66
+ The base model (Qwen3.5) is licensed under Apache 2.0. Users must comply with both licenses.