bricksandbot committed
Commit 7c9a711 · verified · 1 Parent(s): 55bda91

Upload README.md with huggingface_hub

  1. README.md +113 -0
README.md ADDED
@@ -0,0 +1,113 @@
---
license: mit
base_model: microsoft/Phi-4-mini-instruct
tags:
- phi4
- gguf
- quantized
- q4_k_m
- buildsnpper
- sap-assessor
- chatbot
- customer-support
language:
- en
pipeline_tag: text-generation
---

# Buildsnpper SAP Assessor Platform Chatbot (Q4_K_M)

A fine-tuned Phi-4-mini-instruct model for the Buildsnpper SAP Assessor Platform customer support chatbot.

## Model Details

- **Base Model**: microsoft/Phi-4-mini-instruct (3.8B parameters)
- **Fine-tuning**: LoRA (rank=16, alpha=32)
- **Format**: GGUF, Q4_K_M quantization
- **Size**: ~2.5GB
- **Context Length**: 131,072 tokens (fine-tuning used sequences of up to 1,024 tokens)
- **Training Data**: 89 Q&A pairs covering Buildsnpper platform features, workflows, and common user questions

## Use Cases

This model is trained specifically to answer questions about:
- Project and client management in Buildsnpper
- Subscription and credit system
- Platform features and navigation
- Common technical issues
- Account management
- Report generation and exports

## Usage

### With llama.cpp

```bash
# Download the model
wget https://huggingface.co/bricksandbotltd/buildsnpper-chatbot-Q4_K_M/resolve/main/buildsnpper-chatbot-Q4_K_M.gguf

# Run a single prompt with llama.cpp, generating up to 256 tokens
./llama-cli -m buildsnpper-chatbot-Q4_K_M.gguf -p "How do I create a new project in Buildsnpper?" -n 256
```
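
For environments without `wget`, the download step can be sketched in Python using only the standard library. The repository and file names come from the command above; the helper names `resolve_url` and `download_model` are hypothetical, and the snippet assumes the file is publicly accessible at that URL.

```python
from pathlib import Path
from urllib.request import urlretrieve

# Repository and file names are taken from the wget command in this README.
REPO_ID = "bricksandbotltd/buildsnpper-chatbot-Q4_K_M"
FILENAME = "buildsnpper-chatbot-Q4_K_M.gguf"


def resolve_url(repo_id: str, filename: str, revision: str = "main") -> str:
    """Build the direct-download URL for a file in a Hugging Face repo."""
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{filename}"


def download_model(dest_dir: str = ".") -> Path:
    """Download the GGUF file unless it already exists (hypothetical helper)."""
    dest = Path(dest_dir) / FILENAME
    if not dest.exists():
        urlretrieve(resolve_url(REPO_ID, FILENAME), dest)
    return dest


# Usage (downloads roughly 2.5GB on the first call):
# model_path = download_model()
```

The returned path can be passed to `llama-cli -m` or to `llama_cpp.Llama(model_path=...)`.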

### With Python (llama-cpp-python)

```python
from llama_cpp import Llama

# Load the quantized GGUF model from a local path
llm = Llama(
    model_path="buildsnpper-chatbot-Q4_K_M.gguf",
    n_ctx=2048,   # context window for this session
    n_threads=4,  # CPU threads used for inference
)

# A low temperature keeps support answers close to deterministic
response = llm.create_chat_completion(
    messages=[
        {"role": "user", "content": "How do I assign credits to a client?"}
    ],
    temperature=0.1,
    max_tokens=256,
)

print(response["choices"][0]["message"]["content"])
```
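
For multi-turn support chats, one possible way to assemble the `messages` list is sketched below. This is not part of the published model card: the system prompt text, the `build_messages` helper, and the `max_turns` limit are all assumptions, and the sketch assumes `history` alternates user and assistant messages.

```python
from typing import Dict, List

Message = Dict[str, str]

# ASSUMPTION: hypothetical system prompt; the README does not specify one.
SYSTEM_PROMPT = "You are a support assistant for the Buildsnpper SAP Assessor Platform."


def build_messages(history: List[Message], question: str, max_turns: int = 4) -> List[Message]:
    """Return the system prompt, the last `max_turns` exchanges, and the new question.

    Keeping only recent exchanges helps the prompt fit in a small n_ctx.
    """
    # Each exchange is one user message plus one assistant message.
    recent = history[-2 * max_turns:] if max_turns > 0 else []
    return [
        {"role": "system", "content": SYSTEM_PROMPT},
        *recent,
        {"role": "user", "content": question},
    ]
```

The result can be passed as `messages=build_messages(history, question)` to `llm.create_chat_completion`. Whether a system prompt helps this fine-tuned model is untested here, so it may be worth comparing answers with and without it.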

## Training Details

- **LoRA Configuration**:
  - Rank: 16
  - Alpha: 32
  - Target modules: qkv_proj, o_proj
  - Dropout: 0.05

- **Training Parameters**:
  - Epochs: 3
  - Learning rate: 3e-4
  - Max sequence length: 1024
  - Gradient accumulation: 4 steps
  - Final training loss: 1.42

- **Hardware**: Apple M3 MacBook Air (MPS acceleration)
- **Training time**: ~1.5 hours
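
To give a feel for what the LoRA settings above imply, the sketch below computes the standard LoRA scaling factor (alpha / rank) and a rough count of trainable adapter parameters. The rank and alpha come from this README; the layer shapes (`HIDDEN`, `QKV_OUT`, `NUM_LAYERS`) are assumptions about the base model and should be checked against its `config.json`.

```python
def lora_scaling(rank: int, alpha: int) -> float:
    """Scaling applied to the low-rank update in standard LoRA (alpha / rank)."""
    return alpha / rank


def lora_param_count(in_features: int, out_features: int, rank: int) -> int:
    """Trainable parameters for one adapted linear layer.

    LoRA adds A with shape (rank, in_features) and B with shape (out_features, rank).
    """
    return rank * (in_features + out_features)


RANK, ALPHA = 16, 32  # from the LoRA configuration in this README

# ASSUMPTION: layer shapes for the base model; verify against its config.json.
HIDDEN = 3072      # hidden size, also the input/output width of o_proj
QKV_OUT = 5120     # output width of the fused qkv_proj
NUM_LAYERS = 32    # number of transformer blocks

per_layer = lora_param_count(HIDDEN, QKV_OUT, RANK) + lora_param_count(HIDDEN, HIDDEN, RANK)
total = per_layer * NUM_LAYERS

print(f"LoRA scaling factor: {lora_scaling(RANK, ALPHA)}")
print(f"Approximate trainable LoRA parameters: {total:,}")
```

Under these assumed shapes the adapters hold only a few million parameters, a small fraction of the 3.8B base model, which is consistent with training being feasible on a laptop.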

## Quantization

The original FP16 model (7.67GB) was quantized to Q4_K_M format (2.5GB) using llama.cpp. The quantized model offers:
- A 67% size reduction
- A format optimized for CPU inference
- Minimal quality degradation
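
The size figures above can be checked with a little arithmetic. The sketch below recomputes the reduction and estimates the average bits stored per weight. It assumes 1 GB = 10^9 bytes, a parameter count of 3.8B as stated in Model Details, and that file size is dominated by weights.

```python
def size_reduction(original_gb: float, quantized_gb: float) -> float:
    """Percentage reduction in file size after quantization."""
    return (original_gb - quantized_gb) / original_gb * 100.0


def bits_per_weight(file_gb: float, n_params: float) -> float:
    """Average bits per parameter, assuming 1 GB = 1e9 bytes."""
    return file_gb * 1e9 * 8 / n_params


FP16_GB, Q4_GB = 7.67, 2.5  # sizes reported in this README
N_PARAMS = 3.8e9            # parameter count reported in Model Details

print(f"Size reduction: {size_reduction(FP16_GB, Q4_GB):.1f}%")
print(f"FP16 bits per weight: {bits_per_weight(FP16_GB, N_PARAMS):.2f}")
print(f"Q4_K_M bits per weight: {bits_per_weight(Q4_GB, N_PARAMS):.2f}")
```

The FP16 estimate lands near 16 bits per weight, which suggests the assumptions are roughly right. The Q4_K_M estimate is approximate because the 2.5GB figure is rounded.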

## Limitations

- Specialized for the Buildsnpper platform only
- May not perform well on general queries outside the platform domain
- Designed for customer support, not general conversation

## License

MIT License. See the base model license for additional restrictions.

## Contact

- Organization: [bricksandbotltd](https://huggingface.co/bricksandbotltd)
- Platform: [Buildsnpper SAP Assessor Platform](https://buildsnpper.com)