| | --- |
| | license: mit |
| | base_model: microsoft/Phi-4-mini-instruct |
| | tags: |
| | - phi4 |
| | - gguf |
| | - quantized |
| | - q4_k_m |
| | - buildsnpper |
| | - sap-assessor |
| | - chatbot |
| | - customer-support |
| | language: |
| | - en |
| | pipeline_tag: text-generation |
| | --- |
| | |
| | # Buildsnpper SAP Assessor Platform Chatbot (Q4_K_M) |
| |
|
| | Fine-tuned Phi-4-mini-instruct model for the Buildsnpper SAP Assessor Platform customer support chatbot. |
| |
|
| | ## Model Details |
| |
|
| | - **Base Model**: microsoft/Phi-4-mini-instruct (3.8B parameters) |
| | - **Fine-tuning**: LoRA (rank=16, alpha=32) |
| | - **Format**: GGUF Q4_K_M quantized |
| | - **Size**: ~2.5GB |
| | - **Context Length**: 131,072 tokens |
| | - **Training Data**: 89 Q&A pairs covering Buildsnpper platform features, workflows, and common user questions |
| |
|
| | ## Use Cases |
| |
|
| | This model is specifically trained to answer questions about: |
| | - Project and client management in Buildsnpper |
| | - Subscription and credit system |
| | - Platform features and navigation |
| | - Common technical issues |
| | - Account management |
| | - Report generation and exports |
| |
|
| | ## Usage |
| |
|
| | ### With llama.cpp |
| |
|
| | ```bash |
| | # Download the model |
| | wget https://huggingface.co/bricksandbotltd/buildsnpper-chatbot-Q4_K_M/resolve/main/buildsnpper-chatbot-Q4_K_M.gguf |
| | |
| | # Run with llama.cpp |
| | ./llama-cli -m buildsnpper-chatbot-Q4_K_M.gguf -p "How do I create a new project in Buildsnpper?" -n 256 |
| | ``` |
| |
|
| | ### With Python (llama-cpp-python) |
| |
|
| | ```python |
| | from llama_cpp import Llama |
| | |
| | llm = Llama( |
| | model_path="buildsnpper-chatbot-Q4_K_M.gguf", |
| | n_ctx=2048, |
| | n_threads=4 |
| | ) |
| | |
| | response = llm.create_chat_completion( |
| | messages=[ |
| | {"role": "user", "content": "How do I assign credits to a client?"} |
| | ], |
| | temperature=0.1, |
| | max_tokens=256 |
| | ) |
| | |
| | print(response['choices'][0]['message']['content']) |
| | ``` |
| |
|
| | ## Training Details |
| |
|
| | - **LoRA Configuration**: |
| | - Rank: 16 |
| | - Alpha: 32 |
| | - Target modules: qkv_proj, o_proj |
| | - Dropout: 0.05 |
| |
|
| | - **Training Parameters**: |
| | - Epochs: 3 |
| | - Learning rate: 3e-4 |
| | - Max sequence length: 1024 |
| | - Gradient accumulation: 4 steps |
| | - Final training loss: 1.42 |
| |
|
| | - **Hardware**: Apple M3 MacBook Air (MPS acceleration) |
| | - **Training time**: ~1.5 hours |
| |
|
| | ## Quantization |
| |
|
| | Original FP16 model (7.67GB) was quantized to Q4_K_M format (2.5GB) using llama.cpp, achieving: |
| | - 67% size reduction |
| | - Optimized for CPU inference |
| | - Minimal quality degradation |
| |
|
| | ## Limitations |
| |
|
| | - Specialized for Buildsnpper platform only |
| | - May not perform well on general queries outside the platform domain |
| | - Designed for customer support, not general conversation |
| |
|
| | ## License |
| |
|
| | MIT License - See base model license for additional restrictions. |
| |
|
| | ## Contact |
| |
|
| | - Organization: [bricksandbotltd](https://huggingface.co/bricksandbotltd) |
| | - Platform: [Buildsnpper SAP Assessor Platform](https://buildsnpper.com) |
| |
|