jsdjsdequinia
/

cloud-expert-qwen

+---
+language: en
+license: apache-2.0
+base_model: Qwen/Qwen2.5-Coder-1.5B-Instruct
+tags:
+- qwen2.5
+- cloud
+- azure
+- aws
+- terraform
+- docker
+- kubernetes
+- linux
+- iac
+---
+# Cloud Expert Qwen
+This is a version 1 of Qwen 2.5-Coder 1.5B fine-tuned for cloud computing & IaC
+Fine-tuned on comprehensive cloud computing, Infrastructure as Code, containerization, and Linux system administration documentation.
+## 🎯 What This Model Knows
+- **Cloud Platforms**: Azure, AWS, GCP
+- **Infrastructure as Code**: Terraform, CloudFormation, ARM templates
+- **Containers & Orchestration**: Docker, Kubernetes
+- **Linux**: System administration, troubleshooting, networking
+- **DevOps**: CI/CD, monitoring, security best practices
+## 🚀 Quick Start
+### Option 1: Use with Transformers (Python)
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
+from peft import PeftModel
+import torch
+# Load base model
+base_model = "Qwen/Qwen2.5-Coder-1.5B-Instruct"
+model = AutoModelForCausalLM.from_pretrained(
+    base_model,
+    torch_dtype=torch.float16,
+    device_map="auto",
+    trust_remote_code=True
+)
+# Load fine-tuned LoRA adapters
+model = PeftModel.from_pretrained(model, "jsdjsdequinia/cloud-expert-qwen/lora-adapters")
+tokenizer = AutoTokenizer.from_pretrained(base_model, trust_remote_code=True)
+# Ask a question
+question = "How do I troubleshoot SSH connection issues on Linux?"
+prompt = f"<|im_start|>user\n{question}<|im_end|>\n<|im_start|>assistant\n"
+inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
+outputs = model.generate(**inputs, max_new_tokens=200, temperature=0.7)
+response = tokenizer.decode(outputs[0], skip_special_tokens=True)
+print(response)
+```
+### Option 2: Use with Ollama (Recommended for CPU)
+**Perfect for laptops without GPU!**
+1. **Install Ollama**: https://ollama.ai/download
+2. **Download the model**:
+```bash
+   huggingface-cli download jsdjsdequinia/cloud-expert-qwen cloud-expert-qwen-q8_0.gguf --local-dir ./
+   huggingface-cli download jsdjsdequinia/cloud-expert-qwen Modelfile --local-dir ./
+```
+3. **Create Ollama model**:
+```bash
+   ollama create cloud-expert -f Modelfile
+```
+4. **Run it**:
+```bash
+   ollama run cloud-expert
+```
+5. **Or use in code**:
+```python
+   import ollama
+   response = ollama.chat(model='cloud-expert', messages=[
+       {'role': 'user', 'content': 'What is Azure Virtual Machine?'}
+   ])
+   print(response['message']['content'])
+```
+## 📦 Available Formats
+| Format | Size | Use Case | Download |
+|--------|------|----------|----------|
+| **LoRA Adapters** | ~100MB | Fine-tuning, GPU inference | `lora-adapters/` |
+| **Merged Model** | ~3GB | Full model, GPU inference | `merged-model/` |
+| **GGUF (q8_0)** | ~1.5GB | CPU inference with Ollama | `*.gguf` |
+## 📊 Training Details
+- **Base Model**: Qwen/Qwen2.5-Coder-1.5B-Instruct
+- **Training Method**: LoRA (Low-Rank Adaptation)
+- **Training Data**: 43 examples
+  - Manual Q&A pairs on cloud services
+  - Scraped official documentation (Azure, Docker, Kubernetes, etc.)
+  - Linux troubleshooting guides
+- **Training Time**: ~20-30 minutes on RTX 3070
+- **Trainable Parameters**: ~2% (LoRA efficient training)
+## 💡 Example Questions
+```
+- What is Microsoft Azure?
+- How do I deploy a Docker container?
+- Explain Terraform state management
+- How do I troubleshoot disk usage on Linux?
+- Compare Azure VM vs AWS EC2
+- What are Kubernetes best practices?
+- How do I configure a Linux firewall?
+```
+## 🖥️ System Requirements
+### For Training:
+- GPU with 8GB+ VRAM
+- 16GB RAM
+- CUDA 12.1+
+### For Inference:
+**With Transformers (GPU):**
+- GPU with 4GB+ VRAM
+- 8GB RAM
+**With Ollama (CPU - Recommended for work laptops):**
+- Any modern CPU
+- 4GB RAM
+- No GPU needed! ✅
+## ⚡ Performance
+| Setup | Tokens/Second | Use Case |
+|-------|---------------|----------|
+| GPU (RTX 3070) | ~50 tok/s | Development, training |
+| CPU (Ollama, 16GB RAM) | ~10-15 tok/s | Work laptop, portable |
+## 🎓 Use Cases
+✅ Learning cloud technologies
+✅ Quick reference for DevOps tasks
+✅ Understanding IaC best practices
+✅ Linux troubleshooting assistance
+✅ Comparing cloud services
+✅ Interview preparation
+## ⚠️ Limitations
+- Knowledge cutoff: Training data as of 2024
+- May not reflect very recent service updates
+- Best for general concepts and established practices
+- Always verify critical production decisions with official docs
+- Not a replacement for hands-on experience
+## 📜 License
+Apache 2.0 - Free for commercial and personal use
+## 🙏 Credits
+- Base model: [Qwen/Qwen2.5-Coder-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Coder-1.5B-Instruct)
+- Fine-tuned by: jsdjsdequinia
+- Documentation sources: Microsoft Azure, Docker, Kubernetes, DigitalOcean, HashiCorp
+## 🐛 Feedback
+Found an issue or have suggestions? Feel free to open an issue on the model page!
+---
+**Built with ❤️ for the DevOps community**