kubernetes-ai-lora / README.md

Update README.md

e015389 verified 2 months ago

8.34 kB

	---
	license: mit
	base_model: unsloth/gemma-3-12b-it-qat-bnb-4bit
	tags:
	- kubernetes
	- devops
	- infrastructure
	- k8s
	- turkish
	- gemma
	- unsloth
	- lora
	datasets:
	- mcipriano/stackoverflow-kubernetes-questions
	- Szaid3680/Devops
	- ahmedgongi/Devops_LLM
	- HelloBoieeee/kubernetes_config
	- sidddddddddddd/kubernetes-with-ood
	- peterpanpan/stackoverflow-kubernetes-questions
	- dereklck/kubernetes_operator_3b_1.5k
	- dereklck/kubernetes_cli_dataset_20k
	library_name: peft
	language:
	- en
	- tr
	---

	# Kubernetes AI - Gemma 3 12B LoRA Adapters

	Fine-tuned Gemma 3 12B model specialized for answering Kubernetes questions in Turkish.

	## Model Description

	This model consists of LoRA adapters fine-tuned on `unsloth/gemma-3-12b-it-qat-bnb-4bit` using a comprehensive dataset of Kubernetes documentation, Stack Overflow questions, and DevOps scenarios.

	Primary Purpose: Answer Kubernetes-related questions in Turkish language.

	### Use Cases

	- Kubernetes cluster management and troubleshooting
	- YAML configuration generation and validation
	- kubectl command assistance
	- Debugging pod, service, and deployment issues
	- Kubernetes best practices and concepts
	- DevOps workflow optimization
	- Turkish language Kubernetes Q&A

	## Quick Start

	### Loading the Model

	```python
	from unsloth import FastLanguageModel
	from peft import PeftModel
	import torch

	# Load base Gemma 3 12B model
	model, tokenizer = FastLanguageModel.from_pretrained(
	model_name="unsloth/gemma-3-12b-it-qat-bnb-4bit",
	max_seq_length=2048,
	dtype=None,
	load_in_4bit=True, # Use 4-bit quantization to fit in GPU memory
	)

	# Load Kubernetes AI LoRA adapters
	model = PeftModel.from_pretrained(
	model,
	"aciklab/kubernetes-ai"
	)

	# Enable inference mode
	FastLanguageModel.for_inference(model)

	# Example usage (Turkish question)
	messages = [
	{"role": "user", "content": "Kubernetes'te 3 replikaya sahip bir deployment nasıl oluştururum?"}
	]

	inputs = tokenizer.apply_chat_template(
	messages,
	tokenize=True,
	add_generation_prompt=True,
	return_tensors="pt"
	).to("cuda")

	outputs = model.generate(
	input_ids=inputs,
	max_new_tokens=512,
	temperature=0.7,
	top_p=0.9,
	do_sample=True
	)

	response = tokenizer.decode(outputs[0], skip_special_tokens=True)
	print(response)
	```

	## Example Questions

	### Turkish Examples

	```python
	# Deployment creation
	"Node.js uygulaması için 3 replika, sağlık kontrolleri ve kaynak limitleri olan bir Kubernetes deployment oluştur."

	# Troubleshooting
	"Pod'um CrashLoopBackOff durumunda. Yaygın nedenleri nelerdir ve nasıl debug ederim?"

	# kubectl commands
	"Production namespace'indeki çalışmayan tüm pod'ları gösteren kubectl komutunu yaz."

	# Best practices
	"Kubernetes'te container güvenliği için en iyi uygulamalar nelerdir?"

	# Service creation
	"LoadBalancer tipinde bir Kubernetes servisi nasıl yapılandırılır?"
	```

	### English Examples

	```python
	"How do I create a Kubernetes deployment with 3 replicas?"
	"What are the common causes of CrashLoopBackOff?"
	"Show me kubectl command to get all pods in production namespace."
	```

	## Training Dataset

	The model was trained on ~157,000 examples from multiple high-quality Kubernetes and DevOps datasets:

	\| Dataset \| Count \| Description \|
	\|---------\|----------\|-------------\|
	\| Kubernetes Official Documentation \| \| \|
	\| - Concepts \| 2,700 \| Core Kubernetes concepts \|
	\| - Kubectl Reference \| 600 \| kubectl command documentation \|
	\| - Setup Guides \| 430 \| Installation and setup \|
	\| - Tasks \| 4,300 \| Practical task guides \|
	\| - Tutorials \| 880 \| Step-by-step tutorials \|
	\| Stack Overflow \| \| \|
	\| mcipriano/stackoverflow-kubernetes-questions \| 30,000 \| Kubernetes Q&A \|
	\| peterpanpan/stackoverflow-kubernetes-questions \| 22,000 \| Additional Kubernetes Q&A \|
	\| DevOps Datasets \| \| \|
	\| Szaid3680/Devops \| 42,000 \| General DevOps content \|
	\| ahmedgongi/Devops_LLM \| 20,500 \| Kubernetes-filtered DevOps (from 140k) \|
	\| Configuration & Operations \| \| \|
	\| HelloBoieeee/kubernetes_config \| 10,000 \| Kubernetes configurations \|
	\| sidddddddddddd/kubernetes-with-ood \| 6,000 \| Kubernetes scenarios (incl. Turkish translations) \|
	\| dereklck/kubernetes_cli_dataset_20k \| 19,000 \| kubectl CLI examples \|
	\| dereklck/kubernetes_operator_3b_1.5k \| 1,800 \| Kubernetes operator patterns \|

	Total Training Examples: ~157,210

	## Training Details

	- Base Model: unsloth/gemma-3-12b-it-qat-bnb-4bit
	- Method: LoRA (Low-Rank Adaptation)
	- Framework: Unsloth
	- LoRA Rank: 8
	- Target Modules: q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
	- Training Checkpoint: checkpoint-8175
	- Max Sequence Length: 1024 tokens
	- Training Time: 28 hours
	- Hardware: NVIDIA GeForce RTX 5070 12GB

	## Hardware Requirements

	- Minimum VRAM: 12GB (with 4-bit quantization)
	- Recommended VRAM: 24GB (for faster inference)
	- CPU RAM: 32GB+
	- Training Hardware: RTX 5070 12GB

	## Limitations

	- May not have information on very recent Kubernetes features released after training
	- Primarily trained for Turkish language responses, though it can handle English queries
	- Best suited for technical Kubernetes questions; general conversation capabilities can be limited

	## Performance Notes

	- Trained on RTX 5070 12GB in 28 hours
	- Works with 12GB VRAM using 4-bit quantization
	- Fast startup by loading only adapters without full model reload

	## License

	This model is released under the MIT License. Free to use in commercial and open-source projects.

	## Acknowledgments

	- Google and Unsloth team for the Gemma 3 base model
	- Unsloth team for the efficient training framework
	- All dataset contributors
	- Kubernetes community for comprehensive documentation
	- NVIDIA for RTX 5070 enabling 28-hour training

	## Contact

	For questions or feedback, please open an issue on the model repository.

	---

	Note: This is a LoRA adapter, not a full model. You must load it on top of `unsloth/gemma-3-12b-it-qat-bnb-4bit` to use it.

	## Related Links

	- [Unsloth Documentation](https://docs.unsloth.ai/)
	- [Gemma Model Card](https://ai.google.dev/gemma)
	- [PEFT Documentation](https://huggingface.co/docs/peft)
	- [Kubernetes Documentation](https://kubernetes.io/docs/)

	## Citations

	### Datasets

	```bibtex
	@misc{stackoverflow-kubernetes-mcipriano,
	author = {mcipriano},
	title = {Stack Overflow Kubernetes Questions},
	year = {2024},
	publisher = {HuggingFace},
	url = {https://huggingface.co/datasets/mcipriano/stackoverflow-kubernetes-questions}
	}

	@misc{devops-szaid,
	author = {Szaid3680},
	title = {DevOps Dataset},
	year = {2024},
	publisher = {HuggingFace},
	url = {https://huggingface.co/datasets/Szaid3680/Devops}
	}

	@misc{devops-llm-ahmed,
	author = {ahmedgongi},
	title = {DevOps LLM Dataset},
	year = {2024},
	publisher = {HuggingFace},
	url = {https://huggingface.co/datasets/ahmedgongi/Devops_LLM}
	}

	@misc{kubernetes-config-hello,
	author = {HelloBoieeee},
	title = {Kubernetes Config Dataset},
	year = {2024},
	publisher = {HuggingFace},
	url = {https://huggingface.co/datasets/HelloBoieeee/kubernetes_config}
	}

	@misc{kubernetes-ood-sidddddddddddd,
	author = {sidddddddddddd},
	title = {Kubernetes with OOD Dataset},
	year = {2024},
	publisher = {HuggingFace},
	url = {https://huggingface.co/datasets/sidddddddddddd/kubernetes-with-ood}
	}

	@misc{stackoverflow-kubernetes-peter,
	author = {peterpanpan},
	title = {Stack Overflow Kubernetes Questions},
	year = {2024},
	publisher = {HuggingFace},
	url = {https://huggingface.co/datasets/peterpanpan/stackoverflow-kubernetes-questions}
	}

	@misc{kubernetes-operator-derek,
	author = {dereklck},
	title = {Kubernetes Operator Dataset},
	year = {2024},
	publisher = {HuggingFace},
	url = {https://huggingface.co/datasets/dereklck/kubernetes_operator_3b_1.5k}
	}

	@misc{kubernetes-cli-derek,
	author = {dereklck},
	title = {Kubernetes CLI Dataset},
	year = {2024},
	publisher = {HuggingFace},
	url = {https://huggingface.co/datasets/dereklck/kubernetes_cli_dataset_20k}
	}
	```

	### Model

	```bibtex
	@misc{kubernetes-ai,
	author = {aciklab},
	title = {Kubernetes AI Turkish - Gemma 3 12B LoRA Adapters},
	year = {2025},
	publisher = {HuggingFace},
	url = {https://huggingface.co/aciklab/kubernetes-ai},
	note = {Trained on RTX 5070 12GB in 28 hours}
	}
	```