rafiaa committed on
Commit c80b2e7 · verified · 1 Parent(s): 2807915

Upload folder using huggingface_hub
README.md ADDED
---
library_name: peft
base_model: codellama/CodeLlama-7b-Instruct-hf
tags:
- terraform
- terraform-configuration
- infrastructure-as-code
- iac
- hashicorp
- codellama
- lora
- qlora
- peft
- code-generation
- devops
- cloud
- automation
- configuration-management
license: apache-2.0
language:
- en
pipeline_tag: text-generation
---

# terraform-codellama-7b

A specialized LoRA fine-tuned model for Terraform infrastructure-as-code generation, built on CodeLlama-7b-Instruct-hf. This model excels at generating Terraform configurations, HCL (HashiCorp Configuration Language) code, and infrastructure automation scripts.

## Model Description

This model is a LoRA (Low-Rank Adaptation) fine-tuned version of CodeLlama-7b-Instruct-hf, specifically optimized for generating Terraform configuration files. It was trained on public Terraform Registry documentation to understand Terraform syntax, resource configurations, and best practices.

### Key Features

- **Specialized for Terraform**: Fine-tuned specifically for infrastructure-as-code generation
- **Efficient Training**: Uses QLoRA (4-bit quantization + LoRA) for memory-efficient training
- **Public Data Only**: Trained exclusively on public Terraform Registry documentation
- **Production Ready**: Optimized for real-world Terraform development workflows

## Model Details

- **Developed by**: Rafi Al Attrach, Patrick Schmitt, Nan Wu, Helena Schneider, Stefania Saju (TUM + IBM Research Project)
- **Model type**: LoRA fine-tuned CodeLlama
- **Language(s)**: English
- **License**: Apache 2.0
- **Finetuned from**: [codellama/CodeLlama-7b-Instruct-hf](https://huggingface.co/codellama/CodeLlama-7b-Instruct-hf)
- **Training method**: QLoRA (4-bit quantization + LoRA)

### Technical Specifications

- **Base Model**: CodeLlama-7b-Instruct-hf
- **LoRA Rank**: 64
- **LoRA Alpha**: 16
- **Target Modules**: q_proj, v_proj
- **Training Epochs**: 3
- **Max Sequence Length**: 512
- **Quantization**: 4-bit (fp4)

+ ## Uses
60
+
61
+ ### Direct Use
62
+
63
+ This model is designed for:
64
+ - Generating Terraform configuration files
65
+ - Infrastructure-as-code development
66
+ - Terraform resource configuration
67
+ - DevOps automation
68
+ - Cloud infrastructure planning
69
+
70
+ ### Example Use Cases
71
+
72
+ ```python
73
+ # Generate AWS EC2 instance configuration
74
+ prompt = "Create a Terraform configuration for an AWS EC2 instance with t3.medium instance type"
75
+ ```
76
+
77
+ ```python
78
+ # Generate Azure resource group
79
+ prompt = "Create a Terraform configuration for an Azure resource group in West Europe"
80
+ ```
81
+
82
+ ```python
83
+ # Generate GCP compute instance
84
+ prompt = "Create a Terraform configuration for a GCP compute instance with Ubuntu 20.04"
85
+ ```
86
+
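CodeLlama-Instruct models are conventionally prompted in the Llama-2 instruction format. A small helper like the following (hypothetical, not part of this repository) wraps a plain request in `[INST] ... [/INST]` tags before tokenization; the tokenizer adds the BOS token itself, so it is omitted here.

```python
def format_instruct_prompt(request: str) -> str:
    # Hypothetical helper: wrap a plain request in the Llama-2/CodeLlama
    # instruction format expected by the -Instruct base model.
    return f"[INST] {request.strip()} [/INST]"

prompt = format_instruct_prompt(
    "Create a Terraform configuration for an AWS EC2 instance "
    "with t3.medium instance type"
)
```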
## How to Get Started

### Installation

```bash
pip install transformers torch peft accelerate bitsandbytes
```

### Loading the Model

#### GPU Usage (Recommended)
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel
import torch

# Load base model with 4-bit quantization (GPU)
base_model = "codellama/CodeLlama-7b-Instruct-hf"
model = AutoModelForCausalLM.from_pretrained(
    base_model,
    load_in_4bit=True,
    torch_dtype=torch.float16,
    device_map="auto"
)

# Load LoRA adapter
model = PeftModel.from_pretrained(model, "rafiaa/terraform-codellama-7b")
tokenizer = AutoTokenizer.from_pretrained(base_model)

# Set pad token
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token
```

#### CPU Usage (Alternative)
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel
import torch

# Load base model (CPU compatible)
base_model = "codellama/CodeLlama-7b-Instruct-hf"
model = AutoModelForCausalLM.from_pretrained(
    base_model,
    torch_dtype=torch.float32,
    device_map="cpu"
)

# Load LoRA adapter
model = PeftModel.from_pretrained(model, "rafiaa/terraform-codellama-7b")
tokenizer = AutoTokenizer.from_pretrained(base_model)

# Set pad token
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token
```

### Usage Example

```python
def generate_terraform(prompt, max_length=512):
    # Move inputs to the model's device (needed with device_map="auto")
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

    with torch.no_grad():
        outputs = model.generate(
            **inputs,
            max_length=max_length,
            temperature=0.7,
            do_sample=True,
            pad_token_id=tokenizer.eos_token_id
        )

    return tokenizer.decode(outputs[0], skip_special_tokens=True)

# Example usage
prompt = "Create a Terraform configuration for an AWS S3 bucket with versioning enabled"
result = generate_terraform(prompt)
print(result)
```

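Because `model.generate` returns the prompt followed by the completion, the decoded output echoes the request before the generated HCL. A small post-processing helper (hypothetical, for illustration) can strip the echoed prompt so only the completion remains:

```python
def extract_completion(prompt: str, generated: str) -> str:
    # If the decoded output begins with the prompt text, return only
    # the newly generated continuation.
    if generated.startswith(prompt):
        return generated[len(prompt):].lstrip()
    return generated

# Toy example of the echo-stripping behavior
sample = 'Create an S3 bucket\nresource "aws_s3_bucket" "b" {}'
completion = extract_completion("Create an S3 bucket", sample)
```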
## Training Details

### Training Data

- **Source**: Public Terraform Registry documentation
- **Data Type**: Terraform configuration files and documentation
- **Preprocessing**: Standard text preprocessing with a sequence length of 512 tokens

### Training Procedure

- **Method**: QLoRA (4-bit quantization + LoRA)
- **LoRA Rank**: 64
- **LoRA Alpha**: 16
- **Target Modules**: q_proj, v_proj
- **Training Epochs**: 3
- **Max Sequence Length**: 512
- **Quantization**: 4-bit (fp4)

### Training Hyperparameters

- **Training regime**: 4-bit mixed precision
- **LoRA Dropout**: 0.0
- **Learning Rate**: Optimized for QLoRA training
- **Batch Size**: Optimized for memory efficiency

## Limitations and Bias

### Known Limitations

- **Context Length**: Limited to 512 tokens due to the training configuration
- **Domain Specificity**: Optimized for Terraform; may not perform well on other infrastructure tools
- **Base Model Limitations**: Inherits limitations from CodeLlama-7b-Instruct-hf

### Recommendations

- Use for Terraform-specific tasks only
- Validate generated configurations before deployment
- Consider the 512-token context limit for complex configurations
- For production use, always review and test generated code

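One way to follow the validation advice above (a sketch, assuming the Terraform CLI is installed and the generated code was saved to `main.tf` in an empty directory):

```shell
# Static checks on generated HCL before any plan or apply
terraform fmt -check main.tf   # verify canonical formatting
terraform init -backend=false  # install providers without a state backend
terraform validate             # syntax and provider-schema validation
```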
## Environmental Impact

- **Training Method**: QLoRA reduces computational requirements significantly
- **Hardware**: Trained using efficient 4-bit quantization
- **Carbon Footprint**: Reduced compared to full fine-tuning due to QLoRA efficiency

## Citation

If you use this model in your research, please cite:

```bibtex
@misc{terraform-codellama-7b,
  title={terraform-codellama-7b: A LoRA Fine-tuned Model for Terraform Code Generation},
  author={Rafi Al Attrach and Patrick Schmitt and Nan Wu and Helena Schneider and Stefania Saju},
  year={2024},
  url={https://huggingface.co/rafiaa/terraform-codellama-7b}
}
```

## Related Models

- **Base Model**: [codellama/CodeLlama-7b-Instruct-hf](https://huggingface.co/codellama/CodeLlama-7b-Instruct-hf)
- **Enhanced Version**: [rafiaa/terraform-cloud-codellama-7b](https://huggingface.co/rafiaa/terraform-cloud-codellama-7b) (recommended; includes cloud provider documentation)

## Model Card Contact

- **Author**: rafiaa
- **Model Repository**: [HuggingFace Model](https://huggingface.co/rafiaa/terraform-codellama-7b)
- **Issues**: Please report issues through the HuggingFace model page

---

*This model is part of a research project conducted in early 2024, focusing on specialized code generation for infrastructure-as-code tools.*
adapter_config.json ADDED
{
  "alpha_pattern": {},
  "auto_mapping": null,
  "base_model_name_or_path": "codellama/CodeLlama-7b-Instruct-hf",
  "bias": "none",
  "fan_in_fan_out": false,
  "inference_mode": true,
  "init_lora_weights": true,
  "layers_pattern": null,
  "layers_to_transform": null,
  "loftq_config": {},
  "lora_alpha": 16,
  "lora_dropout": 0.0,
  "megatron_config": null,
  "megatron_core": "megatron.core",
  "modules_to_save": null,
  "peft_type": "LORA",
  "r": 64,
  "rank_pattern": {},
  "revision": null,
  "target_modules": [
    "q_proj",
    "v_proj"
  ],
  "task_type": "CAUSAL_LM"
}
adapter_model.safetensors ADDED
version https://git-lfs.github.com/spec/v1
oid sha256:cea474d825b4c72d43efbc03a1996e31d756f40a3a29403e534e92b8b23e3446
size 134235048
gitattributes ADDED
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
special_tokens_map.json ADDED
{
  "additional_special_tokens": [
    "▁<PRE>",
    "▁<MID>",
    "▁<SUF>",
    "▁<EOT>"
  ],
  "bos_token": {
    "content": "<s>",
    "lstrip": false,
    "normalized": true,
    "rstrip": false,
    "single_word": false
  },
  "eos_token": {
    "content": "</s>",
    "lstrip": false,
    "normalized": true,
    "rstrip": false,
    "single_word": false
  },
  "pad_token": "</s>",
  "unk_token": {
    "content": "<unk>",
    "lstrip": false,
    "normalized": true,
    "rstrip": false,
    "single_word": false
  }
}
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
{
  "added_tokens_decoder": {
    "0": {
      "content": "<unk>",
      "lstrip": false,
      "normalized": true,
      "rstrip": false,
      "single_word": false,
      "special": true
    },
    "1": {
      "content": "<s>",
      "lstrip": false,
      "normalized": true,
      "rstrip": false,
      "single_word": false,
      "special": true
    },
    "2": {
      "content": "</s>",
      "lstrip": false,
      "normalized": true,
      "rstrip": false,
      "single_word": false,
      "special": true
    },
    "32007": {
      "content": "▁<PRE>",
      "lstrip": false,
      "normalized": false,
      "rstrip": false,
      "single_word": false,
      "special": true
    },
    "32008": {
      "content": "▁<SUF>",
      "lstrip": false,
      "normalized": false,
      "rstrip": false,
      "single_word": false,
      "special": true
    },
    "32009": {
      "content": "▁<MID>",
      "lstrip": false,
      "normalized": false,
      "rstrip": false,
      "single_word": false,
      "special": true
    },
    "32010": {
      "content": "▁<EOT>",
      "lstrip": false,
      "normalized": false,
      "rstrip": false,
      "single_word": false,
      "special": true
    }
  },
  "additional_special_tokens": [
    "▁<PRE>",
    "▁<MID>",
    "▁<SUF>",
    "▁<EOT>"
  ],
  "bos_token": "<s>",
  "clean_up_tokenization_spaces": false,
  "eos_token": "</s>",
  "eot_token": "▁<EOT>",
  "fill_token": "<FILL_ME>",
  "legacy": null,
  "middle_token": "▁<MID>",
  "model_max_length": 1000000000000000019884624838656,
  "pad_token": "</s>",
  "prefix_token": "▁<PRE>",
  "sp_model_kwargs": {},
  "suffix_token": "▁<SUF>",
  "tokenizer_class": "CodeLlamaTokenizer",
  "unk_token": "<unk>",
  "use_default_system_prompt": false
}
training_args.bin ADDED
version https://git-lfs.github.com/spec/v1
oid sha256:5ce6b411763930e5bf203e954f08621cbe7096f1c46a4cb8ab58844d94b69172
size 4600