---
language: en
license: apache-2.0
tags:
- llama2
- lora
- code
- adapter
datasets:
- iamtarun/python_code_instructions_18k_alpaca
---
|
# LLaMA-2-7B Code LoRA Adapter

This is a LoRA adapter for LLaMA-2-7B, fine-tuned on code-domain data.
|
## Model Details

- **Base Model**: meta-llama/Llama-2-7b-hf
- **Adapter Type**: LoRA (Low-Rank Adaptation)
- **Domain**: Code
- **Training Data**: Python code instructions
- **Training Examples**: 2000 (1600 train, 200 validation, 200 test; one way to reproduce the split is sketched below)
- **Epochs**: 2
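
The exact sampling procedure is not documented in this card; the sketch below shows one way a 2000-example subset with a 1600/200/200 split could be drawn from the listed dataset. The shuffle seed and selection order are assumptions.

```python
from datasets import load_dataset

# Illustrative only: draw a 2000-example subset and split it 1600/200/200.
# The seed and selection order actually used for training are not documented.
ds = load_dataset("iamtarun/python_code_instructions_18k_alpaca", split="train")
subset = ds.shuffle(seed=42).select(range(2000))
train_ds = subset.select(range(0, 1600))
val_ds = subset.select(range(1600, 1800))
test_ds = subset.select(range(1800, 2000))
```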
|
|
|
|
|
## LoRA Configuration

- Rank (r): 16
- Alpha: 32
- Dropout: 0.05
- Target Modules: q_proj, v_proj, k_proj, o_proj (see the configuration sketch below)
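
The values above correspond roughly to the following peft `LoraConfig`; `bias` and `task_type` are not stated in this card and are assumptions.

```python
from peft import LoraConfig

# Sketch of a LoraConfig matching the values listed above.
# bias and task_type are assumptions, not documented settings.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj", "k_proj", "o_proj"],
    bias="none",
    task_type="CAUSAL_LM",
)
```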
|
|
|
|
|
## Performance Metrics (100 test examples)

- Loss: 0.573
- Perplexity: 1.773 (the exponential of the loss; see the check below)
- BLEU: 32.76
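
Perplexity here is the exponential of the mean cross-entropy loss, so the two reported numbers are consistent:

```python
import math

# Perplexity is exp(mean cross-entropy loss): exp(0.573) ≈ 1.773.
print(round(math.exp(0.573), 3))
```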
|
|
|
|
|
## Usage

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# Load the base model in 8-bit
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",
    load_in_8bit=True,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")

# Load the LoRA adapter
model = PeftModel.from_pretrained(model, "Thamirawaran/llama2-7b-code-lora")

# Generate (do_sample=True so the temperature setting actually takes effect)
prompt = "Write a Python function to sort a list"
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
outputs = model.generate(**inputs, max_length=256, do_sample=True, temperature=0.7)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
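
If a standalone checkpoint is preferred over loading the adapter at runtime, the LoRA weights can be merged into a full-precision copy of the base model. This is a sketch, assuming enough memory for fp16 weights; the output directory name is hypothetical.

```python
import torch
from transformers import AutoModelForCausalLM
from peft import PeftModel

# Load the base model in fp16 so the LoRA weights can be merged cleanly
# (merging into an 8-bit quantized model is not recommended).
base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf", torch_dtype=torch.float16
)
merged = PeftModel.from_pretrained(base, "Thamirawaran/llama2-7b-code-lora")
merged = merged.merge_and_unload()  # fold adapter weights into the base layers
merged.save_pretrained("llama2-7b-code-merged")  # hypothetical output directory
```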
|
|
|
|
|
## Training Details

- Trained with the FYP_MDLE library
- 8-bit quantization during training
- Gradient accumulation: 16 steps
- Learning rate: 2e-4
- Warmup steps: 20 (see the configuration sketch below)
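
The hyperparameters above correspond roughly to a transformers `TrainingArguments` like the following; the output directory, per-device batch size, and fp16 flag are not documented in this card and are assumptions.

```python
from transformers import TrainingArguments

# Sketch of the training configuration described above.
# output_dir, per_device_train_batch_size, and fp16 are assumptions.
training_args = TrainingArguments(
    output_dir="llama2-7b-code-lora",
    num_train_epochs=2,
    per_device_train_batch_size=1,
    gradient_accumulation_steps=16,
    learning_rate=2e-4,
    warmup_steps=20,
    fp16=True,
)
```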
|
|
|
|
|
## Citation

```bibtex
@misc{llama2-code-lora,
  author = {Team RAISE},
  title = {LLaMA-2-7B Code LoRA Adapter},
  year = {2026},
  publisher = {HuggingFace},
  howpublished = {\url{https://huggingface.co/Thamirawaran/llama2-7b-code-lora}}
}
```
|
## License

Apache 2.0
|
|
|