Elinnos
/

codellama-fine-tuning

Model card Files Files and versions

xet

Community

Prithvik-1 commited on Nov 25, 2025

Commit

170941e

verified ·

1 Parent(s): 062da74

Upload RETRAIN_WITH_CHAT_FORMAT.md with huggingface_hub

Browse files

Files changed (1) hide show

RETRAIN_WITH_CHAT_FORMAT.md +66 -0

RETRAIN_WITH_CHAT_FORMAT.md ADDED Viewed

	@@ -0,0 +1,66 @@

+# 🔄 Retrain with CodeLlama Chat Template Format
+## ✅ What Was Done
+1. ✅ **Reformatted Dataset** - Created chat template format dataset
+2. ✅ **Split Dataset** - Split into train/val/test (70/9/15)
+3. ✅ **Updated Training Script** - Tokenization now handles chat format correctly
+## 📂 New Dataset Location
+**Chat Format Dataset:**
+- Original: `datasets/processed/elinnos_fifo_codellama_chat_format.jsonl` (94 samples)
+- Split Train: `datasets/processed/split_chat_format/train.jsonl` (70 samples)
+- Split Val: `datasets/processed/split_chat_format/val.jsonl` (9 samples)
+- Split Test: `datasets/processed/split_chat_format/test.jsonl` (15 samples)
+## 🚀 Retrain Command
+```bash
+cd /workspace/ftt/codellama-migration
+source /venv/main/bin/activate
+python3 scripts/training/finetune_codellama.py \
+    --base-model models/base-models/CodeLlama-7B-Instruct \
+    --dataset datasets/processed/split_chat_format/train.jsonl \
+    --val-dataset datasets/processed/split_chat_format/val.jsonl \
+    --output-dir training-outputs/codellama-fifo-v2-chat \
+    --max-length 1536 \
+    --num-epochs 5 \
+    --learning-rate 2e-5 \
+    --batch-size 4 \
+    --gradient-accumulation-steps 4 \
+    --lora-r 48 \
+    --lora-alpha 96 \
+    --resume-from-checkpoint auto
+```
+Or use the training script:
+```bash
+bash start_training_chat_format.sh
+```
+## 🔍 Key Changes
+1. **Training Format:**
+   - Old: `instruction + EOS + response + EOS`
+   - New: `instruction + response + EOS` (instruction already has chat template)
+2. **Inference Format:**
+   - Use CodeLlama chat template during inference
+   - Match the training format exactly
+## 📊 Expected Results
+After retraining with chat format:
+- ✅ Model should generate Verilog code (not unrelated text)
+- ✅ Model should understand the task correctly
+- ✅ Outputs should match training data format
+## ⚠️ Important Notes
+- **Old model won't work** - The format mismatch means the old model can't be used
+- **Must retrain** - New format requires retraining from scratch
+- **Use new dataset** - Always use `split_chat_format` for training