Elinnos
/

codellama-fine-tuning

Prithvik-1 commited on Nov 25, 2025

Commit

062da74

verified ·

1 Parent(s): a2b3989

Upload SOLUTION_DATASET_REFORMAT.md with huggingface_hub

Files changed (1) hide show

SOLUTION_DATASET_REFORMAT.md ADDED Viewed

+# 🔧 Solution: Reformat Dataset and Retrain
+## ❌ Problem
+The model is generating **completely unrelated code** (Kotlin/Android) instead of Verilog because:
+1. **Format Mismatch**: CodeLlama-Instruct expects chat template format (`<s>[INST]...[/INST]`)
+2. **Training Used Simple Format**: `instruction + EOS + response + EOS`
+3. **Model Confusion**: Model didn't learn the task properly due to format mismatch
+## ✅ Solution: Use CodeLlama Chat Template Format
+We need to:
+1. Reformat dataset to use CodeLlama's chat template
+2. Update training script to use chat template format
+3. Retrain with proper format
+---
+## 📋 Steps to Fix
+### Step 1: Reformat Dataset
+Run:
+```bash
+cd /workspace/ftt/codellama-migration
+source /venv/main/bin/activate
+python3 reformat_dataset_for_codellama.py
+```
+This creates: `datasets/processed/elinnos_fifo_codellama_chat_format.jsonl`
+### Step 2: Update Training Script
+The training script needs to use CodeLlama's chat template format.
+### Step 3: Split and Retrain
+Split the reformatted dataset and retrain.
+---
+## 🎯 Expected Chat Template Format
+**For Training:**
+```
+<s>[INST] <<SYS>>
+System prompt
+<</SYS>>
+User task [/INST] Response </s>
+```
+**For Inference:**
+```
+<s>[INST] <<SYS>>
+System prompt
+<</SYS>>
+User task [/INST]
+```
+The model will continue generating the response after `[/INST]`.