Elinnos/codellama-fine-tuning

Prithvik-1 committed on Nov 25, 2025

Commit

195b569

1 Parent(s): 597b65c

Upload FORMAT_ISSUE_ANALYSIS.md with huggingface_hub

Files changed (1)

FORMAT_ISSUE_ANALYSIS.md ADDED

+# 🔍 Format Issue Analysis
+## ❌ Problem Identified
+**Issue:** The fine-tuned model generates unrelated text (Kotlin code instead of Verilog)
+**Root Cause:** The prompt format used for training does not match the format CodeLlama-Instruct expects
+---
+## 📊 Format Comparison
+### CodeLlama-Instruct Expected Format:
+```
+<s>[INST] <<SYS>>
+System message
+<</SYS>>
+
+User message [/INST] Response </s>
+```
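As an illustration of the expected format, the following minimal sketch assembles an inference prompt by plain string formatting. It follows the Llama 2 reference layout, which places a blank line after `<</SYS>>`. The function name `build_inference_prompt` and the example messages are hypothetical, and `<s>` is written as a literal string purely for readability; many tokenizers add the BOS token themselves, so in real code the literal would usually be dropped or tokenization done with `add_special_tokens=False`.

```python
# Sketch of the Llama-2-style instruct prompt that CodeLlama-Instruct expects.
# The special-token strings are assumptions written out as literals for illustration.
BOS = "<s>"
B_INST, E_INST = "[INST]", "[/INST]"
B_SYS, E_SYS = "<<SYS>>\n", "\n<</SYS>>\n\n"


def build_inference_prompt(system: str, user: str) -> str:
    """Return the prompt up to and including [/INST]; the model continues from there."""
    return f"{BOS}{B_INST} {B_SYS}{system}{E_SYS}{user} {E_INST}"


# Hypothetical example messages.
prompt = build_inference_prompt(
    "You are a Verilog code generator.",
    "Write a 2-to-1 multiplexer module.",
)
print(prompt)
```

Note that the prompt stops at `[/INST]` and contains no EOS token; the response and the closing `</s>` are what the model is expected to produce.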
+### Current Training Format:
+```
+instruction + EOS + response + EOS
+```
+### During Training (Current):
+```python
+text = f"{instruction}{tokenizer.eos_token}{response}{tokenizer.eos_token}"
+# Result: "System prompt + task</s>Code here</s>"
+```
+### During Inference (Current):
+```python
+prompt = f"{instruction}{tokenizer.eos_token}"
+# Result: "System prompt + task</s>"
+```
+---
+## 🔧 Solution Options
+### Option 1: Use CodeLlama Chat Template (RECOMMENDED)
+- Update training script to use CodeLlama's chat template
+- Reformat dataset to use chat template format
+- Retrain with proper format
+- **Pros:** Matches the format the model was instruction-tuned on; likely better results
+- **Cons:** Requires retraining
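The dataset reformatting step in Option 1 could look roughly like the sketch below. The record fields `system`, `task`, and `response` are assumed names, not taken from the actual dataset, and the special tokens are again written as literal strings. The main difference from the current training format is that EOS appears exactly once, at the end of the response, instead of also separating the instruction from the response.

```python
from typing import Dict, List


def to_chat_text(record: Dict[str, str]) -> Dict[str, str]:
    """Convert one record into a single training string in the [INST] chat layout.

    The keys "system", "task", and "response" are hypothetical field names.
    """
    system = record["system"].strip()
    task = record["task"].strip()
    response = record["response"].strip()
    text = (
        f"<s>[INST] <<SYS>>\n{system}\n<</SYS>>\n\n"
        f"{task} [/INST] {response} </s>"
    )
    return {"text": text}


# Hypothetical sample record.
records: List[Dict[str, str]] = [
    {
        "system": "You are a Verilog code generator.",
        "task": "Write a 2-to-1 multiplexer module.",
        "response": "module mux2(input a, b, sel, output y);\n"
                    "  assign y = sel ? b : a;\nendmodule",
    }
]

formatted = [to_chat_text(r) for r in records]
print(formatted[0]["text"])
```

If the training script relies on the tokenizer to add BOS and EOS, the literal `<s>` and `</s>` should be removed from the string so the tokens are not duplicated.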
+### Option 2: Keep Simple Format (QUICK FIX)
+- Use the simple format for both training and inference
+- Ensure the inference prompt matches the training format exactly
+- **Pros:** No retraining needed
+- **Cons:** Does not use the format CodeLlama-Instruct was tuned on
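For Option 2, "matches exactly" can be checked mechanically: the inference prompt should be a literal prefix of the training text built from the same instruction. The sketch below is a string-level check only, with `</s>` assumed as the value of `tokenizer.eos_token` and hypothetical helper names; comparing token IDs would be a stricter test.

```python
EOS = "</s>"  # assumed value of tokenizer.eos_token


def training_text(instruction: str, response: str, eos: str = EOS) -> str:
    """Simple training format: instruction + EOS + response + EOS."""
    return f"{instruction}{eos}{response}{eos}"


def inference_prompt(instruction: str, eos: str = EOS) -> str:
    """Simple inference format: instruction + EOS."""
    return f"{instruction}{eos}"


def prompt_matches_training(instruction: str, response: str) -> bool:
    """True if the inference prompt is an exact prefix of the training text."""
    return training_text(instruction, response).startswith(inference_prompt(instruction))


# Hypothetical example strings.
instruction = "Generate Verilog for a 2-to-1 multiplexer."
response = "module mux2(input a, b, sel, output y); assign y = sel ? b : a; endmodule"

matches = prompt_matches_training(instruction, response)

# A stray trailing newline at inference time already breaks the match.
drifted = training_text(instruction, response).startswith(
    inference_prompt(instruction + "\n")
)
print(matches, drifted)
```

Small differences such as extra whitespace or a changed system prompt are enough to make the inference prompt diverge from what the model saw during training.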
+---
+## ✅ Recommended: Use CodeLlama Chat Template
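+CodeLlama-Instruct was instruction-tuned on prompts in the `[INST]` chat format shown above. Fine-tuning and running inference in that same format keeps the inputs consistent with what the model already expects, which should give better results than the simple format.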