Quick Test Command for Single Training Sample
Updated Command (With Fixes Applied)
cd /workspace/ftt/codellama-migration && source /venv/main/bin/activate && python3 test_single_training_sample.py
This will:
- Load the first training sample
- Test with temperatures: 0.1, 0.2, 0.3
- Show expected vs generated output
- Help you find the best temperature
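The temperature comparison described above can be sketched roughly as follows. This is not the contents of test_single_training_sample.py, which is not shown here. The `generate` callable is a hypothetical stand-in for whatever runs the model, and character-level similarity from `difflib` is only one possible way to score "expected vs generated".

```python
from difflib import SequenceMatcher
from typing import Callable, Dict, Iterable, Tuple


def similarity(expected: str, generated: str) -> float:
    """Return a 0..1 character-level similarity ratio between two strings."""
    return SequenceMatcher(None, expected.strip(), generated.strip()).ratio()


def sweep_temperatures(
    generate: Callable[[str, float], str],  # hypothetical: (prompt, temperature) -> text
    prompt: str,
    expected: str,
    temperatures: Iterable[float] = (0.1, 0.2, 0.3),
) -> Tuple[float, Dict[float, float]]:
    """Score the output at each temperature and return (best_temperature, scores)."""
    scores = {t: similarity(expected, generate(prompt, t)) for t in temperatures}
    best = max(scores, key=scores.get)
    return best, scores
```

Passing a function that wraps the real inference call as `generate` would give one score per temperature, so the three runs can be compared on the same sample.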
Manual Test Command (Custom Temperature)
cd /workspace/ftt/codellama-migration
source /venv/main/bin/activate
# Extract sample #1 instruction
INSTRUCTION=$(sed -n '1p' datasets/processed/split/train.jsonl | python3 -c "import sys, json; print(json.load(sys.stdin)['instruction'])")
# Test with optimized parameters
python3 scripts/inference/inference_codellama.py \
--mode local \
--model-path training-outputs/codellama-fifo-v1 \
--prompt "$INSTRUCTION" \
--max-new-tokens 1000 \
--temperature 0.1
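If you want to test a sample other than the first, the sed and python3 one-liner can be replaced by a small helper. This is a minimal sketch that assumes only what the command above already relies on: one JSON object per line with an `instruction` field. The function name `load_instruction` is made up for illustration.

```python
import json
from itertools import islice


def load_instruction(jsonl_path: str, sample_number: int = 1) -> str:
    """Return the 'instruction' field of the sample_number-th line (1-based)."""
    with open(jsonl_path, encoding="utf-8") as handle:
        # islice skips the earlier lines without reading the whole file into memory.
        line = next(islice(handle, sample_number - 1, sample_number), None)
    if line is None:
        raise IndexError(f"{jsonl_path} has fewer than {sample_number} lines")
    return json.loads(line)["instruction"]
```

For example, `load_instruction("datasets/processed/split/train.jsonl", 1)` should return the same text as the shell extraction above.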
Key Hyperparameter Changes Made
| Parameter | Old | New | Impact |
|---|---|---|---|
| Repetition Penalty | 1.1 | 1.2 | Prevents repetitive text |
| Prompt Format | No EOS | + EOS token | Matches training format |
| Temperature | 0.3 | 0.1 (recommended) | More deterministic |
| Response Extraction | Full decode | New tokens only | Cleaner output |
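The four changes in the table can be illustrated with a short sketch. It is not the code in inference_codellama.py. The EOS string `</s>` is an assumption (read the real value from `tokenizer.eos_token`), and the keyword names mirror Hugging Face `generate` arguments, which the script may or may not use directly. The slicing relies on the fact that decoder-only models return the prompt tokens followed by the continuation.

```python
from typing import List, Sequence

EOS_TOKEN = "</s>"  # assumption: Llama-family EOS string; check tokenizer.eos_token

# Values from the table; key names follow Hugging Face generate() keywords.
GENERATION_KWARGS = {
    "do_sample": True,          # temperature only has an effect when sampling is on
    "temperature": 0.1,         # was 0.3
    "repetition_penalty": 1.2,  # was 1.1
    "max_new_tokens": 1000,
}


def build_prompt(instruction: str, eos_token: str = EOS_TOKEN) -> str:
    """Append EOS to the instruction so the prompt matches the training layout."""
    return instruction + eos_token


def new_tokens_only(prompt_ids: Sequence[int], output_ids: Sequence[int]) -> List[int]:
    """Drop the echoed prompt tokens and keep only the generated continuation."""
    return list(output_ids[len(prompt_ids):])
```

Decoding only the result of `new_tokens_only` keeps the prompt out of the printed response, which is what "New tokens only" in the table refers to.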
Expected Results
With these changes, you should see:
- Verilog code generation (not text notes)
- Complete modules (module ... endmodule)
- No repetitive "Note:" statements
- Better match with training data
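Two of these expectations can be checked automatically. The helper below is a rough sketch with a made-up name, `check_generated_output`. It looks for a `module ... endmodule` pair and counts "Note:" occurrences, and it does not try to validate the Verilog itself.

```python
import re


def check_generated_output(text: str) -> dict:
    """Report whether text contains a complete module and how often 'Note:' appears."""
    # \bmodule\b does not match inside "endmodule", so a lone endmodule is not enough.
    complete = re.search(r"\bmodule\b.*?\bendmodule\b", text, flags=re.DOTALL) is not None
    note_count = len(re.findall(r"\bNote:", text))
    return {"complete_module": complete, "note_count": note_count}
```

A result with `complete_module` set to False or a `note_count` above one suggests the model is still producing text notes instead of code.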
If Still Getting Text Instead of Code
Try these in order:
1. Lower temperature to 0.05:
   --temperature 0.05
2. Increase repetition penalty to 1.3 (edit the inference script or add it as a parameter).
3. Check prompt format:
   - Ensure the instruction matches the training data exactly
   - No extra formatting or wrappers
4. Verify training data format:
   - Should be: instruction + EOS + response + EOS
   - Response should start with a Verilog code fence (three backticks followed by verilog)
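The training data check can be scripted along these lines. This is a sketch under assumptions: the `response` field name and the `</s>` EOS string are guesses (only `instruction` appears in the commands above), and both helper names are hypothetical.

```python
import json
from typing import Iterable, List

EOS_TOKEN = "</s>"  # assumption: check tokenizer.eos_token for the real value
CODE_FENCE = "`" * 3 + "verilog"  # three backticks followed by "verilog"


def format_training_text(instruction: str, response: str, eos_token: str = EOS_TOKEN) -> str:
    """Lay out one sample as instruction + EOS + response + EOS."""
    return instruction + eos_token + response + eos_token


def find_bad_responses(jsonl_lines: Iterable[str], response_key: str = "response") -> List[int]:
    """Return 1-based line numbers whose response does not start with the Verilog fence."""
    bad = []
    for number, line in enumerate(jsonl_lines, start=1):
        if not line.strip():
            continue  # ignore blank lines
        record = json.loads(line)
        if not record.get(response_key, "").lstrip().startswith(CODE_FENCE):
            bad.append(number)
    return bad
```

Running `find_bad_responses(open("datasets/processed/split/train.jsonl", encoding="utf-8"))` would list the samples to inspect first; an empty list means every response starts with the expected fence.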
Test and let me know the results!