# CodeWraith Model Evaluation Report ## Summary | Metric | CodeWraith-3b-v2 (Llama-3.2-3B-Instruct) | CodeWraith-8b-v2 (Llama-3.1-8B-Instruct) | |--------|-----|-----| | Avg Structural Score | 0.93 | 0.92 | | Function Coverage | 0.84 | 0.85 | | Class Coverage | 0.97 | 0.84 | | Argument Coverage | 0.91 | 0.93 | | Return Type Coverage | 0.97 | 0.97 | | Good Scores (>=80%) | 25 | 24 | | Avg Inference Time (s) | 20.01 | 21.91 | ## CodeWraith-3b-v2 (Llama-3.2-3B-Instruct) - Examples evaluated: 31 - Valid (parseable): 28 - Perfect scores: 15 - Total inference time: 620.2s ## CodeWraith-8b-v2 (Llama-3.1-8B-Instruct) - Examples evaluated: 31 - Valid (parseable): 28 - Perfect scores: 15 - Total inference time: 679.2s