CodeWraith Model Evaluation Report

Summary

| Metric | CodeWraith-3b-v2 (Llama-3.2-3B-Instruct) |
|---|---|
| Avg Structural Score | 0.93 |
| Function Coverage | 0.84 |
| Class Coverage | 0.97 |
| Argument Coverage | 0.91 |
| Return Type Coverage | 0.97 |
| Good Scores (>=80%) | 25 |
| Avg Inference Time (s) | 20.01 |

CodeWraith-3b-v2 (Llama-3.2-3B-Instruct)

  • Examples evaluated: 31
  • Valid (parseable): 28
  • Perfect scores: 15
  • Total inference time: 620.2s
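
The counts above can be cross-checked against the summary with a few lines of arithmetic. The sketch below is not part of the evaluation code; the variable names are illustrative, and it assumes every rate uses all 31 evaluated examples as the denominator, which the report does not state explicitly. Under that assumption, the average inference time reproduces the 20.01 s reported in the summary.

```python
# Figures copied from the report; variable names are hypothetical.
examples = 31              # examples evaluated
valid = 28                 # outputs that parsed successfully
perfect = 15               # examples with a perfect score
good = 25                  # examples scoring >= 80%
total_inference_s = 620.2  # total inference time in seconds


def rate(count, total):
    """Return count/total as a percentage rounded to one decimal place."""
    return round(100 * count / total, 1)


# Assumption: all rates are taken over the 31 evaluated examples.
avg_inference_s = round(total_inference_s / examples, 2)
valid_rate = rate(valid, examples)
perfect_rate = rate(perfect, examples)
good_rate = rate(good, examples)

print(f"Avg inference time: {avg_inference_s} s")
print(f"Valid (parseable):  {valid_rate}%")
print(f"Perfect scores:     {perfect_rate}%")
print(f"Good scores:        {good_rate}%")
```

If the rates were instead taken over the 28 parseable outputs only, the percentages would be higher; the report gives no indication of which denominator was intended.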