CodeWraith Model Evaluation Report

Summary

| Metric | CodeWraith-3b (Llama-3.2-3B-Instruct) |
|---|---|
| Avg Structural Score | 0.92 |
| Function Coverage | 0.83 |
| Class Coverage | 0.92 |
| Argument Coverage | 0.93 |
| Return Type Coverage | 0.84 |
| Good Scores (>=80%) | 24 |
| Avg Inference Time (s) | 20.01 |

CodeWraith-3b (Llama-3.2-3B-Instruct)

  • Examples evaluated: 31
  • Valid (parseable): 28
  • Perfect scores: 13
  • Total inference time: 620.2s
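
The counts above can be cross-checked against the summary table with a little arithmetic. The sketch below is illustrative only and is not part of the CodeWraith evaluation code: the variable names and the `share` helper are hypothetical. It also assumes that every rate is taken over the 31 evaluated examples, since the report does not say whether the 24 good scores are counted against the 31 evaluated or the 28 valid examples.

```python
# Counts and timings copied from the report; all names here are hypothetical.
examples_evaluated = 31
valid_parseable = 28
perfect_scores = 13
good_scores = 24
total_inference_s = 620.2


def share(count, total):
    """Return count/total as a percentage rounded to one decimal place."""
    return round(100.0 * count / total, 1)


# Assumption: all rates use the number of evaluated examples as the denominator.
valid_rate = share(valid_parseable, examples_evaluated)
perfect_rate = share(perfect_scores, examples_evaluated)
good_rate = share(good_scores, examples_evaluated)

# Mean inference time per example, for comparison with the summary table.
avg_inference_s = round(total_inference_s / examples_evaluated, 2)

print(f"valid: {valid_rate}%  perfect: {perfect_rate}%  good: {good_rate}%")
print(f"avg inference time: {avg_inference_s}s")
```

Under that assumption, the average of 620.2 s over 31 examples rounds to the 20.01 s reported in the summary, and 28 of 31 outputs being parseable corresponds to a valid rate of about 90.3%.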