CodeWraith Model Evaluation Report

Summary

| Metric | CodeWraith-8b-v2 (Llama-3.1-8B-Instruct) |
| --- | --- |
| Avg Structural Score | 0.92 |
| Function Coverage | 0.85 |
| Class Coverage | 0.84 |
| Argument Coverage | 0.93 |
| Return Type Coverage | 0.97 |
| Good Scores (>=80%) | 24 |
| Avg Inference Time (s) | 21.91 |

CodeWraith-8b-v2 (Llama-3.1-8B-Instruct)

  • Examples evaluated: 31
  • Valid (parseable): 28
  • Perfect scores: 15
  • Total inference time: 679.2s
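
The counts above can be cross-checked against the summary table with a little arithmetic. The following is a minimal sketch, not part of the evaluation pipeline: the variable names are hypothetical, and it assumes that the average inference time and the rates are taken over all 31 evaluated examples (the report does not state the denominators).

```python
# Figures copied from this report; variable names are illustrative only.
examples_evaluated = 31
valid_parseable = 28
perfect_scores = 15
total_inference_s = 679.2

# Assumption: the average is total time divided by all evaluated examples.
avg_inference_s = total_inference_s / examples_evaluated

# Assumption: rates are expressed relative to all evaluated examples.
valid_rate = valid_parseable / examples_evaluated
perfect_rate = perfect_scores / examples_evaluated

print(f"Avg inference time: {avg_inference_s:.2f} s")
print(f"Valid (parseable):  {valid_rate:.1%}")
print(f"Perfect scores:     {perfect_rate:.1%}")
```

Under these assumptions the computed average rounds to 21.91 s, which matches the Avg Inference Time reported in the summary.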