Nerdsking commited on
Commit
eea98df
·
verified ·
1 Parent(s): 223df54

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -95,8 +95,8 @@ We did not considered it for our score, but "if" considered those extra 5 questi
95
  <td>Reported in third-party performance overview; may differ by protocol</td>
96
  </tr>
97
  <tr>
98
- <td>Stable Code 3B*</td>
99
- <td>~32–33 (estimate)</td>
100
  <td>Indicative proxy from published code-task performance breakdowns (not a strict HumanEval pass@1)</td>
101
  </tr>
102
  <tr>
@@ -105,8 +105,8 @@ We did not considered it for our score, but "if" considered those extra 5 questi
105
  <td>Indicative proxy from published code-task performance breakdowns (not a strict HumanEval pass@1)</td>
106
  </tr>
107
  <tr>
108
- <td>StarCoder 3B*</td>
109
- <td>~21.6 (estimate)</td>
110
  <td>Indicative proxy from published code-task performance breakdowns (not a strict HumanEval pass@1)</td>
111
  </tr>
112
  </tbody>
 
95
  <td>Reported in third-party performance overview; may differ by protocol</td>
96
  </tr>
97
  <tr>
98
+ <td>Qwen/Qwen3-4B-GGUF (bigger size model)</td>
99
+ <td>~73%</td>
100
  <td>Indicative proxy from published code-task performance breakdowns (not a strict HumanEval pass@1)</td>
101
  </tr>
102
  <tr>
 
105
  <td>Indicative proxy from published code-task performance breakdowns (not a strict HumanEval pass@1)</td>
106
  </tr>
107
  <tr>
108
+ <td>CodeLlama 7B‑Python (bigger size model)</td>
109
+ <td>~74 % </td>
110
  <td>Indicative proxy from published code-task performance breakdowns (not a strict HumanEval pass@1)</td>
111
  </tr>
112
  </tbody>