Update README.md
Browse files
README.md
CHANGED
|
@@ -95,7 +95,7 @@ We did not considered it for our score, but "if" considered those extra 5 questi
|
|
| 95 |
<td>Reported in third-party performance overview; may differ by protocol</td>
|
| 96 |
</tr>
|
| 97 |
<tr>
|
| 98 |
-
<td>Qwen/Qwen3-4B-GGUF (bigger size model)</td>
|
| 99 |
<td>~73%</td>
|
| 100 |
<td>Indicative proxy from published code-task performance breakdowns (not a strict HumanEval pass@1)</td>
|
| 101 |
</tr>
|
|
@@ -105,7 +105,7 @@ We did not considered it for our score, but "if" considered those extra 5 questi
|
|
| 105 |
<td>Indicative proxy from published code-task performance breakdowns (not a strict HumanEval pass@1)</td>
|
| 106 |
</tr>
|
| 107 |
<tr>
|
| 108 |
-
<td>CodeLlama 7B‑Python (bigger size model)</td>
|
| 109 |
<td>~74 % </td>
|
| 110 |
<td>Indicative proxy from published code-task performance breakdowns (not a strict HumanEval pass@1)</td>
|
| 111 |
</tr>
|
|
|
|
| 95 |
<td>Reported in third-party performance overview; may differ by protocol</td>
|
| 96 |
</tr>
|
| 97 |
<tr>
|
| 98 |
+
<td>Qwen/Qwen3-4B-GGUF (<u>bigger</u> size model)</td>
|
| 99 |
<td>~73%</td>
|
| 100 |
<td>Indicative proxy from published code-task performance breakdowns (not a strict HumanEval pass@1)</td>
|
| 101 |
</tr>
|
|
|
|
| 105 |
<td>Indicative proxy from published code-task performance breakdowns (not a strict HumanEval pass@1)</td>
|
| 106 |
</tr>
|
| 107 |
<tr>
|
| 108 |
+
<td>CodeLlama 7B‑Python (<u>much bigger</u> size model)</td>
|
| 109 |
<td>~74 % </td>
|
| 110 |
<td>Indicative proxy from published code-task performance breakdowns (not a strict HumanEval pass@1)</td>
|
| 111 |
</tr>
|