Slim Frikha
committed on
Update README.md
README.md
CHANGED
@@ -167,10 +167,10 @@ We report in the following table our internal pipeline benchmarks.
 </tr>
 <tr>
 <td>GPQA (0-shot)</td>
-<td>32.2</td>
+<td><b>32.2</b></td>
 <td>29.2</td>
 <td>27.0</td>
-<td
+<td>29.6</td>
 </tr>
 <tr>
 <td>GPQA (0-shot, COT)</td>