Update README.md
Browse files
README.md
CHANGED
|
@@ -81,7 +81,7 @@ We evaluate our models on the OSWorld and OSWorld-Verified benchmarks following
|
|
| 81 |
| GTA1-7B-2507 w/ o3 | 100 | 45.2 | 53.1 |
|
| 82 |
| GTA1-7B-2507 w/ GPT-5 | 100 | — | 61.0 |
|
| 83 |
| GTA1-32B w/ o3 | 100 | — | 55.4 |
|
| 84 |
-
| GTA1-32B w/ GPT-5 | 100 | — |
|
| 85 |
|
| 86 |
> **Note:** A dash (—) indicates unavailable results.
|
| 87 |
|
|
|
|
| 81 |
| GTA1-7B-2507 w/ o3 | 100 | 45.2 | 53.1 |
|
| 82 |
| GTA1-7B-2507 w/ GPT-5 | 100 | — | 61.0 |
|
| 83 |
| GTA1-32B w/ o3 | 100 | — | 55.4 |
|
| 84 |
+
| GTA1-32B w/ GPT-5 | 100 | — | 63.4 |
|
| 85 |
|
| 86 |
> **Note:** A dash (—) indicates unavailable results.
|
| 87 |
|