Commit
·
944264b
1
Parent(s):
91171cd
Update README.md
Browse files
README.md
CHANGED
|
@@ -60,7 +60,7 @@ other ACC: 71.64
|
|
| 60 |
|
| 61 |
social ACC: 75.37
|
| 62 |
|
| 63 |
-
**AVERAGE ACC:67.36**
|
| 64 |
|
| 65 |
|
| 66 |
## CEval (Val):
|
|
@@ -74,8 +74,8 @@ Other ACC: 70.23
|
|
| 74 |
|
| 75 |
Hard ACC:54.71
|
| 76 |
|
| 77 |
-
**AVERAGE ACC:73.10**
|
| 78 |
|
| 79 |
## GSM8K
|
| 80 |
|
| 81 |
-
**Zero-shot ACC 0.7012888551933283**
|
|
|
|
| 60 |
|
| 61 |
social ACC: 75.37
|
| 62 |
|
| 63 |
+
**AVERAGE ACC:67.36** (Outperforms ALL models under 70B, very close to those best 70B fine-tunes)
|
| 64 |
|
| 65 |
|
| 66 |
## CEval (Val):
|
|
|
|
| 74 |
|
| 75 |
Hard ACC:54.71
|
| 76 |
|
| 77 |
+
**AVERAGE ACC:73.10** (Outperforms Qwen-14B, and GPT-4)
|
| 78 |
|
| 79 |
## GSM8K
|
| 80 |
|
| 81 |
+
**Zero-shot ACC 0.7012888551933283** (Outperforms MetaMath-13B, Qwen-14B)
|