Update README.md
Browse files
README.md
CHANGED
|
@@ -35,4 +35,14 @@ Benchmark Scores
|
|
| 35 |
|
| 36 |
| Tasks |Version|Filter|n-shot|Metric|Value | |Stderr|
|
| 37 |
|----------|------:|------|-----:|------|-----:|---|-----:|
|
| 38 |
-
|winogrande| 1|none | 0|acc |0.7774|± |0.0117|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 35 |
|
| 36 |
| Tasks |Version|Filter|n-shot|Metric|Value | |Stderr|
|
| 37 |
|----------|------:|------|-----:|------|-----:|---|-----:|
|
| 38 |
+
|winogrande| 1|none | 0|acc |0.7774|± |0.0117|
|
| 39 |
+
|
| 40 |
+
|Tasks|Version| Filter |n-shot| Metric |Value | |Stderr|
|
| 41 |
+
|-----|------:|----------|-----:|-----------|-----:|---|-----:|
|
| 42 |
+
|gsm8k| 2|get-answer| 5|exact_match|0.6732|± |0.0129|
|
| 43 |
+
|
| 44 |
+
| Tasks |Version|Filter|n-shot|Metric|Value | |Stderr|
|
| 45 |
+
|--------------|------:|------|-----:|------|-----:|---|-----:|
|
| 46 |
+
|truthfulqa_mc2| 2|none | 0|acc |0.4795|± |0.0148|
|
| 47 |
+
|
| 48 |
+
Average 65.658
|