Update README.md
README.md
CHANGED
@@ -68,9 +68,7 @@ Same process applies. Usually, it is best to do a sliding window over the user a
 
 ## Evaluation Metrics
 The model was evaluated using EleutherAI's [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) test suite. It was evaluated on the following tasks:
-
-anli_r1,anli_r2,anli_r3,arc_challenge,arc_easy,boolq,cb,hellaswag,openbookqa,piqa,rte,truthfulqa_mc,wic,winogrande,wsc
-```
+
 ```
 | Task |Version| Metric |Value | |Stderr|
 |-------------|------:|--------|-----:|---|-----:|