Update README.md
Browse files
README.md
CHANGED
|
@@ -82,11 +82,11 @@ The results are pretty good! The model has issues, but could have legitimate use
|
|
| 82 |
|
| 83 |
Truthfulness and commonsense reasoning suffered the least from the prune / were healed the best. Knowledge and complex reasoning suffered the most.
|
| 84 |
This model has 67% the parameters of the original, and has:
|
| 85 |
-
~100% the TruthfulQA score of the original
|
| 86 |
-
~60% the ARC Challenge score
|
| 87 |
-
~65% the Hellaswag score
|
| 88 |
-
~85% the Winogrande score
|
| 89 |
-
~45% the the MMLU score
|
| 90 |
|
| 91 |
### Benchmarks
|
| 92 |
{Benchmark images on their way...}
|
|
|
|
| 82 |
|
| 83 |
Truthfulness and commonsense reasoning suffered the least from the prune / were healed the best. Knowledge and complex reasoning suffered the most.
|
| 84 |
This model has 67% the parameters of the original, and has:
|
| 85 |
+
- ~100% the TruthfulQA score of the original
|
| 86 |
+
- ~60% the ARC Challenge score
|
| 87 |
+
- ~65% the Hellaswag score
|
| 88 |
+
- ~85% the Winogrande score
|
| 89 |
+
- ~45% the the MMLU score
|
| 90 |
|
| 91 |
### Benchmarks
|
| 92 |
{Benchmark images on their way...}
|