Update README.md
README.md
@@ -133,10 +133,10 @@ Meta-Llama-3-70B-Instruct 9.006250
 
 ### OpenLLM Leaderboard Manual Evaluation
 
-| Model | ARC | Hellaswag | MMLU | TruthfulQA | Winogrande | GSM8K* |
-| :---- | ---: | ------: | ---: | ---: | ---: | ---: |
-| Smaug-Llama-3-70B-Instruct | 70.6 | 86.1 | 79.2 | 62.5 | 83.5 | 90.5 |
-| Llama-3-70B-Instruct | 71.4 | 85.7 | 80.0 | 61.8 | 82.9 | 91.1 |
+| Model | ARC | Hellaswag | MMLU | TruthfulQA | Winogrande | GSM8K* | Average |
+| :---- | ---: | ------: | ---: | ---: | ---: | ---: | ---: |
+| Smaug-Llama-3-70B-Instruct | 70.6 | 86.1 | 79.2 | 62.5 | 83.5 | 90.5 | 78.7 |
+| Llama-3-70B-Instruct | 71.4 | 85.7 | 80.0 | 61.8 | 82.9 | 91.1 | 78.8 |
 
 **GSM8K** The GSM8K numbers quoted here are computed using a recent release
 of the [LM Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness/).
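The Average column added by this change appears to be the unweighted mean of the six benchmark scores, rounded to one decimal place (an assumption; the README does not state the weighting). A minimal sketch reproducing both values:

```python
# Sketch: recompute the Average column as the plain mean of the six
# benchmark scores (ARC, Hellaswag, MMLU, TruthfulQA, Winogrande, GSM8K).
# Model names and scores are taken from the table above.
scores = {
    "Smaug-Llama-3-70B-Instruct": [70.6, 86.1, 79.2, 62.5, 83.5, 90.5],
    "Llama-3-70B-Instruct": [71.4, 85.7, 80.0, 61.8, 82.9, 91.1],
}

for model, vals in scores.items():
    avg = sum(vals) / len(vals)
    print(f"{model}: {avg:.1f}")
# → Smaug-Llama-3-70B-Instruct: 78.7
# → Llama-3-70B-Instruct: 78.8
```

These match the 78.7 and 78.8 entries in the new column, which supports the unweighted-mean assumption.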