Adding Evaluation Results
#2
by
AbrorBalxiyev - opened
README.md
CHANGED
|
@@ -1,5 +1,7 @@
|
|
| 1 |
---
|
| 2 |
-
|
|
|
|
|
|
|
| 3 |
tags:
|
| 4 |
- llama-2
|
| 5 |
- instruct
|
|
@@ -8,14 +10,12 @@ tags:
|
|
| 8 |
- gpt4
|
| 9 |
- synthetic data
|
| 10 |
- distillation
|
|
|
|
| 11 |
datasets:
|
| 12 |
- teknium/openhermes
|
| 13 |
model-index:
|
| 14 |
- name: openhermes-13b
|
| 15 |
results: []
|
| 16 |
-
license: mit
|
| 17 |
-
language:
|
| 18 |
-
- en
|
| 19 |
---
|
| 20 |
|
| 21 |
# OpenHermes-13B
|
|
@@ -155,4 +155,17 @@ The following hyperparameters were used during training:
|
|
| 155 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
| 156 |
- lr_scheduler_type: cosine
|
| 157 |
- lr_scheduler_warmup_steps: 300
|
| 158 |
-
- num_epochs: 3
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
+
language:
|
| 3 |
+
- en
|
| 4 |
+
license: mit
|
| 5 |
tags:
|
| 6 |
- llama-2
|
| 7 |
- instruct
|
|
|
|
| 10 |
- gpt4
|
| 11 |
- synthetic data
|
| 12 |
- distillation
|
| 13 |
+
base_model: NousResearch/Llama-2-13b-hf
|
| 14 |
datasets:
|
| 15 |
- teknium/openhermes
|
| 16 |
model-index:
|
| 17 |
- name: openhermes-13b
|
| 18 |
results: []
|
|
|
|
|
|
|
|
|
|
| 19 |
---
|
| 20 |
|
| 21 |
# OpenHermes-13B
|
|
|
|
| 155 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
| 156 |
- lr_scheduler_type: cosine
|
| 157 |
- lr_scheduler_warmup_steps: 300
|
| 158 |
+
- num_epochs: 3
|
| 159 |
+
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
|
| 160 |
+
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/teknium__OpenHermes-13B-details)
|
| 161 |
+
|
| 162 |
+
| Metric |Value|
|
| 163 |
+
|-------------------|----:|
|
| 164 |
+
|Avg. |12.18|
|
| 165 |
+
|IFEval (0-Shot) |26.68|
|
| 166 |
+
|BBH (3-Shot) |18.21|
|
| 167 |
+
|MATH Lvl 5 (4-Shot)| 1.21|
|
| 168 |
+
|GPQA (0-shot) | 3.02|
|
| 169 |
+
|MuSR (0-shot) | 8.53|
|
| 170 |
+
|MMLU-PRO (5-shot) |15.44|
|
| 171 |
+
|