Update README.md
README.md
CHANGED
@@ -110,20 +110,3 @@ The following hyperparameters were used during training:
 - Pytorch 2.1.2+cu121
 - Datasets 2.18.0
 - Tokenizers 0.15.2
-
-# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
-
-Results for the English Open LLM Leaderboard. For results specific to Dutch, check out [ScandEval](https://scandeval.com/dutch-nlg/).
-
-Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_BramVanroy__fietje-2-instruct)
-
-| Metric |Value|
-|-------------------|----:|
-|Avg. |10.20|
-|IFEval (0-Shot) |27.90|
-|BBH (3-Shot) |17.57|
-|MATH Lvl 5 (4-Shot)| 0.53|
-|GPQA (0-shot) | 0.00|
-|MuSR (0-shot) | 2.91|
-|MMLU-PRO (5-shot) |12.26|
-