Update README.md
README.md
CHANGED
@@ -113,7 +113,20 @@ This is my first English & Chinese MoE Model based on
 * [SUSTech/SUS-Chat-34B]
 
 
-# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/
+# [New Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_cloudyu__Mixtral_34Bx2_MoE_60B)
+
+|      Metric       |Value|
+|-------------------|----:|
+|Avg.               |27.42|
+|IFEval (0-Shot)    |45.38|
+|BBH (3-Shot)       |41.21|
+|MATH Lvl 5 (4-Shot)| 6.57|
+|GPQA (0-shot)      |11.74|
+|MuSR (0-shot)      |17.78|
+|MMLU-PRO (5-shot)  |41.85|
+
+# [Old New Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
 Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_cloudyu__Mixtral_34Bx2_MoE_60B)
 
 |      Metric       |Value|
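
A quick sanity check on the table this commit adds: the sketch below assumes that `Avg.` is the unweighted mean of the six benchmark scores. The diff does not say how the average is derived, so this is an assumption. The helper name `mean_score` is hypothetical and not part of any leaderboard tooling.

```python
# Scores copied from the table added in this commit.
scores = {
    "IFEval (0-Shot)": 45.38,
    "BBH (3-Shot)": 41.21,
    "MATH Lvl 5 (4-Shot)": 6.57,
    "GPQA (0-shot)": 11.74,
    "MuSR (0-shot)": 17.78,
    "MMLU-PRO (5-shot)": 41.85,
}


def mean_score(values):
    """Unweighted arithmetic mean (assumed, not confirmed, to match Avg.)."""
    values = list(values)
    return sum(values) / len(values)


# Round to two decimals, matching the precision used in the table.
average = round(mean_score(scores.values()), 2)
print(average)  # 27.42, which agrees with the Avg. row
```

Under that assumption the reported `Avg.` of 27.42 is consistent with the six individual scores.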