## 🏆 Evaluation

### Nous

The evaluation was performed using [LLM AutoEval](https://github.com/mlabonne/llm-autoeval) on the Nous suite.

| Model | Average | AGIEval | GPT4All | TruthfulQA | Bigbench |
|---|---:|---:|---:|---:|---:|

You can find the complete benchmark on [YALL - Yet Another LLM Leaderboard](https://huggingface.co/spaces/mlabonne/Yet_Another_LLM_Leaderboard).

### [Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)

Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_mlabonne__NeuralDaredevil-7B).

| Metric                            | Value |
|-----------------------------------|------:|
| Avg.                              | 74.12 |
| AI2 Reasoning Challenge (25-Shot) | 69.88 |
| HellaSwag (10-Shot)               | 87.62 |
| MMLU (5-Shot)                     | 65.12 |
| TruthfulQA (0-shot)               | 66.85 |
| Winogrande (5-shot)               | 82.08 |
| GSM8k (5-shot)                    | 73.16 |
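
The leaderboard's "Avg." column is simply the unweighted arithmetic mean of the six benchmark scores; a quick sketch that re-derives it from the table values above:

```python
# Open LLM Leaderboard scores, copied from the table above.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 69.88,
    "HellaSwag (10-Shot)": 87.62,
    "MMLU (5-Shot)": 65.12,
    "TruthfulQA (0-shot)": 66.85,
    "Winogrande (5-shot)": 82.08,
    "GSM8k (5-shot)": 73.16,
}

# The reported average is the unweighted mean, rounded to two decimals.
average = round(sum(scores.values()) / len(scores), 2)
print(average)  # 74.12
```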

## 💻 Usage

```python
# ...
print(outputs[0]["generated_text"])
```
<img src="https://raw.githubusercontent.com/argilla-io/distilabel/main/docs/assets/distilabel-badge-light.png" alt="Built with Distilabel" width="200" height="32"/>
</a>
</p>