clean up the evals
README.md CHANGED

@@ -147,6 +147,20 @@ GGUF (2/3/4/5/6/8 bits): [MaziyarPanahi/phi-2-logical-sft-GGUF](https://huggingf
 ### Response:
 ```
 
+## [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__phi-2-logical-sft)
+
+| Metric                          |Value|
+|---------------------------------|----:|
+|Avg.                             |61.50|
+|AI2 Reasoning Challenge (25-Shot)|61.35|
+|HellaSwag (10-Shot)              |75.14|
+|MMLU (5-Shot)                    |57.40|
+|TruthfulQA (0-shot)              |44.39|
+|Winogrande (5-shot)              |74.90|
+|GSM8k (5-shot)                   |55.80|
+
+
 ## Examples
 
 ```
@@ -222,19 +236,6 @@ Now, let's eliminate the first possibility, because it contradicts the premise t
 ---
 
 
-
-## Model description
-
-More information needed
-
-## Intended uses & limitations
-
-More information needed
-
-## Training and evaluation data
-
-More information needed
-
 ## Training procedure
 
 ### Training hyperparameters
@@ -359,17 +360,6 @@ special_tokens:
 pad_token: "<|endoftext|>"
 ```
 
-</details
-# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
-Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__phi-2-logical-sft)
+</details>
 
-| Metric                          |Value|
-|---------------------------------|----:|
-|Avg.                             |61.50|
-|AI2 Reasoning Challenge (25-Shot)|61.35|
-|HellaSwag (10-Shot)              |75.14|
-|MMLU (5-Shot)                    |57.40|
-|TruthfulQA (0-shot)              |44.39|
-|Winogrande (5-shot)              |74.90|
-|GSM8k (5-shot)                   |55.80|
 
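The `Avg.` row in the leaderboard table this commit moves is simply the unweighted mean of the six benchmark scores, rounded to two decimals. A minimal sketch verifying that the reported average is internally consistent (score values are taken from the diff above; the variable names are illustrative, not part of any leaderboard API):

```python
# Benchmark scores from the Open LLM Leaderboard table in the README diff.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 61.35,
    "HellaSwag (10-Shot)": 75.14,
    "MMLU (5-Shot)": 57.40,
    "TruthfulQA (0-shot)": 44.39,
    "Winogrande (5-shot)": 74.90,
    "GSM8k (5-shot)": 55.80,
}

# The leaderboard "Avg." is the plain arithmetic mean of the six metrics.
avg = round(sum(scores.values()) / len(scores), 2)
print(avg)  # 61.5, matching the reported Avg. of 61.50
```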