UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3

angelahzyuan committed on Jun 28, 2024

Commit e75b851 · verified · 1 Parent(s): 3829d99

Update README.md

Files changed (1)

README.md +15 -12

@@ -142,6 +142,21 @@ Results are reported by using [lm-evaluation-harness](https://github.com/Eleuthe
 |[Llama-3-8B-SPPO Iter2](https://huggingface.co/UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter2) | 64.93 | 56.48 | 76.87 | 75.13 | 80.39 | 65.67 | 69.91
 |[Llama-3-8B-SPPO Iter3](https://huggingface.co/UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3) | 65.19 | 58.04 | 77.11 | 74.91 | 80.86 | 65.60 | **70.29**
+# [Open LLM Leaderboard 2 Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_UCLA-AGI__Llama-3-Instruct-8B-SPPO-Iter3)
+|      Metric       |Value|
+|-------------------|----:|
+|Avg.               |23.68|
+|IFEval (0-Shot)    |68.28|
+|BBH (3-Shot)       |29.74|
+|MATH Lvl 5 (4-Shot)| 7.33|
+|GPQA (0-shot)      | 2.01|
+|MuSR (0-shot)      | 3.09|
+|MMLU-PRO (5-shot)  |29.38|
 ### Training hyperparameters
 The following hyperparameters were used during training:
@@ -171,16 +186,4 @@ The following hyperparameters were used during training:
       primaryClass={cs.LG}
 }
 ```
-# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
-Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_UCLA-AGI__Llama-3-Instruct-8B-SPPO-Iter3)
-|      Metric       |Value|
-|-------------------|----:|
-|Avg.               |23.68|
-|IFEval (0-Shot)    |68.28|
-|BBH (3-Shot)       |29.74|
-|MATH Lvl 5 (4-Shot)| 7.33|
-|GPQA (0-shot)      | 2.01|
-|MuSR (0-shot)      | 3.09|
-|MMLU-PRO (5-shot)  |29.38|