nm-research committed on
Commit 700d80c · verified · 1 Parent(s): b9ebd8e

Update README.md

Files changed (1):
  1. README.md +12 -9
README.md CHANGED
@@ -150,18 +150,21 @@ evalplus.evaluate \
 
 #### OpenLLM Leaderboard V1 evaluation scores
 
-| Metric | ibm-granite/granite-3.1-8b-instruct | neuralmagic-ent/granite-3.1-8b-instruct-FP8-dynamic |
+| Metric | ibm-granite/granite-3.1-8b-instruct | neuralmagic-ent/granite-3.1-8b-instruct-quantized.w4a16 |
 |-----------------------------------------|:---------------------------------:|:-------------------------------------------:|
-| ARC-Challenge (Acc-Norm, 25-shot) | | |
-| GSM8K (Strict-Match, 5-shot) | | |
-| HellaSwag (Acc-Norm, 10-shot) | | |
-| MMLU (Acc, 5-shot) | | |
-| TruthfulQA (MC2, 0-shot) | | |
-| Winogrande (Acc, 5-shot) | | |
-| **Average Score** | **** | **** |
-| **Recovery** | **100.00** | **** |
+| ARC-Challenge (Acc-Norm, 25-shot) | 66.81 | 66.81 |
+| GSM8K (Strict-Match, 5-shot) | 64.52 | 66.64 |
+| HellaSwag (Acc-Norm, 10-shot) | 84.18 | 84.16 |
+| MMLU (Acc, 5-shot) | 65.52 | 65.36 |
+| TruthfulQA (MC2, 0-shot) | 60.57 | 60.52 |
+| Winogrande (Acc, 5-shot) | 80.19 | 79.95 |
+| **Average Score** | **70.30** | **70.57** |
+| **Recovery** | **100.00** | **100.39** |
 
 #### HumanEval pass@1 scores
+| Metric | ibm-granite/granite-3.1-8b-instruct | neuralmagic-ent/granite-3.1-8b-instruct-quantized.w4a16 |
+|-----------------------------------------|:---------------------------------:|:-------------------------------------------:|
+| HumanEval Pass@1 | 71.00 | 69.90 |
 
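The **Average Score** and **Recovery** rows added in this diff appear to be derived from the six benchmark scores above them: Recovery as the quantized model's average expressed as a percentage of the baseline's average. A minimal sketch of that arithmetic, under that assumption (the score lists are copied from the new table; variable names are illustrative, not from the README):

```python
# Benchmark scores from the updated table (OpenLLM Leaderboard V1 rows).
baseline = [66.81, 64.52, 84.18, 65.52, 60.57, 80.19]   # ibm-granite/granite-3.1-8b-instruct
quantized = [66.81, 66.64, 84.16, 65.36, 60.52, 79.95]  # granite-3.1-8b-instruct-quantized.w4a16

# Average score per model, then recovery as quantized/baseline in percent.
avg_baseline = sum(baseline) / len(baseline)
avg_quantized = sum(quantized) / len(quantized)
recovery = 100 * avg_quantized / avg_baseline

print(f"{avg_baseline:.2f} {avg_quantized:.2f} {recovery:.2f}")
# → 70.30 70.57 100.39
```

Note the rounding order matters slightly: 100 × 70.57 / 70.30 on the already-rounded averages gives 100.38, so the table's 100.39 is consistent with computing recovery from the unrounded averages.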