RedHatAI
/

granite-3.1-8b-instruct-quantized.w4a16

Text Generation

compressed-tensors

Model card Files Files and versions

nm-research commited on Jan 16, 2025

Commit

d9ba0ad

·

verified ·

1 Parent(s): 99bee74

Update README.md

Files changed (1) hide show

README.md +13 -0

README.md CHANGED Viewed

@@ -201,6 +201,19 @@ evalplus.evaluate \
 | **Average Score**                       | **70.30**                         | **69.81**                                   |
 | **Recovery**                            | **100.00**                        | **99.31**                                   |
 #### HumanEval pass@1 scores
 | Metric                                  | ibm-granite/granite-3.1-8b-instruct             | neuralmagic-ent/granite-3.1-8b-instruct-quantized.w4a16 |
 |-----------------------------------------|:---------------------------------:|:-------------------------------------------:|

 | **Average Score**                       | **70.30**                         | **69.81**                                   |
 | **Recovery**                            | **100.00**                        | **99.31**                                   |
+#### OpenLLM Leaderboard V2 evaluation scores
+| Metric                                  | ibm-granite/granite-3.1-8b-instruct             | neuralmagic-ent/granite-3.1-8b-instruct-quantized.w4a16 |
+|-----------------------------------------|:---------------------------------:|:-------------------------------------------:|
+| IFEval (Inst Level Strict Acc, 0-shot)|                          | 73.14                                         |
+| BBH (Acc-Norm, 3-shot)            | 53.19                             | 51.52                                        |
+| Math-Hard (Exact-Match, 4-shot)   | 14.77                            | 16.66                                       |
+| GPQA (Acc-Norm, 0-shot)           | 31.76                             | 29.91                                        |
+| MUSR (Acc-Norm, 0-shot)           | 46.01                             | 45.75                                        |
+| MMLU-Pro (Acc, 5-shot)            | 74.01                             | 0.3423                                        |
+| **Average Score**                 | **42.61**                         | **41.87**                                    |
+| **Recovery**                      | **100.00**                         | **98.26**                                    |
 #### HumanEval pass@1 scores
 | Metric                                  | ibm-granite/granite-3.1-8b-instruct             | neuralmagic-ent/granite-3.1-8b-instruct-quantized.w4a16 |
 |-----------------------------------------|:---------------------------------:|:-------------------------------------------:|