huihui-ai
/

Qwen2.5-7B-Instruct-abliterated-v2

Text Generation

text-generation-inference

Model card Files Files and versions

huihui-ai commited on Apr 28

Commit

fa8f8c2

·

verified ·

1 Parent(s): 8a2de0e

Update README.md

Files changed (1) hide show

README.md +7 -7

README.md CHANGED Viewed

@@ -101,12 +101,12 @@ while True:
 The following data has been re-evaluated and calculated as the average for each test.
-| Model                              |  IF_Eval  | BBH       | GPQA      | MMLU Pro  | TruthfulQA |
-|--------------------------------------|-----------------------|-----------|-----------|------------|
-| Qwen2.5-0.5B-Instruct                | **33.07** | **33.26** | 26.11     | **17.18** | 45.07      |
-| Qwen2.5-0.5B-Instruct-CensorTune     | 16.20     | 32.51     | 25.25     | 17.09     | **45.48**  |
-| Qwen2.5-0.5B-Instruct-abliterated-v3 | 33.02     | 32.58     | **26.45** | 16.42     | 39.24      |
-| Qwen2.5-0.5B-Instruct-abliterated-v2 | 32.15     | 32.51     | 26.43     | 16.29     | 39.56      |
-| Qwen2.5-0.5B-Instruct-abliterated-v1 | 32.96     | 32.83     | 26.23     | 16.42     | 45.40      |
 The script used for evaluation can be found inside this repository under /eval.sh, or click [here](https://huggingface.co/huihui-ai/Qwen2.5-7B-Instruct-abliterated-v2/blob/main/eval.sh)

 The following data has been re-evaluated and calculated as the average for each test.
+| Benchmark   | Qwen2.5-7B-Instruct | Qwen2.5-7B-Instruct-abliterated-v2 | Qwen2.5-7B-Instruct-abliterated |
+|-------------|---------------------|------------------------------------|---------------------------------|
+| IF_Eval     | 76.44               | **77.82**                          | 76.49                           |
+| MMLU Pro    | **43.12**           | 42.03                              | 41.71                           |
+| TruthfulQA  | 62.46               | 57.81                              | **64.92**                       |
+| BBH         | **53.92**           | 53.01                              | 52.77                           |
+| GPQA        | 31.91               | **32.17**                          | 31.97                           |
 The script used for evaluation can be found inside this repository under /eval.sh, or click [here](https://huggingface.co/huihui-ai/Qwen2.5-7B-Instruct-abliterated-v2/blob/main/eval.sh)