AxionLab-official commited on
Commit
bf44e48
·
verified ·
1 Parent(s): 17065df

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +29 -2
README.md CHANGED
@@ -34,9 +34,36 @@ For the SFT (supervised finetuning) we used the full Alpaca-Cleaned dataset for
34
 
35
  ---
36
 
37
- ## Benchmarks of the Instruct Model
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
38
 
39
- ![image](https://cdn-uploads.huggingface.co/production/uploads/697f2832c2c5e4daa93cece7/zUZV38kCaAigP4bvV6hl8.png)
40
 
41
  ---
42
 
 
34
 
35
  ---
36
 
37
+ ## Benchmarks
38
+
39
+
40
+ | Task | Metric | Value |
41
+ | :--- | :--- | :---: |
42
+ | arc_easy | acc,none | 0.4659 |
43
+ | arc_easy | acc_stderr,none | 0.0102 |
44
+ | arc_easy | acc_norm,none | 0.4423 |
45
+ | arc_easy | acc_norm_stderr,none | 0.0102 |
46
+ | arc_challenge | acc,none | 0.2287 |
47
+ | arc_challenge | acc_stderr,none | 0.0123 |
48
+ | arc_challenge | acc_norm,none | 0.2756 |
49
+ | arc_challenge | acc_norm_stderr,none | 0.0131 |
50
+ | hellaswag | acc,none | 0.2794 |
51
+ | hellaswag | acc_stderr,none | 0.0045 |
52
+ | hellaswag | acc_norm,none | 0.2922 |
53
+ | hellaswag | acc_norm_stderr,none | 0.0045 |
54
+ | winogrande | acc,none | 0.5154 |
55
+ | winogrande | acc_stderr,none | 0.0140 |
56
+ | piqa | acc,none | 0.5558 |
57
+ | piqa | acc_stderr,none | 0.0114 |
58
+ | piqa | acc_norm,none | 0.5952 |
59
+ | piqa | acc_norm_stderr,none | 0.0115 |
60
+ | openbookqa | acc,none | 0.1580 |
61
+ | openbookqa | acc_stderr,none | 0.0163 |
62
+ | openbookqa | acc_norm,none | 0.2860 |
63
+ | openbookqa | acc_norm_stderr,none | 0.0202 |
64
+ | boolq | acc,none | 0.4205 |
65
+ | boolq | acc_stderr,none | 0.0086 |
66
 
 
67
 
68
  ---
69