Add Artificial Analysis evaluations for ministral-8b

#6
Files changed (1)
  1. README.md +53 -0
README.md CHANGED
@@ -0,0 +1,53 @@
+ ---
+ model-index:
+ - name: MackenzieTest
+   results:
+   - task:
+       type: evaluation
+     dataset:
+       name: Artificial Analysis Benchmarks
+       type: artificial_analysis
+     metrics:
+     - name: Artificial Analysis Intelligence Index
+       type: artificial_analysis_intelligence_index
+       value: 28.2
+     - name: Artificial Analysis Coding Index
+       type: artificial_analysis_coding_index
+       value: 18.4
+     - name: Artificial Analysis Math Index
+       type: artificial_analysis_math_index
+       value: 31.7
+     - name: Mmlu Pro
+       type: mmlu_pro
+       value: 0.642
+     - name: Gpqa
+       type: gpqa
+       value: 0.471
+     - name: Hle
+       type: hle
+       value: 0.043
+     - name: Livecodebench
+       type: livecodebench
+       value: 0.303
+     - name: Scicode
+       type: scicode
+       value: 0.208
+     - name: Aime 25
+       type: aime_25
+       value: 0.317
+     - name: Ifbench
+       type: ifbench
+       value: 0.291
+     - name: Lcr
+       type: lcr
+       value: 0.24
+     - name: Terminalbench Hard
+       type: terminalbench_hard
+       value: 0.043
+     - name: Tau2
+       type: tau2
+       value: 0.266
+     source:
+       name: Artificial Analysis API
+       url: https://artificialanalysis.ai
+ ---
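
The metrics in this diff appear to mix two scales: the three `artificial_analysis_*_index` entries look like 0-100 scores, while the individual benchmarks look like fractions between 0 and 1. The sketch below is not part of the PR. It shows one way a reader might put the values on a common 0-100 scale, under the assumption that the `artificial_analysis_` prefix marks the index metrics and that everything else is a fraction. The names `METRICS`, `INDEX_PREFIX`, `to_percent`, and `normalize` are hypothetical.

```python
# Metric values copied from the README.md diff above (type -> value).
METRICS = {
    "artificial_analysis_intelligence_index": 28.2,
    "artificial_analysis_coding_index": 18.4,
    "artificial_analysis_math_index": 31.7,
    "mmlu_pro": 0.642,
    "gpqa": 0.471,
    "hle": 0.043,
    "livecodebench": 0.303,
    "scicode": 0.208,
    "aime_25": 0.317,
    "ifbench": 0.291,
    "lcr": 0.24,
    "terminalbench_hard": 0.043,
    "tau2": 0.266,
}

# Assumption: metric types with this prefix are already on a 0-100 scale.
INDEX_PREFIX = "artificial_analysis_"


def to_percent(metric_type: str, value: float) -> float:
    """Return the metric on a 0-100 scale, rounded to one decimal place."""
    if metric_type.startswith(INDEX_PREFIX):
        return value
    # Assumption: every non-index metric is a fraction in [0, 1].
    if not 0.0 <= value <= 1.0:
        raise ValueError(f"{metric_type}: expected a fraction in [0, 1], got {value}")
    return round(value * 100, 1)


def normalize(metrics: dict) -> dict:
    """Apply to_percent to every entry, keeping the original key order."""
    return {name: to_percent(name, value) for name, value in metrics.items()}


if __name__ == "__main__":
    for name, pct in normalize(METRICS).items():
        print(f"{name}: {pct:.1f}")
```

If the scale assumption holds, a consumer of this metadata would need a step like this before comparing the index values with the per-benchmark values side by side.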