Add Artificial Analysis evaluations for ministral-14b

#5
Files changed (1) hide show
  1. README.md +53 -0
README.md CHANGED
@@ -0,0 +1,53 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ model-index:
3
+ - name: MackenzieTest
4
+ results:
5
+ - task:
6
+ type: evaluation
7
+ dataset:
8
+ name: Artificial Analysis Benchmarks
9
+ type: artificial_analysis
10
+ metrics:
11
+ - name: Artificial Analysis Intelligence Index
12
+ type: artificial_analysis_intelligence_index
13
+ value: 30.5
14
+ - name: Artificial Analysis Coding Index
15
+ type: artificial_analysis_coding_index
16
+ value: 21
17
+ - name: Artificial Analysis Math Index
18
+ type: artificial_analysis_math_index
19
+ value: 30
20
+ - name: Mmlu Pro
21
+ type: mmlu_pro
22
+ value: 0.693
23
+ - name: Gpqa
24
+ type: gpqa
25
+ value: 0.572
26
+ - name: Hle
27
+ type: hle
28
+ value: 0.046
29
+ - name: Livecodebench
30
+ type: livecodebench
31
+ value: 0.351
32
+ - name: Scicode
33
+ type: scicode
34
+ value: 0.236
35
+ - name: Aime 25
36
+ type: aime_25
37
+ value: 0.3
38
+ - name: Ifbench
39
+ type: ifbench
40
+ value: 0.32
41
+ - name: Lcr
42
+ type: lcr
43
+ value: 0.22
44
+ - name: Terminalbench Hard
45
+ type: terminalbench_hard
46
+ value: 0.043
47
+ - name: Tau2
48
+ type: tau2
49
+ value: 0.272
50
+ source:
51
+ name: Artificial Analysis API
52
+ url: https://artificialanalysis.ai
53
+ ---