SeaWolf-AI commited on
Commit
a54f121
·
verified ·
1 Parent(s): f2b1050

fix: update GPQA score to 88.1, add model-index

Browse files
Files changed (1) hide show
  1. README.md +15 -1
README.md CHANGED
@@ -15,13 +15,27 @@ tags:
15
  - mixture-of-experts
16
  - cohere2_moe
17
  - 218b
18
- - gpqa-90
19
  base_model:
20
  - FINAL-Bench/Darwin-218B-kr
21
  - CohereLabs/command-a-plus-05-2026-bf16
22
  base_model_relation: merge
23
  datasets:
24
  - FINAL-Bench/darwin-chem-data-v1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
25
  ---
26
 
27
  # Darwin-218B-Delphi
 
15
  - mixture-of-experts
16
  - cohere2_moe
17
  - 218b
18
+ - gpqa-88
19
  base_model:
20
  - FINAL-Bench/Darwin-218B-kr
21
  - CohereLabs/command-a-plus-05-2026-bf16
22
  base_model_relation: merge
23
  datasets:
24
  - FINAL-Bench/darwin-chem-data-v1
25
+ model-index:
26
+ - name: Darwin-218B-Delphi
27
+ results:
28
+ - task:
29
+ type: question-answering
30
+ name: Question Answering
31
+ dataset:
32
+ name: GPQA Diamond
33
+ type: Idavidrein/gpqa
34
+ config: gpqa_diamond
35
+ metrics:
36
+ - type: accuracy
37
+ value: 88.1
38
+ name: Accuracy
39
  ---
40
 
41
  # Darwin-218B-Delphi