SeaWolf-AI commited on
Commit
651807e
·
verified ·
1 Parent(s): a54f121

feat: add .eval_results/gpqa_diamond.yaml (88.1%) for GPQA leaderboard

Browse files
Files changed (1) hide show
  1. .eval_results/gpqa_diamond.yaml +9 -0
.eval_results/gpqa_diamond.yaml ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ - dataset:
2
+ id: Idavidrein/gpqa
3
+ task_id: diamond
4
+ value: 88.1
5
+ date: '2026-04-27'
6
+ source:
7
+ url: https://huggingface.co/FINAL-Bench/Darwin-218B-Delphi
8
+ name: Model Card
9
+ notes: "Darwin-DELPHI 218B MoE, Pass@1"