PyTorch
English
llama
Eval Results

Add EvalEval community eval results

#18
Files changed (1) hide show
  1. .eval_results/gpqa-diamond.yaml +9 -0
.eval_results/gpqa-diamond.yaml ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ - dataset:
2
+ id: Idavidrein/gpqa
3
+ task_id: diamond
4
+ date: '2026-04-24'
5
+ notes: GPQA Diamond
6
+ source:
7
+ name: EvalEval
8
+ url: https://huggingface.co/datasets/evaleval/EEE_datastore/blob/b11a260fe158662bb63b4a144be2b5690615414d/flat/objects/a1/ce/a1ceb877-0159-470e-8e0d-3a31c6d8d7a5.json
9
+ value: 65.6565656566