burtenshaw HF Staff commited on
Commit
7524de5
·
verified ·
1 Parent(s): 0ae2674

Fix task_id to match benchmark eval.yaml

Browse files
Files changed (1) hide show
  1. .eval_results/gpqa.yaml +2 -1
.eval_results/gpqa.yaml CHANGED
@@ -1,8 +1,9 @@
1
  - dataset:
2
  id: Idavidrein/gpqa
3
- task_id: gpqa_diamond
4
  value: 75.2
5
  date: '2026-01-27'
6
  source:
7
  url: https://huggingface.co/zai-org/GLM-4.7-Flash
8
  name: Model Card
 
 
1
  - dataset:
2
  id: Idavidrein/gpqa
3
+ task_id: diamond
4
  value: 75.2
5
  date: '2026-01-27'
6
  source:
7
  url: https://huggingface.co/zai-org/GLM-4.7-Flash
8
  name: Model Card
9
+ user: burtenshaw