burtenshaw HF Staff commited on
Commit
2bc02b7
·
verified ·
1 Parent(s): 29b5ff8

Fix task_id to diamond (matching benchmark eval.yaml)

Browse files
Files changed (1) hide show
  1. .eval_results/gpqa.yaml +2 -1
.eval_results/gpqa.yaml CHANGED
@@ -1,8 +1,9 @@
1
  - dataset:
2
  id: Idavidrein/gpqa
3
- task_id: gpqa_diamond
4
  value: 38.89
5
  date: '2026-01-27'
6
  source:
7
  url: https://huggingface.co/LiquidAI/LFM2.5-1.2B-Instruct
8
  name: Model Card
 
 
1
  - dataset:
2
  id: Idavidrein/gpqa
3
+ task_id: diamond
4
  value: 38.89
5
  date: '2026-01-27'
6
  source:
7
  url: https://huggingface.co/LiquidAI/LFM2.5-1.2B-Instruct
8
  name: Model Card
9
+ user: burtenshaw