Text Generation
Safetensors
GGUF
English
qwen3_5_text
claude
conversational
instruction-tuned
multilingual
reasoning
open-source
Eval Results

Update .eval_results/gpqa_diamond.yaml

#1
Files changed (1)
  1. .eval_results/gpqa_diamond.yaml +1 -1
.eval_results/gpqa_diamond.yaml CHANGED
@@ -1,6 +1,6 @@
 - dataset:
     id: Idavidrein/gpqa
-    task_id: gpqa_diamond
+    task_id: diamond
   value: 83.4
   source:
     url: https://huggingface.co/squ11z1/claude-oss