Text Generation
Safetensors
GGUF
English
qwen3_5_text
claude
conversational
instruction-tuned
multilingual
reasoning
open-source
Eval Results
squ11z1 committed on
Commit 01f2257 · verified · 1 Parent(s): d61e3b9

Add gpqa_diamond eval result

Files changed (1)
  1. .eval_results/gpqa_diamond.yaml +7 -0
.eval_results/gpqa_diamond.yaml ADDED
@@ -0,0 +1,7 @@
+ - dataset:
+     id: Idavidrein/gpqa
+     task_id: gpqa_diamond
+   value: 83.4
+   source:
+     url: https://huggingface.co/squ11z1/claude-oss
+     name: Model Card
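
As a rough illustration of how an entry like the one added above could be sanity-checked before committing, here is a minimal Python sketch. The nesting of the fields is assumed from the diff, and `check_eval_entry` is a hypothetical helper, not part of any Hugging Face tooling. The 0-100 range check assumes the score is a percentage.

```python
from urllib.parse import urlparse

# Entry mirroring the fields added in this commit (nesting assumed from the diff).
entry = {
    "dataset": {"id": "Idavidrein/gpqa", "task_id": "gpqa_diamond"},
    "value": 83.4,
    "source": {
        "url": "https://huggingface.co/squ11z1/claude-oss",
        "name": "Model Card",
    },
}


def check_eval_entry(e):
    """Hypothetical sanity check; returns a list of problems (empty if none)."""
    problems = []

    # Both dataset identifiers should be non-empty strings.
    dataset = e.get("dataset", {})
    for key in ("id", "task_id"):
        if not isinstance(dataset.get(key), str) or not dataset.get(key):
            problems.append(f"dataset.{key} missing or empty")

    # The score should be numeric (bool is excluded explicitly).
    value = e.get("value")
    if isinstance(value, bool) or not isinstance(value, (int, float)):
        problems.append("value is not a number")
    elif not 0 <= value <= 100:
        problems.append("value outside 0-100 (assumes a percentage score)")

    # The source should point at an http(s) URL.
    url = e.get("source", {}).get("url", "")
    if urlparse(url).scheme not in ("http", "https"):
        problems.append("source.url is not an http(s) URL")

    return problems


problems = check_eval_entry(entry)
```

For the entry in this commit the returned list should be empty; a string score such as `"83.4"` would be reported as non-numeric.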