Update results
Browse files- results/Claude_Sonnet_4.tsv +0 -0
- results/GPT-5.tsv +0 -0
- results/Gemini_2.5_Pro.tsv +0 -0
- results/Gemma_3_27b.tsv +0 -0
- results/Human_Expert.tsv +0 -0
- results/Mistral_Medium_3.1.tsv +0 -0
- results/Qwen_2.5_VL_72b.tsv +0 -0
- results/Qwen_3_VL_235b_Thinking.tsv +0 -0
- results/Straight_Forward.tsv +0 -0
- results/o3.tsv +0 -0
results/Claude_Sonnet_4.tsv
CHANGED
|
The diff for this file is too large to render.
See raw diff
|
|
|
results/GPT-5.tsv
CHANGED
|
The diff for this file is too large to render.
See raw diff
|
|
|
results/Gemini_2.5_Pro.tsv
CHANGED
|
The diff for this file is too large to render.
See raw diff
|
|
|
results/Gemma_3_27b.tsv
CHANGED
|
The diff for this file is too large to render.
See raw diff
|
|
|
results/Human_Expert.tsv
CHANGED
|
The diff for this file is too large to render.
See raw diff
|
|
|
results/Mistral_Medium_3.1.tsv
CHANGED
|
The diff for this file is too large to render.
See raw diff
|
|
|
results/Qwen_2.5_VL_72b.tsv
CHANGED
|
The diff for this file is too large to render.
See raw diff
|
|
|
results/Qwen_3_VL_235b_Thinking.tsv
CHANGED
|
The diff for this file is too large to render.
See raw diff
|
|
|
results/Straight_Forward.tsv
CHANGED
|
The diff for this file is too large to render.
See raw diff
|
|
|
results/o3.tsv
CHANGED
|
The diff for this file is too large to render.
See raw diff
|
|
|