nielsr (HF Staff) committed
Commit 336967b · verified · 1 Parent(s): 9184a60

Update .eval_results/swe_bench_verified.yaml

.eval_results/swe_bench_verified.yaml CHANGED
@@ -1,8 +1,9 @@
 - dataset:
     id: SWE-bench/SWE-bench_Verified
     task_id: swe_bench_%_resolved
-  value: 77.8
+  value: 72.80
   source:
-    url: https://huggingface.co/zai-org/GLM-5
-    name: Model card
-    user: nielsr
+    url: https://www.swebench.com/
+    name: SWE-Bench official evaluation
+    user: nielsr
+  notes: high reasoning