adaamko commited on
Commit
8f5689d
·
verified ·
1 Parent(s): 95a9c82

Add Kimi K2 baseline results

Browse files
Files changed (1) hide show
  1. README.md +1 -0
README.md CHANGED
@@ -48,6 +48,7 @@ Evaluated on 617 held-out test samples from SWE-bench repositories, across 14 to
48
  |-------|-----------|--------|------|-------------|
49
  | **Squeez-2B** | **0.8043** | **0.8624** | **0.7895** | 0.9150 |
50
  | Qwen 3.5 35B A3B (zero-shot) | 0.7402 | 0.7498 | 0.7000 | 0.9177 |
 
51
  | Qwen 3.5 2B (untrained) | 0.4154 | 0.5299 | 0.4075 | 0.8197 |
52
  | BM25 (10%) | 0.1277 | 0.2172 | 0.1314 | 0.9036 |
53
  | First-N (10%) | 0.0741 | 0.1445 | 0.0798 | 0.9055 |
 
48
  |-------|-----------|--------|------|-------------|
49
  | **Squeez-2B** | **0.8043** | **0.8624** | **0.7895** | 0.9150 |
50
  | Qwen 3.5 35B A3B (zero-shot) | 0.7402 | 0.7498 | 0.7000 | 0.9177 |
51
+ | Kimi K2 (zero-shot) | 0.6128 | 0.5286 | 0.5344 | 0.9425 |
52
  | Qwen 3.5 2B (untrained) | 0.4154 | 0.5299 | 0.4075 | 0.8197 |
53
  | BM25 (10%) | 0.1277 | 0.2172 | 0.1314 | 0.9036 |
54
  | First-N (10%) | 0.0741 | 0.1445 | 0.0798 | 0.9055 |