Update README.md
Browse files
README.md
CHANGED
|
@@ -55,17 +55,16 @@ Seed-Coder-8B-Instruct demonstrates strong performance across a variety of codin
|
|
| 55 |
- Robustness across different programming languages and domains.
|
| 56 |
- Ability to understand, reason, and repair complex code snippets.
|
| 57 |
|
| 58 |
-
| Model |
|
| 59 |
-
|
| 60 |
-
| CodeLlama-7B-Instruct |
|
| 61 |
-
| DeepSeek-Coder-6.7B-Instruct |
|
| 62 |
-
| CodeQwen1.5-7B-Chat |
|
| 63 |
-
| Yi-Coder-9B-Chat |
|
| 64 |
-
| Llama-3.1-8B-Instruct |
|
| 65 |
-
| OpenCoder-8B-Instruct |
|
| 66 |
-
| Qwen2.5-Coder-7B-Instruct |
|
| 67 |
-
| Seed-Coder-8B-Instruct (0411) |
|
| 68 |
-
|
| 69 |
|
| 70 |
For detailed results, please check our [📑 paper](https://arxiv.org/pdf/xxx.xxxxx).
|
| 71 |
|
|
|
|
| 55 |
- Robustness across different programming languages and domains.
|
| 56 |
- Ability to understand, reason, and repair complex code snippets.
|
| 57 |
|
| 58 |
+
| Model | HumanEval | MBPP | MHPP | BigCodeBench (Full) | BigCodeBench (Hard) | LiveCodeBench (2410-2502) |
|
| 59 |
+
|:-----------------------------:|:---------:|:----:|:----:|:-------------------:|:-------------------:|:-------------------------:|
|
| 60 |
+
| CodeLlama-7B-Instruct | 40.9 | 54.0 | 6.7 | 21.9 | 3.4 | 3.6 |
|
| 61 |
+
| DeepSeek-Coder-6.7B-Instruct | 74.4 | 74.9 | 20.0 | 35.5 | 10.1 | 9.6 |
|
| 62 |
+
| CodeQwen1.5-7B-Chat | 83.5 | 77.7 | 17.6 | 39.6 | 18.9 | 3.0 |
|
| 63 |
+
| Yi-Coder-9B-Chat | 82.3 | 82.0 | 26.7 | 38.1 | 11.5 | 17.5 |
|
| 64 |
+
| Llama-3.1-8B-Instruct | 68.3 | 70.1 | 17.1 | 36.6 | 13.5 | 11.5 |
|
| 65 |
+
| OpenCoder-8B-Instruct | 83.5 | 79.1 | 30.5 | 40.3 | 16.9 | 17.1 |
|
| 66 |
+
| Qwen2.5-Coder-7B-Instruct | 88.4 | 82.0 | 26.7 | 41.0 | 18.2 | 17.3 |
|
| 67 |
+
| Seed-Coder-8B-Instruct (0411) | 84.8 | 85.2 | 36.2 | 53.3 | 20.5 | 24.7 |
|
|
|
|
| 68 |
|
| 69 |
For detailed results, please check our [📑 paper](https://arxiv.org/pdf/xxx.xxxxx).
|
| 70 |
|