Update README.md
Browse files
README.md
CHANGED
|
@@ -70,14 +70,16 @@ Seed-Coder-8B-Instruct demonstrates strong performance across a variety of codin
|
|
| 70 |
|
| 71 |
| Model | HumanEval | MBPP | MHPP | BigCodeBench (Full) | BigCodeBench (Hard) | LiveCodeBench (2410-2502) |
|
| 72 |
|:-----------------------------:|:---------:|:----:|:----:|:-------------------:|:-------------------:|:-------------------------:|
|
| 73 |
-
|
|
| 74 |
-
|
|
| 75 |
-
|
|
| 76 |
-
|
|
| 77 |
-
|
|
| 78 |
-
|
|
| 79 |
-
|
|
| 80 |
-
|
|
|
|
|
|
|
|
| 81 |
|
| 82 |
For detailed results, please check our [📑 paper](https://arxiv.org/pdf/xxx.xxxxx).
|
| 83 |
|
|
|
|
| 70 |
|
| 71 |
| Model | HumanEval | MBPP | MHPP | BigCodeBench (Full) | BigCodeBench (Hard) | LiveCodeBench (2410-2502) |
|
| 72 |
|:-----------------------------:|:---------:|:----:|:----:|:-------------------:|:-------------------:|:-------------------------:|
|
| 73 |
+
| CodeLlama-7B-Instruct | 40.9 | 54.0 | 6.7 | 21.9 | 3.4 | 3.6 |
|
| 74 |
+
| DeepSeek-Coder-6.7B-Instruct | 74.4 | 74.9 | 20.0 | 35.5 | 10.1 | 9.6 |
|
| 75 |
+
| CodeQwen1.5-7B-Chat | 83.5 | 77.7 | 17.6 | 39.6 | 18.9 | 3.0 |
|
| 76 |
+
| Yi-Coder-9B-Chat | 82.3 | 82.0 | 26.7 | 38.1 | 11.5 | 17.5 |
|
| 77 |
+
| Llama-3.1-8B-Instruct | 68.3 | 70.1 | 17.1 | 36.6 | 13.5 | 11.5 |
|
| 78 |
+
| OpenCoder-8B-Instruct | 83.5 | 79.1 | 30.5 | 40.3 | 16.9 | 17.1 |
|
| 79 |
+
| Qwen2.5-Coder-7B-Instruct | 88.4 | 82.0 | 26.7 | 41.0 | 18.2 | 17.3 |
|
| 80 |
+
| Qwen3-8B | 84.8 | 77.0 | 32.8 | 51.7 | 23.0 | 23.5 |
|
| 81 |
+
| Seed-Coder-8B-Instruct (0411) | 84.8 | 85.2 | 36.2 | 53.3 | 20.5 | 24.7 |
|
| 82 |
+
|
| 83 |
|
| 84 |
For detailed results, please check our [📑 paper](https://arxiv.org/pdf/xxx.xxxxx).
|
| 85 |
|