Update README.md
Browse files
README.md
CHANGED
|
@@ -29,11 +29,15 @@ This model is based on [deepseek-coder-1.3b-base](https://huggingface.co/deepsee
|
|
| 29 |
|
| 30 |
## Benchmark Scores
|
| 31 |
|
| 32 |
-
|
| 33 |
-
|
| 34 |
-
|
| 35 |
-
|
| 36 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 37 |
|
| 38 |
|
| 39 |
## Model Usage
|
|
|
|
| 29 |
|
| 30 |
## Benchmark Scores
|
| 31 |
|
| 32 |
+
The performance of the OpenCodeInterpreter-DS-1.3B is highlighted below, showcasing the improvements when execution feedback is incorporated. Scores are presented for two benchmarks: HumanEval and MBPP, with an average increase indicated to demonstrate the overall enhancement in performance.
|
| 33 |
+
|
| 34 |
+
| **Benchmark** | **HumanEval (+)** | **MBPP (+)** | **Average (+)** |
|
| 35 |
+
|----------------------------|--------------------|--------------|-----------------|
|
| 36 |
+
| **OpenCodeInterpreter-DS-1.3B** | 0.652 (0.61) | 0.634 (0.524)| 0.643 (0.567) |
|
| 37 |
+
| **+ Execution Feedback** | 0.652 (0.622) | 0.652 (0.556)| 0.652 (0.589) |
|
| 38 |
+
|
| 39 |
+
*Note: The values in parentheses represent scores prior to the integration of execution feedback, illustrating the performance improvements across benchmarks.*
|
| 40 |
+
|
| 41 |
|
| 42 |
|
| 43 |
## Model Usage
|