Update README.md
Browse files
README.md
CHANGED
|
@@ -59,8 +59,8 @@ Unlike earlier LLMs that demanded rigid prompt engineering, vibe-code interactio
|
|
| 59 |
## Benchmark
|
| 60 |

|
| 61 |
|
| 62 |
-
| Tasks |Version| Filter |n-shot| Metric | | gpt-oss-20b-rl |
|
| 63 |
-
|-----------------------|------:|----------------|-----:|-----------|---|----:|
|
| 64 |
|gpqa_diamond_cot_n_shot| 2|flexible-extract| 5|exact_match|↑ | 0.7633|
|
| 65 |
|humaneval| 1|create_test| 0|pass@1| |0.8452|
|
| 66 |
## Example Usage
|
|
|
|
| 59 |
## Benchmark
|
| 60 |

|
| 61 |
|
| 62 |
+
| Tasks |Version| Filter |n-shot| Metric | | gpt-oss-20b-rl | gpt-oss-20 |
|
| 63 |
+
|-----------------------|------:|----------------|-----:|-----------|---|----:|----:|
|
| 64 |
|gpqa_diamond_cot_n_shot| 2|flexible-extract| 5|exact_match|↑ | 0.7633|
|
| 65 |
|humaneval| 1|create_test| 0|pass@1| |0.8452|
|
| 66 |
## Example Usage
|