Update README.md
README.md
CHANGED
@@ -64,6 +64,7 @@ Debugged vibecoder dataset
 |-----------------|---------|------------------|--------|--------------|-----------------------------|------------|-------------|
 | gsm8k_cot | 3 | flexible-extract | 3 | exact_match ↑ | 0.8452+(0.7667) | 0.78 | 0.82 |
 | humaneval | 1 | create_test | 0 | exact_match ↑ | 0.933+( 0.8) | 0.73 | 0.92 |
+| mmlu_college_biology| 1 | create_test | 0 | exact_match ↑ | 1.0 | | |
 
 ## Example Usage
 