Update README.md
Browse files
README.md
CHANGED
|
@@ -21,7 +21,7 @@ This model was evaluated on the OpenLLM v1 benchmarks and reasoning tasks (AIME-
|
|
| 21 |
|
| 22 |
Model outputs were generated with the vLLM engine.
|
| 23 |
|
| 24 |
-
For reasoning tasks we
|
| 25 |
|
| 26 |
|
| 27 |
`OpenLLM v1 `
|
|
|
|
| 21 |
|
| 22 |
Model outputs were generated with the vLLM engine.
|
| 23 |
|
| 24 |
+
For reasoning tasks we estimate pass@1 based on 10 runs with different seeds and `temperature=0.6`, `top_p=0.95` and `max_new_tokens=32768`.
|
| 25 |
|
| 26 |
|
| 27 |
`OpenLLM v1 `
|