Update README.md
README.md
CHANGED
@@ -33,14 +33,13 @@ After following the sampling-based Pass@1 methodology inspired by [DeepSeek R1](
 
 | Parameter | Value |
 |------------------|---------|
-| **Dataset** | `
+| **Dataset** | `HuggingFaceH4/MATH-500` |
 | **Temperature** | `0.6` |
 | **Top_p** | `0.95` |
 | **Num_samples** | `16` per question |
 
-### Results
 
-
+**At-least-one-correct Rate:** **54.60%** (273 out of 500 questions)
 
 *This metric represents the percentage of questions with at least one correct solution among multiple generated attempts.*
 
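
The at-least-one-correct rate described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the evaluation script behind this README: the function name `at_least_one_correct_rate`, the nested-list-of-booleans input layout, and the toy data are all hypothetical. Only the 273-out-of-500 figure comes from the README itself.

```python
from typing import Sequence


def at_least_one_correct_rate(results: Sequence[Sequence[bool]]) -> float:
    """Return the percentage of questions with at least one correct attempt.

    `results[i][j]` is True when sample j for question i was graded correct.
    This input layout is an assumption made for illustration.
    """
    if not results:
        raise ValueError("results must contain at least one question")
    # A question counts as solved if any of its sampled attempts is correct.
    solved = sum(1 for attempts in results if any(attempts))
    return 100.0 * solved / len(results)


# Toy data (made up): 4 questions, 3 samples each; 3 of the 4 are solved.
toy_results = [
    [False, True, False],
    [False, False, False],
    [True, True, True],
    [False, False, True],
]
print(f"{at_least_one_correct_rate(toy_results):.2f}%")  # 75.00%

# Arithmetic check of the reported figure: 273 solved out of 500 questions.
print(f"{100.0 * 273 / 500:.2f}%")  # 54.60%
```

With `Num_samples` set to `16`, each inner list would hold 16 entries, and a question counts once no matter how many of its attempts are correct.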