Update README.md
Browse files
README.md
CHANGED
|
@@ -63,8 +63,8 @@ Results here are preliminary and reflect internal benchmarking on the same task
|
|
| 63 |
---
|
| 64 |
|
| 65 |
### Method
|
| 66 |
-
- GRPO
|
| 67 |
-
- Evol Merging
|
| 68 |
|
| 69 |
### **License**
|
| 70 |
|
|
|
|
| 63 |
---
|
| 64 |
|
| 65 |
### Method
|
| 66 |
+
- GRPO (Rule base reward + self confidence reward)
|
| 67 |
+
- Evol Merging
|
| 68 |
|
| 69 |
### **License**
|
| 70 |
|