Update README.md
README.md
CHANGED
@@ -64,12 +64,12 @@ These innovations lead to Broad Reasoning Generalization, allowing our RL-powere
 ## 3. Evaluation
 
 ### π Key Results
-- **MMMU:** 76.0
-- **EMMA-Mini(CoT):** 40.3
-- **MMK12:** 78.5
-- **Physics Reasoning:** PhyX-MC-TM (52.8), SeePhys (31.5)
-- **Logic Reasoning:** MME-Reasoning (42.8) VisuLogic (28.5)
-- **Math Benchmarks:** MathVista (77.1), MathVerse (59.6), MathVision (52.6)
+- **MMMU:** 76.0
+- **EMMA-Mini(CoT):** 40.3
+- **MMK12:** 78.5
+- **Physics Reasoning:** PhyX-MC-TM (52.8), SeePhys (31.5)
+- **Logic Reasoning:** MME-Reasoning (42.8) VisuLogic (28.5)
+- **Math Benchmarks:** MathVista (77.1), MathVerse (59.6), MathVision (52.6)
 
 <div align="center">
 <img src="https://huggingface.co/Skywork/Skywork-R1V3-38B/resolve/main/eval.png" width="800">