fakerbaby committed on
Commit 74a3d7c · verified · 1 Parent(s): 16bc57f

Update README.md

Files changed (1)
  1. README.md +6 -6
README.md CHANGED
@@ -64,12 +64,12 @@ These innovations lead to Broad Reasoning Generalization, allowing our RL-powere
  ## 3. Evaluation
 
  ### 🌟 Key Results
- - **MMMU:** 76.0 - *Open-source SOTA, approaching human expert low (76.2)*
- - **EMMA-Mini(CoT):** 40.3 - *Best in open source*
- - **MMK12:** 78.5 - *Best in open source*
- - **Physics Reasoning:** PhyX-MC-TM (52.8), SeePhys (31.5) - *Best in open source*
- - **Logic Reasoning:** MME-Reasoning (42.8) VisuLogic (28.5) - *Best in open source*
- - **Math Benchmarks:** MathVista (77.1), MathVerse (59.6), MathVision (52.6) - *Exceptional problem-solving*
+ - **MMMU:** 76.0
+ - **EMMA-Mini(CoT):** 40.3
+ - **MMK12:** 78.5
+ - **Physics Reasoning:** PhyX-MC-TM (52.8), SeePhys (31.5)
+ - **Logic Reasoning:** MME-Reasoning (42.8) VisuLogic (28.5)
+ - **Math Benchmarks:** MathVista (77.1), MathVerse (59.6), MathVision (52.6)
 
  <div align="center">
  <img src="https://huggingface.co/Skywork/Skywork-R1V3-38B/resolve/main/eval.png" width="800">