update leaderboard
meta_data.py CHANGED (+10 -2)

@@ -14,8 +14,16 @@ CITATION_BUTTON_LABEL = "Copy the following snippet to cite these results"
 # CONSTANTS-TEXT
 LEADERBORAD_INTRODUCTION = """# Open LMM Reasoning Leaderboard
 
-This leaderboard aims at providing a comprehensive evaluation of the reasoning capabilities of LMMs.
-Currently, it is a collection of evaluation results on multiple multi-modal mathematical reasoning benchmarks.
+This leaderboard aims at providing a comprehensive evaluation of the reasoning capabilities of LMMs.
+Currently, it is a collection of evaluation results on multiple multi-modal mathematical reasoning benchmarks.
+We obtain all evaluation results based on the [VLMEvalKit](https://github.com/open-compass/VLMEvalKit), with the corresponding dataset names:
+
+1. MathVista_MINI: The Test Mini split of MathVista dataset, around 1000 samples.
+2. MathVision: The Full test set of MathVision, around 3000 samples.
+3. MathVerse_MINI_Vision_Only: The Test Mini split of MathVerse, using the "Vision Only" mode, around 700 samples.
+4. DynaMath: The Full test set of DynaMath, around 5000 samples (501 original questions x 10 variants).
+
+To suggest new models or benchmarks for this leaderboard, please contact dhd.efz@gmail.com.
 """
 
 # CONSTANTS-FIELDS
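The four benchmarks added to the introduction could also be kept as structured metadata instead of free text, which makes sample counts and splits usable in code. A minimal sketch, assuming nothing beyond what the diff states; the `BENCHMARKS` dict and its field names are hypothetical, not part of meta_data.py:

```python
# Hypothetical sketch: the four benchmarks from the new introduction, keyed by
# their VLMEvalKit dataset names, with the approximate sample counts as listed.
BENCHMARKS = {
    "MathVista_MINI": {"split": "Test Mini", "approx_samples": 1000},
    "MathVision": {"split": "Full test set", "approx_samples": 3000},
    "MathVerse_MINI_Vision_Only": {"split": "Test Mini (Vision Only)", "approx_samples": 700},
    # 501 original questions x 10 variants = 5010 samples.
    "DynaMath": {"split": "Full test set", "approx_samples": 5010},
}

# Approximate total number of samples a model is evaluated on across all four.
total = sum(b["approx_samples"] for b in BENCHMARKS.values())
print(total)  # → 9710
```

Keeping this alongside the introduction string would let the leaderboard render the benchmark list and compute totals from one source of truth.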