Update README.md
Browse files
README.md
CHANGED
|
@@ -52,8 +52,12 @@ we employ the model's output to predict preferences and use pairwise accuracy as
|
|
| 52 |
| Idefics2 | 73.0 | 6.5 | 0.3 | 34.6 | 31.7 |
|
| 53 |
| SSIM-dyn | 42.5 | -5.5 | -17.0 | 28.4 | 36.5 |
|
| 54 |
| MES-dyn | 36.7 | -12.9 | -26.4 | 31.4 | 44.5 |
|
| 55 |
-
|
| 56 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 57 |
|
| 58 |
## Usage
|
| 59 |
### Installation
|
|
|
|
| 52 |
| Idefics2 | 73.0 | 6.5 | 0.3 | 34.6 | 31.7 |
|
| 53 |
| SSIM-dyn | 42.5 | -5.5 | -17.0 | 28.4 | 36.5 |
|
| 54 |
| MES-dyn | 36.7 | -12.9 | -26.4 | 31.4 | 44.5 |
|
| 55 |
+
| Fuyu | - | - | - | - | - |
|
| 56 |
+
| Kosmos-2 | - | - | - | - | - |
|
| 57 |
+
| CogVLM | - | - | - | - | - |
|
| 58 |
+
| OpenFlamingo | - | - | - | - | - |
|
| 59 |
+
The best in MantisScore series is in bold and the best in baselines is underlined.
|
| 60 |
+
"-" means the answer of MLLM is meaningless or in wrong format.
|
| 61 |
|
| 62 |
## Usage
|
| 63 |
### Installation
|