Update README.md
README.md
CHANGED
@@ -66,7 +66,7 @@ This project introduces **General-Level** and **General-Bench**.
 ---
 
 # πππ General-Level<a name="level" />
-
+
 **A 5-scale level evaluation system with a new norm for assessing the multimodal generalists (multimodal LLMs/agents).
 The core is the use of <b style="color:red">synergy</b> as the evaluative criterion, categorizing capabilities based on whether MLLMs preserve synergy across comprehension and generation, as well as across multimodal interactions.**
 
@@ -89,7 +89,7 @@ The core is the use of <b style="color:red">synergy</b> as the evaluative criter
 ---
 
 # πππ General-Bench<a name="bench" />
-
+
 **A companion massive multimodal benchmark dataset, encompasses a broader spectrum of skills, modalities, formats, and capabilities, including over 700 tasks and 325K instances.**
 
 