Spaces:

General-Level
/

README

Running

App Files Files Community

scofield7419 commited on Feb 24

Commit

b1bfa3e

verified ·

1 Parent(s): 75d0766

Update README.md

Browse files

Files changed (1) hide show

README.md +39 -1

README.md CHANGED Viewed

@@ -7,4 +7,42 @@ sdk: static
 pinned: false
 ---
-Edit this `README.md` markdown file to author your organization card.

 pinned: false
 ---
+<div align="center">
+<img src='https://cdn-uploads.huggingface.co/production/uploads/647773a1168cb428e00e9a8f/N8lP93rB6lL3iqzML4SKZ.png'  width=200px>
+<h1 align="center"><b>On Path to Multimodal Generalist: Levels and Benchmarks</b></h1>
+<p align="center">
+<a href="https://generalist.top/">[📖 Project]</a>
+<a href="https://level.generalist.top">[🏆 Leaderboard]</a>
+<a href="https://arxiv.org/abs/2510.10101">[📄 Paper]</a>
+<a href="https://huggingface.co/General-Level">[🤗 Dataset-HF]</a>
+<a href="https://github.com/path2generalist/GeneralBench">[📝 Dataset-Github]</a>
+</p>
+[![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](https://opensource.org/license/mit)
+---
+</div>
+<h1 align="center" style="color:#F27E7E"><em>
+Does higher performance across tasks indicate a stronger capability of MLLM, and closer to AGI?
+<br>
+NO! <b style="color:red">Synergy</b> does.
+</em></h1>
+This project introduces:
+1. **General-Level**, a 5-scale level evaluation system with a new norm for assessing the multimodal generalists (multimodal LLMs/agents). The core is the use of Synergy as the evaluative criterion, categorizing capabilities based on whether MLLMs preserve synergy across comprehension and generation, as well as across multimodal interactions.
+2. **General-Bench**, a companion  massive multimodal benchmark dataset, encompasses a broader spectrum of skills, modalities, formats, and capabilities, including over 700 tasks and 325K instances.