Update README.md
README.md
CHANGED
@@ -92,7 +92,7 @@ The core is the use of <b style="color:red">synergy</b> as the evaluative criter
 
 We set two dataset types according to the use purpose:
 - [**General-Bench-Openset**](https://huggingface.co/datasets/General-Level/General-Bench-Openset) with inputs and labels of samples all publicly open, for **free open-world use** (e.g., for academic experiment/comparisons).
-- [**General-Bench-Closeset**](https://huggingface.co/datasets/General-Level/General-Bench-Closeset) with only sample inputs available, which is used for ranking
+- [**General-Bench-Closeset**](https://huggingface.co/datasets/General-Level/General-Bench-Closeset) with only sample inputs available, which is used for **leaderboard ranking**. Participants need to submit the predictions to us for internal evaluation.
 
 
 <div align="center">