Update README.md
Browse files
README.md
CHANGED
|
@@ -8,4 +8,15 @@ metrics:
|
|
| 8 |
- accuracy
|
| 9 |
base_model:
|
| 10 |
- Qwen/Qwen2-VL-7B-Instruct
|
| 11 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 8 |
- accuracy
|
| 9 |
base_model:
|
| 10 |
- Qwen/Qwen2-VL-7B-Instruct
|
| 11 |
+
---
|
| 12 |
+
|
| 13 |
+
# GenEval-7B
|
| 14 |
+
|
| 15 |
+
## Model Summary
|
| 16 |
+
|
| 17 |
+
`GenEval-7b` is the first open-source large multimodal model (LMM) designed as an evaluator across different tasks and modalities for assessing model performance. Built on the foundation of `Qwen2-VL-7B-Instruct`, it has been finetuned on [FRABench](https://huggingface.co/datasets/SPUH/FRABench) dataset.
|
| 18 |
+
|
| 19 |
+
For further details, please refer to the following resources:
|
| 20 |
+
- 📰 Paper: https://arxiv.org/abs/2505.12795
|
| 21 |
+
- 🪐 Project Page: https://github.com/ALEX-nlp/FRABench-and-GenEval
|
| 22 |
+
- 📦 Datasets: https://huggingface.co/datasets/SPUH/FRABench
|