SPUH
/

UFEval

SPUH commited on Jun 11, 2025

Commit

ccba491

verified ·

1 Parent(s): 8fbf548

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -8,4 +8,15 @@ metrics:
 - accuracy
 base_model:
 - Qwen/Qwen2-VL-7B-Instruct
----

 - accuracy
 base_model:
 - Qwen/Qwen2-VL-7B-Instruct
+---
+# GenEval-7B
+## Model Summary
+`GenEval-7b` is the first open-source large multimodal model (LMM) designed as an evaluator across different tasks and modalities for assessing model performance. Built on the foundation of `Qwen2-VL-7B-Instruct`, it has been finetuned on [FRABench](https://huggingface.co/datasets/SPUH/FRABench) dataset.
+For further details, please refer to the following resources:
+- 📰 Paper: https://arxiv.org/abs/2505.12795
+- 🪐 Project Page: https://github.com/ALEX-nlp/FRABench-and-GenEval
+- 📦 Datasets: https://huggingface.co/datasets/SPUH/FRABench