Update README.md
README.md
CHANGED
@@ -21,4 +21,19 @@ More details are provided in Appendix K of our [paper](https://arxiv.org/abs/250
 <div align="center">
 <img src="performance.png" alt="Framework" width="1200"/>
 <br>
-</div>
+</div>
+
+## 🫣Citation
+If you find our benchmark, evaluation pipeline or models useful or interesting, please cite our paper.
+
+```
+@misc{xu2025edubenchcomprehensivebenchmarkingdataset,
+title={EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scenarios},
+author={Bin Xu and Yu Bai and Huashan Sun and Yiguan Lin and Siming Liu and Xinyue Liang and Yaolin Li and Yang Gao and Heyan Huang},
+year={2025},
+eprint={2505.16160},
+archivePrefix={arXiv},
+primaryClass={cs.CL},
+url={https://arxiv.org/abs/2505.16160},
+}
+```