Link Space to the paper (#1)
Browse files- Link Space to the paper (fabc9826d4179d4365f1ee00f9d134e31b5e1fcc)
Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>
README.md
CHANGED
|
@@ -41,4 +41,18 @@ If you encounter problem on the space, don't hesitate to restart it to remove th
|
|
| 41 |
You'll find
|
| 42 |
- the main table' columns names and properties in `src/display/utils.py`
|
| 43 |
- the logic to read all results and request files, then convert them in dataframe lines, in `src/leaderboard/read_evals.py`, and `src/populate.py`
|
| 44 |
-
- teh logic to allow or filter submissions in `src/submission/submit.py` and `src/submission/check_validity.py`
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 41 |
You'll find
|
| 42 |
- the main table' columns names and properties in `src/display/utils.py`
|
| 43 |
- the logic to read all results and request files, then convert them in dataframe lines, in `src/leaderboard/read_evals.py`, and `src/populate.py`
|
| 44 |
+
- teh logic to allow or filter submissions in `src/submission/submit.py` and `src/submission/check_validity.py`
|
| 45 |
+
|
| 46 |
+
# Citation
|
| 47 |
+
|
| 48 |
+
```bibtex
|
| 49 |
+
@misc{cao2024structevaldeepenbroadenlarge,
|
| 50 |
+
title={StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation},
|
| 51 |
+
author={Boxi Cao and Mengjie Ren and Hongyu Lin and Xianpei Han and Feng Zhang and Junfeng Zhan and Le Sun},
|
| 52 |
+
year={2024},
|
| 53 |
+
eprint={2408.03281},
|
| 54 |
+
archivePrefix={arXiv},
|
| 55 |
+
primaryClass={cs.CL},
|
| 56 |
+
url={https://arxiv.org/abs/2408.03281},
|
| 57 |
+
}
|
| 58 |
+
```
|