Spaces:
Running
on
CPU Upgrade
Running
on
CPU Upgrade
danielz02
commited on
Add additional information about metrics
Browse files- src/display/about.py +3 -1
src/display/about.py
CHANGED
|
@@ -29,7 +29,7 @@ TITLE = """<h1 align="center" id="space-title">Trustworthy LLM leaderboard</h1>"
|
|
| 29 |
INTRODUCTION_TEXT = """Powered by the DecodingTrust platform, which provides comprehensive safety and trustworthiness
|
| 30 |
evaluation for LLMs, this leaderboard is designed to help researchers and practitioners better understand the
|
| 31 |
capabilities, limitations, and potential risks of state-of-the-art Large Language Models (LLMs). See our paper for
|
| 32 |
-
details. Access the DecodingTrust platform website [here](https://decodingtrust.github.io/)"""
|
| 33 |
|
| 34 |
# Which evaluations are you running? how can people reproduce what you have?
|
| 35 |
LLM_BENCHMARKS_TEXT = f"""
|
|
@@ -51,6 +51,8 @@ This project is organized around the following eight primary perspectives of tru
|
|
| 51 |
+ Machine Ethics
|
| 52 |
+ Fairness
|
| 53 |
|
|
|
|
|
|
|
| 54 |
## Reproducibility
|
| 55 |
To reproduce our results, checkout https://github.com/AI-secure/DecodingTrust
|
| 56 |
|
|
|
|
| 29 |
INTRODUCTION_TEXT = """Powered by the DecodingTrust platform, which provides comprehensive safety and trustworthiness
|
| 30 |
evaluation for LLMs, this leaderboard is designed to help researchers and practitioners better understand the
|
| 31 |
capabilities, limitations, and potential risks of state-of-the-art Large Language Models (LLMs). See our paper for
|
| 32 |
+
details. Access the DecodingTrust platform website [here](https://decodingtrust.github.io/)."""
|
| 33 |
|
| 34 |
# Which evaluations are you running? how can people reproduce what you have?
|
| 35 |
LLM_BENCHMARKS_TEXT = f"""
|
|
|
|
| 51 |
+ Machine Ethics
|
| 52 |
+ Fairness
|
| 53 |
|
| 54 |
+
We normalize the score of each perspective as 0-100, and these scores are the higher the better.
|
| 55 |
+
|
| 56 |
## Reproducibility
|
| 57 |
To reproduce our results, checkout https://github.com/AI-secure/DecodingTrust
|
| 58 |
|