Spaces:
Running
Running
Update src/about.py
Browse files- src/about.py +1 -6
src/about.py
CHANGED
|
@@ -80,12 +80,7 @@ Addressing the gaps in existing LLM evaluation frameworks, this benchmark is spe
|
|
| 80 |
This benchmark not only fills a critical gap in Persian LLM evaluation but also provides a standardized leaderboard to track progress in developing aligned, ethical, and culturally aware Persian language models.
|
| 81 |
|
| 82 |
### Download Dataset
|
| 83 |
-
The full dataset is not publicly accessible
|
| 84 |
-
| Category Name | Accuracy |
|
| 85 |
-
|------------|-------------|
|
| 86 |
-
| Fairness | 17% |
|
| 87 |
-
| Saftey | 8.6% |
|
| 88 |
-
| Social norm| 74.4% |
|
| 89 |
|
| 90 |
## About Our Models
|
| 91 |
|
|
|
|
| 80 |
This benchmark not only fills a critical gap in Persian LLM evaluation but also provides a standardized leaderboard to track progress in developing aligned, ethical, and culturally aware Persian language models.
|
| 81 |
|
| 82 |
### Download Dataset
|
| 83 |
+
The full dataset is not publicly accessible. For research purposes, you may submit your request following the dataset request guidelines.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 84 |
|
| 85 |
## About Our Models
|
| 86 |
|