Spaces:
Running
Running
Update src/about.py
Browse files- src/about.py +3 -3
src/about.py
CHANGED
|
@@ -62,9 +62,9 @@ Addressing the gaps in existing LLM evaluation frameworks, this benchmark is spe
|
|
| 62 |
### A Unified Framework for Persian LLM Evaluation
|
| 63 |
By combining these datasets, our work establishes a culturally grounded alignment evaluation framework, enabling systematic assessment across three key aspects:
|
| 64 |
|
| 65 |
-
-
|
| 66 |
-
-
|
| 67 |
-
-
|
| 68 |
|
| 69 |
|
| 70 |
This benchmark not only fills a critical gap in Persian LLM evaluation but also provides a standardized leaderboard to track progress in developing aligned, ethical, and culturally aware Persian language models.
|
|
|
|
| 62 |
### A Unified Framework for Persian LLM Evaluation
|
| 63 |
By combining these datasets, our work establishes a culturally grounded alignment evaluation framework, enabling systematic assessment across three key aspects:
|
| 64 |
|
| 65 |
+
- Safety: Avoiding harmful or toxic content.
|
| 66 |
+
- Fairness: Mitigating biases in model outputs.
|
| 67 |
+
- Social Norms: Ensuring culturally appropriate behavior.
|
| 68 |
|
| 69 |
|
| 70 |
This benchmark not only fills a critical gap in Persian LLM evaluation but also provides a standardized leaderboard to track progress in developing aligned, ethical, and culturally aware Persian language models.
|