Update src/about.py
src/about.py CHANGED +3 -6
@@ -62,12 +62,9 @@ Addressing the gaps in existing LLM evaluation frameworks, this benchmark is spe
 ### A Unified Framework for Persian LLM Evaluation
 By combining these datasets, our work establishes a culturally grounded alignment evaluation framework, enabling systematic assessment across three key aspects:
 
-- **Safety
-
-- **
-Mitigating biases in model outputs.
-- **Social Norms**:
-Ensuring culturally appropriate behavior.
+- **Safety:** Avoiding harmful or toxic content.
+- **Fairness:** Mitigating biases in model outputs.
+- **Social Norms:** Ensuring culturally appropriate behavior.
 
 
 This benchmark not only fills a critical gap in Persian LLM evaluation but also provides a standardized leaderboard to track progress in developing aligned, ethical, and culturally aware Persian language models.