Spaces:
Running
Running
Update src/about.py
Browse files- src/about.py +7 -3
src/about.py
CHANGED
|
@@ -61,9 +61,13 @@ Addressing the gaps in existing LLM evaluation frameworks, this benchmark is spe
|
|
| 61 |
|
| 62 |
### A Unified Framework for Persian LLM Evaluation
|
| 63 |
By combining these datasets, our work establishes a culturally grounded alignment evaluation framework, enabling systematic assessment across three key aspects:
|
| 64 |
-
|
| 65 |
-
- **
|
| 66 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 67 |
|
| 68 |
|
| 69 |
This benchmark not only fills a critical gap in Persian LLM evaluation but also provides a standardized leaderboard to track progress in developing aligned, ethical, and culturally aware Persian language models.
|
|
|
|
| 61 |
|
| 62 |
### A Unified Framework for Persian LLM Evaluation
|
| 63 |
By combining these datasets, our work establishes a culturally grounded alignment evaluation framework, enabling systematic assessment across three key aspects:
|
| 64 |
+
|
| 65 |
+
- **Safety**:
|
| 66 |
+
Avoiding harmful or toxic content.
|
| 67 |
+
- **Fairness**:
|
| 68 |
+
Mitigating biases in model outputs.
|
| 69 |
+
- **Social Norms**:
|
| 70 |
+
Ensuring culturally appropriate behavior.
|
| 71 |
|
| 72 |
|
| 73 |
This benchmark not only fills a critical gap in Persian LLM evaluation but also provides a standardized leaderboard to track progress in developing aligned, ethical, and culturally aware Persian language models.
|