Update README.md
Browse files
README.md
CHANGED
|
@@ -69,7 +69,7 @@ datasets:
|
|
| 69 |
- **Primary Use Case** : Detecting and mitigating hidden risks in reasoning traces of Large Reasoning Models (LRMs)
|
| 70 |
|
| 71 |
- **Key Features** :
|
| 72 |
-
- **High Performance**: Achieves an average F1 score exceeding **92%** in QT Moderation tasks, outperforming existing models across both in-distribution (ID) and out-of-distribution (OOD) test sets.
|
| 73 |
|
| 74 |
- **Enhanced Explainability** : Employs a structured analysis process that improves decision transparency and provides clearer insights into safety assessments.
|
| 75 |
|
|
|
|
| 69 |
- **Primary Use Case** : Detecting and mitigating hidden risks in reasoning traces of Large Reasoning Models (LRMs)
|
| 70 |
|
| 71 |
- **Key Features** :
|
| 72 |
+
- **High Performance**: Achieves an average F1 score exceeding **92%** in QT Moderation tasks, outperforming existing models across both in-distribution (ID) and out-of-distribution (OOD) test sets, achieving state-of-the-art (SOTA) performance.
|
| 73 |
|
| 74 |
- **Enhanced Explainability** : Employs a structured analysis process that improves decision transparency and provides clearer insights into safety assessments.
|
| 75 |
|