Update README.md
Browse files
README.md
CHANGED
|
@@ -92,7 +92,7 @@ print("\n==================================\n")
|
|
| 92 |
|
| 93 |
## Performance
|
| 94 |
Unlike the other three benchmarks, which solely evaluate Safety Assessment (i.e., binary safe/unsafe classification), BeaverTails is a multi-class classification benchmark. Its F1 score evaluation extends beyond simple Safety Assessment to measure accuracy across multiple risk categories, providing a more fine-grained assessment of model performance.
|
| 95 |
-
|
| 96 |
## Model Description
|
| 97 |
|
| 98 |
- **Model type:** Guardrail model fine-tuned to enhance safety classification with critiques-augmented finetuning.
|
|
|
|
| 92 |
|
| 93 |
## Performance
|
| 94 |
Unlike the other three benchmarks, which solely evaluate Safety Assessment (i.e., binary safe/unsafe classification), BeaverTails is a multi-class classification benchmark. Its F1 score evaluation extends beyond simple Safety Assessment to measure accuracy across multiple risk categories, providing a more fine-grained assessment of model performance.
|
| 95 |
+

|
| 96 |
## Model Description
|
| 97 |
|
| 98 |
- **Model type:** Guardrail model fine-tuned to enhance safety classification with critiques-augmented finetuning.
|