Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -31,6 +31,7 @@ Trained on ~41K samples from public safety datasets (WildGuard, BeaverTails, Tox
|
|
| 31 |
|
| 32 |
| Model | Params | F1 |
|
| 33 |
|---|---|---|
|
|
|
|
| 34 |
| Qwen3Guard-8B | 8B | 73% |
|
| 35 |
| AprielGuard-8B | 8B | 72% |
|
| 36 |
| Granite Guardian-8B | 8B | 71% |
|
|
@@ -44,8 +45,6 @@ Trained on ~41K samples from public safety datasets (WildGuard, BeaverTails, Tox
|
|
| 44 |
| ToxDectRoberta | 125M | 34.6% |
|
| 45 |
| HateBERT | 110M | 11.6% |
|
| 46 |
|
| 47 |
-
Beats LlamaGuard 3 (8B), ShieldGemma (27B), LlamaGuard 4 (12B), and all encoder-based models in its class. 100x smaller than the nearest guard model that outperforms it.
|
| 48 |
-
|
| 49 |
## WildGuardBench
|
| 50 |
|
| 51 |
| Model | Params | WGTest F1 |
|
|
|
|
| 31 |
|
| 32 |
| Model | Params | F1 |
|
| 33 |
|---|---|---|
|
| 34 |
+
| Toxic Prompt RoBERTa | 125M | 78.7% |
|
| 35 |
| Qwen3Guard-8B | 8B | 73% |
|
| 36 |
| AprielGuard-8B | 8B | 72% |
|
| 37 |
| Granite Guardian-8B | 8B | 71% |
|
|
|
|
| 45 |
| ToxDectRoberta | 125M | 34.6% |
|
| 46 |
| HateBERT | 110M | 11.6% |
|
| 47 |
|
|
|
|
|
|
|
| 48 |
## WildGuardBench
|
| 49 |
|
| 50 |
| Model | Params | WGTest F1 |
|