BRlkl/BingoGuard-bert-base-pt-optimized

BRlkl committed on Sep 29, 2025

Commit bde0da2 (verified) · 1 parent: e3b03b6

Upload benchmark_results.md with huggingface_hub

Files changed (1)

benchmark_results.md

@@ -6,9 +6,9 @@ The best performing model was chosen based on the highest average F1 score acros
 ## Best Hyperparameters
 ```
-learning_rate: 1e-05
-per_device_train_batch_size: 8
-num_train_epochs: 4
+learning_rate: 5e-05
+per_device_train_batch_size: 32
+num_train_epochs: 8
 weight_decay: 0.05
 lr_scheduler_type: cosine
 ```
@@ -17,8 +17,8 @@ lr_scheduler_type: cosine
 | dataset                           |   accuracy |   f1_score |   recall |   precision |
 |:----------------------------------|-----------:|-----------:|---------:|------------:|
-| BRlkl/BingoGuard-train-test-pt    |   0.902834 |   0.948936 | 0.902834 |    1        |
-| BRlkl/openai-moderation-eval-pt   |   0.692262 |   0.651382 | 0.925287 |    0.502601 |
-| BRlkl/WildGuardTest-pt            |   0.839906 |   0.814714 | 0.793103 |    0.837535 |
-| BRlkl/XSTest-pt                   |   0.824444 |   0.812352 | 0.855    |    0.773756 |
-| BRlkl/toxic-chat-pt (40% holdout) |   0.969027 |   0.814159 | 0.907895 |    0.737968 |
+| BRlkl/BingoGuard-train-test-pt    |   0.897773 |   0.946133 | 0.897773 |    1        |
+| BRlkl/openai-moderation-eval-pt   |   0.69881  |   0.651994 | 0.908046 |    0.508584 |
+| BRlkl/WildGuardTest-pt            |   0.839317 |   0.810021 | 0.771883 |    0.852123 |
+| BRlkl/XSTest-pt                   |   0.831111 |   0.824885 | 0.895    |    0.764957 |
+| BRlkl/toxic-chat-pt (40% holdout) |   0.97296  |   0.836795 | 0.927632 |    0.762162 |
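
The hunk context says the best model was chosen by highest average F1, so a quick sanity check on the changed rows may help when reviewing this commit. The sketch below recomputes F1 from the reported precision and recall and compares the mean F1 before and after. It assumes the average is an unweighted mean over the five listed datasets (the source sentence is truncated, so this is a guess), and the helper names `f1_from` and `mean` are illustrative, not part of the repository.

```python
# Values copied from the benchmark_results.md diff.
# New rows: dataset -> (precision, recall, reported f1_score).
new_rows = {
    "BRlkl/BingoGuard-train-test-pt": (1.0, 0.897773, 0.946133),
    "BRlkl/openai-moderation-eval-pt": (0.508584, 0.908046, 0.651994),
    "BRlkl/WildGuardTest-pt": (0.852123, 0.771883, 0.810021),
    "BRlkl/XSTest-pt": (0.764957, 0.895, 0.824885),
    "BRlkl/toxic-chat-pt (40% holdout)": (0.762162, 0.927632, 0.836795),
}
# Old rows: reported f1_score only, in the same dataset order.
old_f1 = [0.948936, 0.651382, 0.814714, 0.812352, 0.814159]


def f1_from(precision, recall):
    """F1 as the harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


def mean(values):
    """Unweighted arithmetic mean (assumed averaging scheme)."""
    values = list(values)
    return sum(values) / len(values)


# Recompute F1 per dataset from the reported precision and recall.
recomputed = {name: f1_from(p, r) for name, (p, r, _) in new_rows.items()}

old_mean_f1 = mean(old_f1)
new_mean_f1 = mean(f1 for _, _, f1 in new_rows.values())

if __name__ == "__main__":
    for name, (_, _, reported) in new_rows.items():
        print(f"{name}: reported={reported:.6f} recomputed={recomputed[name]:.6f}")
    print(f"mean F1 before={old_mean_f1:.4f} after={new_mean_f1:.4f}")
```

Under that assumption, the new hyperparameters give a slightly higher mean F1 even though F1 drops on two of the five datasets (BingoGuard-train-test-pt and WildGuardTest-pt).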