BRlkl committed on
Commit bde0da2 · verified · 1 Parent(s): e3b03b6

Upload benchmark_results.md with huggingface_hub

Files changed (1)
  1. benchmark_results.md +8 -8
benchmark_results.md CHANGED
@@ -6,9 +6,9 @@ The best performing model was chosen based on the highest average F1 score acros
  ## Best Hyperparameters
 
  ```
- learning_rate: 1e-05
- per_device_train_batch_size: 8
- num_train_epochs: 4
+ learning_rate: 5e-05
+ per_device_train_batch_size: 32
+ num_train_epochs: 8
  weight_decay: 0.05
  lr_scheduler_type: cosine
  ```
@@ -17,8 +17,8 @@ lr_scheduler_type: cosine
 
  | dataset | accuracy | f1_score | recall | precision |
  |:----------------------------------|-----------:|-----------:|---------:|------------:|
- | BRlkl/BingoGuard-train-test-pt | 0.902834 | 0.948936 | 0.902834 | 1 |
- | BRlkl/openai-moderation-eval-pt | 0.692262 | 0.651382 | 0.925287 | 0.502601 |
- | BRlkl/WildGuardTest-pt | 0.839906 | 0.814714 | 0.793103 | 0.837535 |
- | BRlkl/XSTest-pt | 0.824444 | 0.812352 | 0.855 | 0.773756 |
- | BRlkl/toxic-chat-pt (40% holdout) | 0.969027 | 0.814159 | 0.907895 | 0.737968 |
+ | BRlkl/BingoGuard-train-test-pt | 0.897773 | 0.946133 | 0.897773 | 1 |
+ | BRlkl/openai-moderation-eval-pt | 0.69881 | 0.651994 | 0.908046 | 0.508584 |
+ | BRlkl/WildGuardTest-pt | 0.839317 | 0.810021 | 0.771883 | 0.852123 |
+ | BRlkl/XSTest-pt | 0.831111 | 0.824885 | 0.895 | 0.764957 |
+ | BRlkl/toxic-chat-pt (40% holdout) | 0.97296 | 0.836795 | 0.927632 | 0.762162 |
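
The hunk header says the best model was chosen by the highest average F1 score, though the sentence is cut off in the diff. As a rough sanity check, the sketch below compares the mean F1 of the removed and added table rows and recomputes F1 from the reported precision and recall. It assumes an unweighted mean over the five listed datasets, which the truncated text does not confirm. The names `F1_BEFORE`, `F1_AFTER`, `PR_AFTER`, and `f1_from` are illustrative and do not come from the repository.

```python
from statistics import mean

# F1 scores copied from the removed (-) rows of the results table.
F1_BEFORE = {
    "BRlkl/BingoGuard-train-test-pt": 0.948936,
    "BRlkl/openai-moderation-eval-pt": 0.651382,
    "BRlkl/WildGuardTest-pt": 0.814714,
    "BRlkl/XSTest-pt": 0.812352,
    "BRlkl/toxic-chat-pt (40% holdout)": 0.814159,
}

# F1 scores copied from the added (+) rows of the results table.
F1_AFTER = {
    "BRlkl/BingoGuard-train-test-pt": 0.946133,
    "BRlkl/openai-moderation-eval-pt": 0.651994,
    "BRlkl/WildGuardTest-pt": 0.810021,
    "BRlkl/XSTest-pt": 0.824885,
    "BRlkl/toxic-chat-pt (40% holdout)": 0.836795,
}

# (precision, recall) pairs copied from the added (+) rows.
PR_AFTER = {
    "BRlkl/BingoGuard-train-test-pt": (1.0, 0.897773),
    "BRlkl/openai-moderation-eval-pt": (0.508584, 0.908046),
    "BRlkl/WildGuardTest-pt": (0.852123, 0.771883),
    "BRlkl/XSTest-pt": (0.764957, 0.895),
    "BRlkl/toxic-chat-pt (40% holdout)": (0.762162, 0.927632),
}


def f1_from(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (0.0 if both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Assumption: the selection criterion is an unweighted mean over datasets.
mean_before = mean(F1_BEFORE.values())
mean_after = mean(F1_AFTER.values())
print(f"mean F1 before: {mean_before:.6f}")
print(f"mean F1 after:  {mean_after:.6f}")

# Compare each reported F1 with the value implied by precision and recall.
for name, (precision, recall) in PR_AFTER.items():
    implied = f1_from(precision, recall)
    print(f"{name}: reported={F1_AFTER[name]:.6f} implied={implied:.6f}")
```

Under that assumption the mean F1 moves from roughly 0.8083 to roughly 0.8140, so the new configuration scores higher on average even though F1 drops slightly on BingoGuard and WildGuardTest.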