Update README.md
README.md
CHANGED
@@ -40,15 +40,14 @@ We thank the following projects for providing the training data:
 We use Weights & Biases for hyperparameter optimization with a random search strategy (10 folds), aiming to maximize the evaluation F1 score (eval_f1).
 
 The search space includes:
-
-
-
-
-
-
-
-
-Number of Training Epochs: 5 \
+- Learning Rate: Sampled uniformly between 1e-6 and 1e-4
+- Weight Decay: One of [0.1, 0.01, 0.001]
+- Number of Training Epochs: One of [3, 4, 5, 6]
+
+For the final training of this model, the hyperparameters were:
+- Learning Rate: 9.889410158465026e-05
+- Weight Decay: 0.1
+- Number of Training Epochs: 5
 
 
 
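
The search space added in this change could be written as a Weights & Biases sweep definition roughly as follows. This is a minimal sketch, not the configuration actually used: the parameter keys (`learning_rate`, `weight_decay`, `num_train_epochs`) and the helper `sample_trial` are assumed names chosen for illustration, since the README only gives display labels. The helper draws configurations locally so the example runs without a W&B account.

```python
import random

# Sweep definition in the dictionary format accepted by W&B sweeps.
# The parameter keys are assumptions; the README only lists display labels.
sweep_config = {
    "method": "random",
    "metric": {"name": "eval_f1", "goal": "maximize"},
    "parameters": {
        "learning_rate": {"distribution": "uniform", "min": 1e-6, "max": 1e-4},
        "weight_decay": {"values": [0.1, 0.01, 0.001]},
        "num_train_epochs": {"values": [3, 4, 5, 6]},
    },
}


def sample_trial(config, rng):
    """Draw one configuration as a random search would (local illustration only)."""
    trial = {}
    for name, spec in config["parameters"].items():
        if "values" in spec:
            # Categorical parameter: pick one of the listed values.
            trial[name] = rng.choice(spec["values"])
        else:
            # Continuous parameter: sample uniformly between min and max.
            trial[name] = rng.uniform(spec["min"], spec["max"])
    return trial


if __name__ == "__main__":
    rng = random.Random(0)
    for _ in range(3):
        print(sample_trial(sweep_config, rng))

# With the wandb package installed and a logged-in account, the same dictionary
# could be registered with wandb.sweep(sweep_config, project="<your-project>")
# and executed with wandb.agent(sweep_id, function=train, count=10), where
# `train` is a hypothetical training function and count=10 assumes that
# "10 folds" in the README refers to 10 sweep runs.
```

The final values reported above (learning rate 9.889410158465026e-05, weight decay 0.1, 5 epochs) all fall inside this search space.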