End of training
- README.md +40 -24
- model.safetensors +1 -1

README.md CHANGED
@@ -19,12 +19,12 @@ should probably proofread and complete it, then remove this comment. -->

 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.
-- Accuracy: 0.
-- Auc Score: 0.
-- F1: 0.
-- Precision: 0.
-- Recall: 0.

 ## Model description

@@ -44,8 +44,8 @@ More information needed

 The following hyperparameters were used during training:
 - learning_rate: 3e-05
-- train_batch_size:
-- eval_batch_size:
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
@@ -55,22 +55,38 @@ The following hyperparameters were used during training:

 | Training Loss | Epoch | Step | Validation Loss | Accuracy | Auc Score | F1 | Precision | Recall |
 |:-------------:|:------:|:----:|:---------------:|:--------:|:---------:|:------:|:---------:|:------:|
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.


 ### Framework versions

 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.7687
+- Accuracy: 0.8017
+- Auc Score: 0.8728
+- F1: 0.8298
+- Precision: 0.8074
+- Recall: 0.8534

 ## Model description


 The following hyperparameters were used during training:
 - learning_rate: 3e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear

 | Training Loss | Epoch | Step | Validation Loss | Accuracy | Auc Score | F1 | Precision | Recall |
 |:-------------:|:------:|:----:|:---------------:|:--------:|:---------:|:------:|:---------:|:------:|
+| 0.6445 | 0.0923 | 100 | 0.5441 | 0.7435 | 0.8052 | 0.7751 | 0.7701 | 0.7801 |
+| 0.5767 | 0.1845 | 200 | 0.5260 | 0.7555 | 0.8345 | 0.7721 | 0.8179 | 0.7313 |
+| 0.5126 | 0.2768 | 300 | 0.5090 | 0.7629 | 0.8450 | 0.8068 | 0.7493 | 0.8738 |
+| 0.4723 | 0.3690 | 400 | 0.5557 | 0.7417 | 0.8505 | 0.7363 | 0.8728 | 0.6368 |
+| 0.511 | 0.4613 | 500 | 0.4766 | 0.7823 | 0.8525 | 0.8106 | 0.7991 | 0.8225 |
+| 0.5082 | 0.5535 | 600 | 0.4947 | 0.7915 | 0.8565 | 0.8239 | 0.7900 | 0.8607 |
+| 0.4494 | 0.6458 | 700 | 0.4976 | 0.7763 | 0.8560 | 0.8032 | 0.8003 | 0.8062 |
+| 0.4816 | 0.7380 | 800 | 0.4648 | 0.7827 | 0.8624 | 0.8111 | 0.7992 | 0.8233 |
+| 0.4665 | 0.8303 | 900 | 0.4649 | 0.7887 | 0.8656 | 0.8200 | 0.7926 | 0.8493 |
+| 0.5226 | 0.9225 | 1000 | 0.4537 | 0.7929 | 0.8666 | 0.8158 | 0.8222 | 0.8094 |
+| 0.4643 | 1.0148 | 1100 | 0.4747 | 0.7998 | 0.8676 | 0.8287 | 0.8040 | 0.8550 |
+| 0.3617 | 1.1070 | 1200 | 0.5402 | 0.7943 | 0.8668 | 0.8213 | 0.8084 | 0.8347 |
+| 0.3439 | 1.1993 | 1300 | 0.5924 | 0.7966 | 0.8703 | 0.8267 | 0.7988 | 0.8567 |
+| 0.3482 | 1.2915 | 1400 | 0.5369 | 0.8003 | 0.8681 | 0.8287 | 0.8060 | 0.8526 |
+| 0.3855 | 1.3838 | 1500 | 0.5213 | 0.7966 | 0.8702 | 0.8205 | 0.8202 | 0.8208 |
+| 0.335 | 1.4760 | 1600 | 0.5387 | 0.7929 | 0.8702 | 0.8176 | 0.8159 | 0.8192 |
+| 0.382 | 1.5683 | 1700 | 0.5267 | 0.7924 | 0.8710 | 0.8109 | 0.8377 | 0.7858 |
+| 0.341 | 1.6605 | 1800 | 0.6565 | 0.7957 | 0.8722 | 0.8293 | 0.7871 | 0.8762 |
+| 0.3492 | 1.7528 | 1900 | 0.5635 | 0.7957 | 0.8725 | 0.8298 | 0.7855 | 0.8795 |
+| 0.3861 | 1.8450 | 2000 | 0.5204 | 0.7998 | 0.8752 | 0.8281 | 0.8063 | 0.8510 |
+| 0.3451 | 1.9373 | 2100 | 0.5854 | 0.7984 | 0.8757 | 0.8316 | 0.7893 | 0.8787 |
+| 0.2915 | 2.0295 | 2200 | 0.6308 | 0.8021 | 0.8744 | 0.8354 | 0.7897 | 0.8868 |
+| 0.2264 | 2.1218 | 2300 | 0.7711 | 0.7984 | 0.8741 | 0.8234 | 0.8172 | 0.8298 |
+| 0.244 | 2.2140 | 2400 | 0.7302 | 0.8030 | 0.8742 | 0.8346 | 0.7960 | 0.8770 |
+| 0.2477 | 2.3063 | 2500 | 0.8263 | 0.7915 | 0.8721 | 0.8154 | 0.8180 | 0.8127 |
+| 0.2356 | 2.3985 | 2600 | 0.8275 | 0.7980 | 0.8734 | 0.8301 | 0.7926 | 0.8713 |
+| 0.2122 | 2.4908 | 2700 | 0.8132 | 0.7980 | 0.8723 | 0.8234 | 0.8155 | 0.8314 |
+| 0.2443 | 2.5830 | 2800 | 0.7874 | 0.8007 | 0.8728 | 0.8269 | 0.8139 | 0.8404 |
+| 0.2275 | 2.6753 | 2900 | 0.7503 | 0.8003 | 0.8738 | 0.8322 | 0.7938 | 0.8746 |
+| 0.2476 | 2.7675 | 3000 | 0.7822 | 0.7957 | 0.8731 | 0.8206 | 0.8163 | 0.8249 |
+| 0.1961 | 2.8598 | 3100 | 0.7780 | 0.8021 | 0.8731 | 0.8304 | 0.8071 | 0.8550 |
+| 0.2536 | 2.9520 | 3200 | 0.7687 | 0.8017 | 0.8728 | 0.8298 | 0.8074 | 0.8534 |
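
As a quick consistency check on the final row, F1 should equal the harmonic mean of precision and recall. The sketch below recomputes it from the reported values. The helper name `f1_from_pr` is illustrative and not part of the training code, and the check assumes the card reports the standard binary-classification definitions, which this diff does not state.

```python
def f1_from_pr(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (standard F1 definition)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values copied from the final evaluation row (step 3200).
final_precision = 0.8074
final_recall = 0.8534
reported_f1 = 0.8298

recomputed_f1 = f1_from_pr(final_precision, final_recall)

# The inputs are rounded to 4 decimals, so compare with a small tolerance.
f1_matches = abs(recomputed_f1 - reported_f1) < 5e-4
print(f"recomputed F1 = {recomputed_f1:.4f}, matches reported: {f1_matches}")
```

The same check can be run on any other row of the table by swapping in that row's precision, recall, and F1.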


 ### Framework versions
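
The Epoch and Step columns together imply how many optimizer steps make up one epoch. A rough sketch of that arithmetic follows. The implied example count additionally assumes a single device and no gradient accumulation, neither of which this diff confirms, so treat it as an upper-bound estimate rather than the dataset size.

```python
# (step, epoch) pairs copied from the results table.
rows = [(100, 0.0923), (1100, 1.0148), (2200, 2.0295), (3200, 2.9520)]

# Each row yields an estimate of steps per epoch; epochs are rounded to 4 decimals,
# so early rows carry a larger relative rounding error than late ones.
estimates = [step / epoch for step, epoch in rows]
steps_per_epoch = round(estimates[-1])

train_batch_size = 8  # from the hyperparameter list in this commit

# Assumption: one device and no gradient accumulation, so one step consumes one batch.
approx_train_examples = steps_per_epoch * train_batch_size

print("steps per epoch:", steps_per_epoch)
print("approximate training examples (upper bound):", approx_train_examples)
```

Under these assumptions the table is consistent with 1084 steps per epoch, so the last logged row (step 3200, epoch 2.9520) sits just short of the end of a third epoch.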
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
 size 442499672

 version https://git-lfs.github.com/spec/v1
+oid sha256:eec3942a8b3e347f241ddaf9f6e849241670c90ac0ee0131167ddb72a637a871
 size 442499672
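
A Git LFS pointer records the SHA-256 digest and byte size of the real file, so a downloaded copy of the weights can be checked against the new pointer. The following is a minimal sketch using only the standard library. The local path `model.safetensors` and the helper names `sha256_of_file` and `matches_pointer` are assumptions for illustration, not part of this repository.

```python
import hashlib
from pathlib import Path

# Values copied from the new Git LFS pointer in this commit.
EXPECTED_OID = "eec3942a8b3e347f241ddaf9f6e849241670c90ac0ee0131167ddb72a637a871"
EXPECTED_SIZE = 442499672


def sha256_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash the file in chunks so large weights are never fully loaded into memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_pointer(path: Path, oid: str = EXPECTED_OID, size: int = EXPECTED_SIZE) -> bool:
    """Return True when both the byte size and the SHA-256 digest match the pointer."""
    return path.stat().st_size == size and sha256_of_file(path) == oid


if __name__ == "__main__":
    weights = Path("model.safetensors")  # hypothetical local path to the downloaded file
    if weights.exists():
        print("pointer match:", matches_pointer(weights))
    else:
        print("model.safetensors not found; download it first")
```

Checking the size first is a cheap way to reject an incomplete download before paying for the full hash.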