Training in progress, epoch 1
Browse files
README.md
CHANGED
|
@@ -18,14 +18,14 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 18 |
|
| 19 |
This model is a fine-tuned version of [google-bert/bert-base-cased](https://huggingface.co/google-bert/bert-base-cased) on the None dataset.
|
| 20 |
It achieves the following results on the evaluation set:
|
| 21 |
-
- Loss: 0.
|
| 22 |
-
- F1 Macro: 0.
|
| 23 |
-
- F1: 0.
|
| 24 |
-
- F1 Neg: 0.
|
| 25 |
- Acc: 0.9125
|
| 26 |
-
- Prec: 0.
|
| 27 |
-
- Recall: 0.
|
| 28 |
-
- Mcc: 0.
|
| 29 |
|
| 30 |
## Model description
|
| 31 |
|
|
@@ -51,18 +51,16 @@ The following hyperparameters were used during training:
|
|
| 51 |
- distributed_type: multi-GPU
|
| 52 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
| 53 |
- lr_scheduler_type: linear
|
| 54 |
-
- num_epochs:
|
| 55 |
- mixed_precision_training: Native AMP
|
| 56 |
|
| 57 |
### Training results
|
| 58 |
|
| 59 |
| Training Loss | Epoch | Step | Validation Loss | F1 Macro | F1 | F1 Neg | Acc | Prec | Recall | Mcc |
|
| 60 |
|:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:------:|:------:|:------:|:------:|:------:|
|
| 61 |
-
|
|
| 62 |
-
| 0.
|
| 63 |
-
| 0.
|
| 64 |
-
| 0.1573 | 4.0 | 1820 | 0.5008 | 0.8912 | 0.9222 | 0.8601 | 0.9 | 0.9186 | 0.9258 | 0.7824 |
|
| 65 |
-
| 0.0839 | 5.0 | 2275 | 0.5503 | 0.9046 | 0.9320 | 0.8772 | 0.9125 | 0.9266 | 0.9375 | 0.8094 |
|
| 66 |
|
| 67 |
|
| 68 |
### Framework versions
|
|
|
|
| 18 |
|
| 19 |
This model is a fine-tuned version of [google-bert/bert-base-cased](https://huggingface.co/google-bert/bert-base-cased) on the None dataset.
|
| 20 |
It achieves the following results on the evaluation set:
|
| 21 |
+
- Loss: 0.3765
|
| 22 |
+
- F1 Macro: 0.8988
|
| 23 |
+
- F1: 0.9360
|
| 24 |
+
- F1 Neg: 0.8617
|
| 25 |
- Acc: 0.9125
|
| 26 |
+
- Prec: 0.9209
|
| 27 |
+
- Recall: 0.9517
|
| 28 |
+
- Mcc: 0.7989
|
| 29 |
|
| 30 |
## Model description
|
| 31 |
|
|
|
|
| 51 |
- distributed_type: multi-GPU
|
| 52 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
| 53 |
- lr_scheduler_type: linear
|
| 54 |
+
- num_epochs: 3
|
| 55 |
- mixed_precision_training: Native AMP
|
| 56 |
|
| 57 |
### Training results
|
| 58 |
|
| 59 |
| Training Loss | Epoch | Step | Validation Loss | F1 Macro | F1 | F1 Neg | Acc | Prec | Recall | Mcc |
|
| 60 |
|:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:------:|:------:|:------:|:------:|:------:|
|
| 61 |
+
| 0.5022 | 1.0 | 824 | 0.4717 | 0.8752 | 0.9102 | 0.8403 | 0.885 | 0.9102 | 0.9102 | 0.7504 |
|
| 62 |
+
| 0.3452 | 2.0 | 1648 | 0.4330 | 0.8882 | 0.9245 | 0.8519 | 0.9 | 0.8942 | 0.9570 | 0.7808 |
|
| 63 |
+
| 0.2232 | 3.0 | 2472 | 0.5604 | 0.8652 | 0.9060 | 0.8244 | 0.8775 | 0.8906 | 0.9219 | 0.7314 |
|
|
|
|
|
|
|
| 64 |
|
| 65 |
|
| 66 |
### Framework versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 433270768
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a5010a7ebc991cb5540f5537704d0246a084a6943d5d2b9654c757674caeefcf
|
| 3 |
size 433270768
|
runs/Apr06_15-14-56_tardis/events.out.tfevents.1712409303.tardis.11888.0
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:fce0540a7f103482f087131f840727d35bad7bf8e23e408c60bd9f8c3d508b1f
|
| 3 |
+
size 5519
|
runs/Apr06_15-16-57_tardis/events.out.tfevents.1712409424.tardis.12350.0
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6a4b8ecbc1aeeb32e9790feaea59489850ec4139ad151bfcf0d1d0f5398203a6
|
| 3 |
+
size 5519
|
training_args.bin
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 5048
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:452dad6e205df155e2d9980ef6985f684ee6775fbdc72a11aee59bfda2acd8aa
|
| 3 |
size 5048
|