Training in progress, epoch 1

Files changed (5)

README.md +11 -13
model.safetensors +1 -1
runs/Apr06_15-14-56_tardis/events.out.tfevents.1712409303.tardis.11888.0 +3 -0
runs/Apr06_15-16-57_tardis/events.out.tfevents.1712409424.tardis.12350.0 +3 -0
training_args.bin +1 -1

README.md CHANGED

@@ -18,14 +18,14 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google-bert/bert-base-cased](https://huggingface.co/google-bert/bert-base-cased) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5503
-- F1 Macro: 0.9046
-- F1: 0.9320
-- F1 Neg: 0.8772
 - Acc: 0.9125
-- Prec: 0.9266
-- Recall: 0.9375
-- Mcc: 0.8094
 ## Model description
@@ -51,18 +51,16 @@ The following hyperparameters were used during training:
 - distributed_type: multi-GPU
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 5
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | F1 Macro | F1     | F1 Neg | Acc    | Prec   | Recall | Mcc    |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:------:|:------:|:------:|:------:|:------:|
-| No log        | 1.0   | 455  | 0.3613          | 0.8215   | 0.8706 | 0.7724 | 0.835  | 0.8740 | 0.8672 | 0.6430 |
-| 0.4501        | 2.0   | 910  | 0.3824          | 0.8791   | 0.9193 | 0.8390 | 0.8925 | 0.8845 | 0.9570 | 0.7643 |
-| 0.2737        | 3.0   | 1365 | 0.4971          | 0.8770   | 0.9084 | 0.8456 | 0.885  | 0.9268 | 0.8906 | 0.7552 |
-| 0.1573        | 4.0   | 1820 | 0.5008          | 0.8912   | 0.9222 | 0.8601 | 0.9    | 0.9186 | 0.9258 | 0.7824 |
-| 0.0839        | 5.0   | 2275 | 0.5503          | 0.9046   | 0.9320 | 0.8772 | 0.9125 | 0.9266 | 0.9375 | 0.8094 |
 ### Framework versions

 This model is a fine-tuned version of [google-bert/bert-base-cased](https://huggingface.co/google-bert/bert-base-cased) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3765
+- F1 Macro: 0.8988
+- F1: 0.9360
+- F1 Neg: 0.8617
 - Acc: 0.9125
+- Prec: 0.9209
+- Recall: 0.9517
+- Mcc: 0.7989
 ## Model description
 - distributed_type: multi-GPU
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | F1 Macro | F1     | F1 Neg | Acc    | Prec   | Recall | Mcc    |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:------:|:------:|:------:|:------:|:------:|
+| 0.5022        | 1.0   | 824  | 0.4717          | 0.8752   | 0.9102 | 0.8403 | 0.885  | 0.9102 | 0.9102 | 0.7504 |
+| 0.3452        | 2.0   | 1648 | 0.4330          | 0.8882   | 0.9245 | 0.8519 | 0.9    | 0.8942 | 0.9570 | 0.7808 |
+| 0.2232        | 3.0   | 2472 | 0.5604          | 0.8652   | 0.9060 | 0.8244 | 0.8775 | 0.8906 | 0.9219 | 0.7314 |
 ### Framework versions
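
The metric columns in the results table appear to be related to one another. Assuming `F1` is the positive-class F1 and `F1 Neg` the negative-class F1 (the README does not state this), `F1` should be the harmonic mean of `Prec` and `Recall`, and `F1 Macro` the unweighted mean of the two per-class F1 scores. A minimal sketch that checks this on rows copied from the diff; the helper names are hypothetical:

```python
def f1_from(prec: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * prec * recall / (prec + recall)


def macro_f1(f1_pos: float, f1_neg: float) -> float:
    """Unweighted mean of the per-class F1 scores (binary case)."""
    return (f1_pos + f1_neg) / 2


# Rows copied from the diff: the removed epoch-5 row and the added epoch-3 row.
rows = [
    {"f1_macro": 0.9046, "f1": 0.9320, "f1_neg": 0.8772, "prec": 0.9266, "recall": 0.9375},
    {"f1_macro": 0.8652, "f1": 0.9060, "f1_neg": 0.8244, "prec": 0.8906, "recall": 0.9219},
]

# The table is rounded to four decimals, so compare with a small tolerance.
TOL = 1e-3
for row in rows:
    assert abs(f1_from(row["prec"], row["recall"]) - row["f1"]) < TOL
    assert abs(macro_f1(row["f1"], row["f1_neg"]) - row["f1_macro"]) < TOL
```

Both rows pass under these assumptions, which suggests the column interpretation is at least consistent with the reported numbers.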

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:54e740b8d61ce8e6df7825efeb2cf879dbcd8e7ef72ffe51c7db2f7125ecbb04
 size 433270768

 version https://git-lfs.github.com/spec/v1
+oid sha256:a5010a7ebc991cb5540f5537704d0246a084a6943d5d2b9654c757674caeefcf
 size 433270768
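
The binary files in this commit are stored as Git LFS pointers: three `key value` lines giving the spec version, the object ID, and the size in bytes. Only the `oid` changes here, so the weights differ while the file size stays the same. A minimal sketch of reading such a pointer, assuming the three-line layout shown in this diff; `parse_lfs_pointer` is a hypothetical helper, not part of any Git LFS tooling:

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its fields."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", separated by the first space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer text copied from the added side of the model.safetensors diff.
pointer = """\
version https://git-lfs.github.com/spec/v1
oid sha256:a5010a7ebc991cb5540f5537704d0246a084a6943d5d2b9654c757674caeefcf
size 433270768
"""

info = parse_lfs_pointer(pointer)
print(info["algo"], info["size"])
```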

runs/Apr06_15-14-56_tardis/events.out.tfevents.1712409303.tardis.11888.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fce0540a7f103482f087131f840727d35bad7bf8e23e408c60bd9f8c3d508b1f
+size 5519

runs/Apr06_15-16-57_tardis/events.out.tfevents.1712409424.tardis.12350.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6a4b8ecbc1aeeb32e9790feaea59489850ec4139ad151bfcf0d1d0f5398203a6
+size 5519

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e86ef5a2a757cedf3f24587f4954e9d81b3575500e6c2449c0bf2dfb9586b589
 size 5048

 version https://git-lfs.github.com/spec/v1
+oid sha256:452dad6e205df155e2d9980ef6985f684ee6775fbdc72a11aee59bfda2acd8aa
 size 5048