dtorber commited on
Commit
71bcdb2
·
verified ·
1 Parent(s): a205f44

Training in progress, epoch 1

Browse files
README.md CHANGED
@@ -18,14 +18,14 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [google-bert/bert-base-cased](https://huggingface.co/google-bert/bert-base-cased) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.5503
22
- - F1 Macro: 0.9046
23
- - F1: 0.9320
24
- - F1 Neg: 0.8772
25
  - Acc: 0.9125
26
- - Prec: 0.9266
27
- - Recall: 0.9375
28
- - Mcc: 0.8094
29
 
30
  ## Model description
31
 
@@ -51,18 +51,16 @@ The following hyperparameters were used during training:
51
  - distributed_type: multi-GPU
52
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
53
  - lr_scheduler_type: linear
54
- - num_epochs: 5
55
  - mixed_precision_training: Native AMP
56
 
57
  ### Training results
58
 
59
  | Training Loss | Epoch | Step | Validation Loss | F1 Macro | F1 | F1 Neg | Acc | Prec | Recall | Mcc |
60
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:------:|:------:|:------:|:------:|:------:|
61
- | No log | 1.0 | 455 | 0.3613 | 0.8215 | 0.8706 | 0.7724 | 0.835 | 0.8740 | 0.8672 | 0.6430 |
62
- | 0.4501 | 2.0 | 910 | 0.3824 | 0.8791 | 0.9193 | 0.8390 | 0.8925 | 0.8845 | 0.9570 | 0.7643 |
63
- | 0.2737 | 3.0 | 1365 | 0.4971 | 0.8770 | 0.9084 | 0.8456 | 0.885 | 0.9268 | 0.8906 | 0.7552 |
64
- | 0.1573 | 4.0 | 1820 | 0.5008 | 0.8912 | 0.9222 | 0.8601 | 0.9 | 0.9186 | 0.9258 | 0.7824 |
65
- | 0.0839 | 5.0 | 2275 | 0.5503 | 0.9046 | 0.9320 | 0.8772 | 0.9125 | 0.9266 | 0.9375 | 0.8094 |
66
 
67
 
68
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [google-bert/bert-base-cased](https://huggingface.co/google-bert/bert-base-cased) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.3765
22
+ - F1 Macro: 0.8988
23
+ - F1: 0.9360
24
+ - F1 Neg: 0.8617
25
  - Acc: 0.9125
26
+ - Prec: 0.9209
27
+ - Recall: 0.9517
28
+ - Mcc: 0.7989
29
 
30
  ## Model description
31
 
 
51
  - distributed_type: multi-GPU
52
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
53
  - lr_scheduler_type: linear
54
+ - num_epochs: 3
55
  - mixed_precision_training: Native AMP
56
 
57
  ### Training results
58
 
59
  | Training Loss | Epoch | Step | Validation Loss | F1 Macro | F1 | F1 Neg | Acc | Prec | Recall | Mcc |
60
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:------:|:------:|:------:|:------:|:------:|
61
+ | 0.5022 | 1.0 | 824 | 0.4717 | 0.8752 | 0.9102 | 0.8403 | 0.885 | 0.9102 | 0.9102 | 0.7504 |
62
+ | 0.3452 | 2.0 | 1648 | 0.4330 | 0.8882 | 0.9245 | 0.8519 | 0.9 | 0.8942 | 0.9570 | 0.7808 |
63
+ | 0.2232 | 3.0 | 2472 | 0.5604 | 0.8652 | 0.9060 | 0.8244 | 0.8775 | 0.8906 | 0.9219 | 0.7314 |
 
 
64
 
65
 
66
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:54e740b8d61ce8e6df7825efeb2cf879dbcd8e7ef72ffe51c7db2f7125ecbb04
3
  size 433270768
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a5010a7ebc991cb5540f5537704d0246a084a6943d5d2b9654c757674caeefcf
3
  size 433270768
runs/Apr06_15-14-56_tardis/events.out.tfevents.1712409303.tardis.11888.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fce0540a7f103482f087131f840727d35bad7bf8e23e408c60bd9f8c3d508b1f
3
+ size 5519
runs/Apr06_15-16-57_tardis/events.out.tfevents.1712409424.tardis.12350.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6a4b8ecbc1aeeb32e9790feaea59489850ec4139ad151bfcf0d1d0f5398203a6
3
+ size 5519
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e86ef5a2a757cedf3f24587f4954e9d81b3575500e6c2449c0bf2dfb9586b589
3
  size 5048
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:452dad6e205df155e2d9980ef6985f684ee6775fbdc72a11aee59bfda2acd8aa
3
  size 5048