alex-miller commited on
Commit
3f28658
·
verified ·
1 Parent(s): b815dc4

End of training

Browse files
README.md CHANGED
@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
20
 
21
  This model is a fine-tuned version of [alex-miller/ODABert](https://huggingface.co/alex-miller/ODABert) on the None dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 0.3893
24
- - Accuracy: 0.8416
25
- - F1: 0.8571
26
- - Precision: 0.9057
27
- - Recall: 0.8136
28
 
29
  ## Model description
30
 
@@ -43,38 +43,28 @@ More information needed
43
  ### Training hyperparameters
44
 
45
  The following hyperparameters were used during training:
46
- - learning_rate: 5e-06
47
  - train_batch_size: 64
48
  - eval_batch_size: 64
49
  - seed: 42
50
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
  - lr_scheduler_type: linear
52
- - num_epochs: 20
53
 
54
  ### Training results
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
57
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
58
- | 0.6798 | 1.0 | 7 | 0.6701 | 0.5941 | 0.6435 | 0.6607 | 0.6271 |
59
- | 0.6318 | 2.0 | 14 | 0.6479 | 0.6337 | 0.7299 | 0.6410 | 0.8475 |
60
- | 0.6002 | 3.0 | 21 | 0.6237 | 0.6139 | 0.7068 | 0.6351 | 0.7966 |
61
- | 0.5562 | 4.0 | 28 | 0.5941 | 0.6634 | 0.7167 | 0.7049 | 0.7288 |
62
- | 0.5005 | 5.0 | 35 | 0.5547 | 0.7129 | 0.7434 | 0.7778 | 0.7119 |
63
- | 0.4412 | 6.0 | 42 | 0.5162 | 0.7525 | 0.7788 | 0.8148 | 0.7458 |
64
- | 0.3961 | 7.0 | 49 | 0.4838 | 0.7723 | 0.7965 | 0.8333 | 0.7627 |
65
- | 0.3571 | 8.0 | 56 | 0.4575 | 0.7822 | 0.8036 | 0.8491 | 0.7627 |
66
- | 0.3317 | 9.0 | 63 | 0.4378 | 0.8020 | 0.8214 | 0.8679 | 0.7797 |
67
- | 0.2796 | 10.0 | 70 | 0.4263 | 0.8119 | 0.8319 | 0.8704 | 0.7966 |
68
- | 0.2554 | 11.0 | 77 | 0.4187 | 0.8218 | 0.8421 | 0.8727 | 0.8136 |
69
- | 0.2429 | 12.0 | 84 | 0.4074 | 0.8218 | 0.8421 | 0.8727 | 0.8136 |
70
- | 0.2222 | 13.0 | 91 | 0.3992 | 0.8416 | 0.8571 | 0.9057 | 0.8136 |
71
- | 0.2089 | 14.0 | 98 | 0.3948 | 0.8515 | 0.8649 | 0.9231 | 0.8136 |
72
- | 0.2055 | 15.0 | 105 | 0.3947 | 0.8515 | 0.8649 | 0.9231 | 0.8136 |
73
- | 0.2121 | 16.0 | 112 | 0.3949 | 0.8515 | 0.8673 | 0.9074 | 0.8305 |
74
- | 0.1739 | 17.0 | 119 | 0.3917 | 0.8416 | 0.8571 | 0.9057 | 0.8136 |
75
- | 0.1768 | 18.0 | 126 | 0.3896 | 0.8416 | 0.8571 | 0.9057 | 0.8136 |
76
- | 0.1778 | 19.0 | 133 | 0.3894 | 0.8416 | 0.8571 | 0.9057 | 0.8136 |
77
- | 0.1659 | 20.0 | 140 | 0.3893 | 0.8416 | 0.8571 | 0.9057 | 0.8136 |
78
 
79
 
80
  ### Framework versions
 
20
 
21
  This model is a fine-tuned version of [alex-miller/ODABert](https://huggingface.co/alex-miller/ODABert) on the None dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 0.2909
24
+ - Accuracy: 0.88
25
+ - F1: 0.8966
26
+ - Precision: 0.9123
27
+ - Recall: 0.8814
28
 
29
  ## Model description
30
 
 
43
  ### Training hyperparameters
44
 
45
  The following hyperparameters were used during training:
46
+ - learning_rate: 1e-05
47
  - train_batch_size: 64
48
  - eval_batch_size: 64
49
  - seed: 42
50
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
  - lr_scheduler_type: linear
52
+ - num_epochs: 10
53
 
54
  ### Training results
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
57
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
58
+ | 0.6654 | 1.0 | 7 | 0.6058 | 0.75 | 0.7899 | 0.7833 | 0.7966 |
59
+ | 0.5859 | 2.0 | 14 | 0.5028 | 0.8 | 0.8182 | 0.8824 | 0.7627 |
60
+ | 0.4887 | 3.0 | 21 | 0.4160 | 0.81 | 0.8257 | 0.9 | 0.7627 |
61
+ | 0.3762 | 4.0 | 28 | 0.3439 | 0.86 | 0.8772 | 0.9091 | 0.8475 |
62
+ | 0.3176 | 5.0 | 35 | 0.3046 | 0.88 | 0.8947 | 0.9273 | 0.8644 |
63
+ | 0.2659 | 6.0 | 42 | 0.2937 | 0.88 | 0.8947 | 0.9273 | 0.8644 |
64
+ | 0.2592 | 7.0 | 49 | 0.2940 | 0.87 | 0.8889 | 0.8966 | 0.8814 |
65
+ | 0.213 | 8.0 | 56 | 0.2920 | 0.87 | 0.8889 | 0.8966 | 0.8814 |
66
+ | 0.1946 | 9.0 | 63 | 0.2899 | 0.88 | 0.8947 | 0.9273 | 0.8644 |
67
+ | 0.2042 | 10.0 | 70 | 0.2909 | 0.88 | 0.8966 | 0.9123 | 0.8814 |
 
 
 
 
 
 
 
 
 
 
68
 
69
 
70
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:376998c19423f16cc057dcdd3e0a3a0875418ec15e5ed67a6a7e7d13f3fab166
3
  size 672708608
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:51e4ab5b823d7fa281f9af1439beeafd0961806f6088cfe43cc2697baa488145
3
  size 672708608
runs/Aug05_16-20-44_49d74c1f1623/events.out.tfevents.1722874845.49d74c1f1623.226.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2368234ae760c5609dc92b27d819e23ced30d368e15725bac2345c7056d209a0
3
+ size 12209
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7daf07aae547a8fbf4afca4b0d03a64235ae4937d0c18d184da65cf77639b2cf
3
  size 5112
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:140e90b51eaa0f352864c1728f09e194c7d52d6f1948007d4a5b613793c3d08c
3
  size 5112