alex-miller commited on
Commit
4b4179d
·
verified ·
1 Parent(s): 3f28658

End of training

Browse files
README.md CHANGED
@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
20
 
21
  This model is a fine-tuned version of [alex-miller/ODABert](https://huggingface.co/alex-miller/ODABert) on the None dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 0.2909
24
- - Accuracy: 0.88
25
- - F1: 0.8966
26
- - Precision: 0.9123
27
- - Recall: 0.8814
28
 
29
  ## Model description
30
 
@@ -43,7 +43,7 @@ More information needed
43
  ### Training hyperparameters
44
 
45
  The following hyperparameters were used during training:
46
- - learning_rate: 1e-05
47
  - train_batch_size: 64
48
  - eval_batch_size: 64
49
  - seed: 42
@@ -55,16 +55,16 @@ The following hyperparameters were used during training:
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
57
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
58
- | 0.6654 | 1.0 | 7 | 0.6058 | 0.75 | 0.7899 | 0.7833 | 0.7966 |
59
- | 0.5859 | 2.0 | 14 | 0.5028 | 0.8 | 0.8182 | 0.8824 | 0.7627 |
60
- | 0.4887 | 3.0 | 21 | 0.4160 | 0.81 | 0.8257 | 0.9 | 0.7627 |
61
- | 0.3762 | 4.0 | 28 | 0.3439 | 0.86 | 0.8772 | 0.9091 | 0.8475 |
62
- | 0.3176 | 5.0 | 35 | 0.3046 | 0.88 | 0.8947 | 0.9273 | 0.8644 |
63
- | 0.2659 | 6.0 | 42 | 0.2937 | 0.88 | 0.8947 | 0.9273 | 0.8644 |
64
- | 0.2592 | 7.0 | 49 | 0.2940 | 0.87 | 0.8889 | 0.8966 | 0.8814 |
65
- | 0.213 | 8.0 | 56 | 0.2920 | 0.87 | 0.8889 | 0.8966 | 0.8814 |
66
- | 0.1946 | 9.0 | 63 | 0.2899 | 0.88 | 0.8947 | 0.9273 | 0.8644 |
67
- | 0.2042 | 10.0 | 70 | 0.2909 | 0.88 | 0.8966 | 0.9123 | 0.8814 |
68
 
69
 
70
  ### Framework versions
 
20
 
21
  This model is a fine-tuned version of [alex-miller/ODABert](https://huggingface.co/alex-miller/ODABert) on the None dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 0.4448
24
+ - Accuracy: 0.91
25
+ - F1: 0.9217
26
+ - Precision: 0.9464
27
+ - Recall: 0.8983
28
 
29
  ## Model description
30
 
 
43
  ### Training hyperparameters
44
 
45
  The following hyperparameters were used during training:
46
+ - learning_rate: 0.0001
47
  - train_batch_size: 64
48
  - eval_batch_size: 64
49
  - seed: 42
 
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
57
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
58
+ | 0.5967 | 1.0 | 7 | 0.3370 | 0.86 | 0.8727 | 0.9412 | 0.8136 |
59
+ | 0.3255 | 2.0 | 14 | 0.3018 | 0.88 | 0.9016 | 0.8730 | 0.9322 |
60
+ | 0.2274 | 3.0 | 21 | 0.3502 | 0.89 | 0.9076 | 0.9 | 0.9153 |
61
+ | 0.0835 | 4.0 | 28 | 0.4278 | 0.88 | 0.8889 | 0.9796 | 0.8136 |
62
+ | 0.0568 | 5.0 | 35 | 0.7164 | 0.82 | 0.8571 | 0.8060 | 0.9153 |
63
+ | 0.0979 | 6.0 | 42 | 0.3929 | 0.88 | 0.8909 | 0.9608 | 0.8305 |
64
+ | 0.0374 | 7.0 | 49 | 0.4090 | 0.9 | 0.9153 | 0.9153 | 0.9153 |
65
+ | 0.0208 | 8.0 | 56 | 0.5139 | 0.9 | 0.9153 | 0.9153 | 0.9153 |
66
+ | 0.0216 | 9.0 | 63 | 0.4479 | 0.91 | 0.9217 | 0.9464 | 0.8983 |
67
+ | 0.0114 | 10.0 | 70 | 0.4448 | 0.91 | 0.9217 | 0.9464 | 0.8983 |
68
 
69
 
70
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:51e4ab5b823d7fa281f9af1439beeafd0961806f6088cfe43cc2697baa488145
3
  size 672708608
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:be9b616d87c408a87b5bb0e843549408941719f6a9380e1fb22c6e902405b5a1
3
  size 672708608
runs/Aug05_16-28-21_49d74c1f1623/events.out.tfevents.1722875302.49d74c1f1623.226.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0871197323ea7f3faad65428b0fbf46e3378fca5096d08b6641ccae76b0eb5f8
3
+ size 12210
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:140e90b51eaa0f352864c1728f09e194c7d52d6f1948007d4a5b613793c3d08c
3
  size 5112
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ca9c7dab4a2f32f99722937ea39c9998ae24ba2a0f6ab46789a6a265e4d339e3
3
  size 5112