Fariman commited on
Commit
158d18b
·
verified ·
1 Parent(s): 0a2adb2

End of training

Browse files
Files changed (3) hide show
  1. README.md +20 -15
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -18,19 +18,19 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [google/electra-small-discriminator](https://huggingface.co/google/electra-small-discriminator) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.4973
22
- - Accuracy: 0.8541
23
- - F1 Macro: 0.3638
24
- - Precision Destination: 0.9241
25
- - Recall Destination: 0.9262
26
- - Precision Origin: 0.4247
27
- - Recall Origin: 0.7045
28
  - Precision Other: 0.0
29
  - Recall Other: 0.0
30
  - Precision Transit: 0.0
31
  - Recall Transit: 0.0
32
- - Super Metric: 1.6307
33
- - Raw Super Metric: 1.6307
34
 
35
  ## Model description
36
 
@@ -55,18 +55,23 @@ The following hyperparameters were used during training:
55
  - seed: 42
56
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
57
  - lr_scheduler_type: linear
58
- - num_epochs: 5
59
  - mixed_precision_training: Native AMP
60
 
61
  ### Training results
62
 
63
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro | Precision Destination | Recall Destination | Precision Origin | Recall Origin | Precision Other | Recall Other | Precision Transit | Recall Transit | Super Metric | Raw Super Metric |
64
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|:---------------------:|:------------------:|:----------------:|:-------------:|:---------------:|:------------:|:-----------------:|:--------------:|:------------:|:----------------:|
65
- | 0.985 | 1.0 | 185 | 0.8616 | 0.8580 | 0.2309 | 0.8580 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.98 | 1.0 |
66
- | 0.7538 | 2.0 | 370 | 0.6105 | 0.8234 | 0.3503 | 0.9292 | 0.8814 | 0.3608 | 0.7955 | 0.0 | 0.0 | 0.0 | 0.0 | 1.6769 | 1.6769 |
67
- | 0.5364 | 3.0 | 555 | 0.5184 | 0.8503 | 0.3683 | 0.9315 | 0.9128 | 0.4217 | 0.7955 | 0.0 | 0.0 | 0.0 | 0.0 | 1.7082 | 1.7082 |
68
- | 0.5256 | 4.0 | 740 | 0.4888 | 0.8522 | 0.3661 | 0.9278 | 0.9195 | 0.4231 | 0.75 | 0.0 | 0.0 | 0.0 | 0.0 | 1.6695 | 1.6695 |
69
- | 0.522 | 5.0 | 925 | 0.4973 | 0.8541 | 0.3638 | 0.9241 | 0.9262 | 0.4247 | 0.7045 | 0.0 | 0.0 | 0.0 | 0.0 | 1.6307 | 1.6307 |
 
 
 
 
 
70
 
71
 
72
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [google/electra-small-discriminator](https://huggingface.co/google/electra-small-discriminator) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.5391
22
+ - Accuracy: 0.8906
23
+ - F1 Macro: 0.4073
24
+ - Precision Destination: 0.9266
25
+ - Recall Destination: 0.9597
26
+ - Precision Origin: 0.6034
27
+ - Recall Origin: 0.7955
28
  - Precision Other: 0.0
29
  - Recall Other: 0.0
30
  - Precision Transit: 0.0
31
  - Recall Transit: 0.0
32
+ - Super Metric: 1.7552
33
+ - Raw Super Metric: 1.7552
34
 
35
  ## Model description
36
 
 
55
  - seed: 42
56
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
57
  - lr_scheduler_type: linear
58
+ - num_epochs: 10
59
  - mixed_precision_training: Native AMP
60
 
61
  ### Training results
62
 
63
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro | Precision Destination | Recall Destination | Precision Origin | Recall Origin | Precision Other | Recall Other | Precision Transit | Recall Transit | Super Metric | Raw Super Metric |
64
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|:---------------------:|:------------------:|:----------------:|:-------------:|:---------------:|:------------:|:-----------------:|:--------------:|:------------:|:----------------:|
65
+ | 0.9832 | 1.0 | 185 | 0.8369 | 0.8580 | 0.2309 | 0.8580 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.98 | 1.0 |
66
+ | 0.6996 | 2.0 | 370 | 0.5563 | 0.8349 | 0.3584 | 0.9281 | 0.8949 | 0.3889 | 0.7955 | 0.0 | 0.0 | 0.0 | 0.0 | 1.6903 | 1.6903 |
67
+ | 0.5078 | 3.0 | 555 | 0.4766 | 0.8599 | 0.3710 | 0.9305 | 0.9284 | 0.44 | 0.75 | 0.0 | 0.0 | 0.0 | 0.0 | 1.6784 | 1.6784 |
68
+ | 0.4209 | 4.0 | 740 | 0.4103 | 0.8714 | 0.3847 | 0.9332 | 0.9374 | 0.4861 | 0.7955 | 0.0 | 0.0 | 0.0 | 0.0 | 1.7328 | 1.7328 |
69
+ | 0.3891 | 5.0 | 925 | 0.4396 | 0.8810 | 0.3957 | 0.9338 | 0.9463 | 0.5294 | 0.8182 | 0.0 | 0.0 | 0.0 | 0.0 | 1.7645 | 1.7645 |
70
+ | 0.2696 | 6.0 | 1110 | 0.5042 | 0.8887 | 0.4026 | 0.9304 | 0.9575 | 0.5738 | 0.7955 | 0.0 | 0.0 | 0.0 | 0.0 | 1.7529 | 1.7529 |
71
+ | 0.3568 | 7.0 | 1295 | 0.5009 | 0.8925 | 0.4093 | 0.9267 | 0.9620 | 0.6140 | 0.7955 | 0.0 | 0.0 | 0.0 | 0.0 | 1.7574 | 1.7574 |
72
+ | 0.2611 | 8.0 | 1480 | 0.5011 | 0.8772 | 0.3907 | 0.9316 | 0.9441 | 0.5147 | 0.7955 | 0.0 | 0.0 | 0.0 | 0.0 | 1.7395 | 1.7395 |
73
+ | 0.2724 | 9.0 | 1665 | 0.5354 | 0.8925 | 0.4093 | 0.9267 | 0.9620 | 0.6140 | 0.7955 | 0.0 | 0.0 | 0.0 | 0.0 | 1.7574 | 1.7574 |
74
+ | 0.3317 | 10.0 | 1850 | 0.5391 | 0.8906 | 0.4073 | 0.9266 | 0.9597 | 0.6034 | 0.7955 | 0.0 | 0.0 | 0.0 | 0.0 | 1.7552 | 1.7552 |
75
 
76
 
77
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c5ca3469401921f1c696547f2fdda81ff3d1cbd02cc3d63af29a7f3a2b7d41c3
3
  size 54223256
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f28915cdd3b81d18230475afc2d71fdf6309b5b7570a4420abce5548bb5e71e8
3
  size 54223256
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4a6432b46e82ea5477b496956f7d8aebbf7ed0559259acfc3d31030ae1cfecde
3
  size 5304
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cd8a090393a0d11a1cd36df495ee3929f346ff3e3f55250edb1192de320695df
3
  size 5304