Fariman
/

google_electra-small-discriminator

@@ -18,19 +18,19 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/electra-small-discriminator](https://huggingface.co/google/electra-small-discriminator) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4973
-- Accuracy: 0.8541
-- F1 Macro: 0.3638
-- Precision Destination: 0.9241
-- Recall Destination: 0.9262
-- Precision Origin: 0.4247
-- Recall Origin: 0.7045
 - Precision Other: 0.0
 - Recall Other: 0.0
 - Precision Transit: 0.0
 - Recall Transit: 0.0
-- Super Metric: 1.6307
-- Raw Super Metric: 1.6307
 ## Model description
@@ -55,18 +55,23 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 5
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro | Precision Destination | Recall Destination | Precision Origin | Recall Origin | Precision Other | Recall Other | Precision Transit | Recall Transit | Super Metric | Raw Super Metric |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|:---------------------:|:------------------:|:----------------:|:-------------:|:---------------:|:------------:|:-----------------:|:--------------:|:------------:|:----------------:|
-| 0.985         | 1.0   | 185  | 0.8616          | 0.8580   | 0.2309   | 0.8580                | 1.0                | 0.0              | 0.0           | 0.0             | 0.0          | 0.0               | 0.0            | 0.98         | 1.0              |
-| 0.7538        | 2.0   | 370  | 0.6105          | 0.8234   | 0.3503   | 0.9292                | 0.8814             | 0.3608           | 0.7955        | 0.0             | 0.0          | 0.0               | 0.0            | 1.6769       | 1.6769           |
-| 0.5364        | 3.0   | 555  | 0.5184          | 0.8503   | 0.3683   | 0.9315                | 0.9128             | 0.4217           | 0.7955        | 0.0             | 0.0          | 0.0               | 0.0            | 1.7082       | 1.7082           |
-| 0.5256        | 4.0   | 740  | 0.4888          | 0.8522   | 0.3661   | 0.9278                | 0.9195             | 0.4231           | 0.75          | 0.0             | 0.0          | 0.0               | 0.0            | 1.6695       | 1.6695           |
-| 0.522         | 5.0   | 925  | 0.4973          | 0.8541   | 0.3638   | 0.9241                | 0.9262             | 0.4247           | 0.7045        | 0.0             | 0.0          | 0.0               | 0.0            | 1.6307       | 1.6307           |
 ### Framework versions

 This model is a fine-tuned version of [google/electra-small-discriminator](https://huggingface.co/google/electra-small-discriminator) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5391
+- Accuracy: 0.8906
+- F1 Macro: 0.4073
+- Precision Destination: 0.9266
+- Recall Destination: 0.9597
+- Precision Origin: 0.6034
+- Recall Origin: 0.7955
 - Precision Other: 0.0
 - Recall Other: 0.0
 - Precision Transit: 0.0
 - Recall Transit: 0.0
+- Super Metric: 1.7552
+- Raw Super Metric: 1.7552
 ## Model description
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro | Precision Destination | Recall Destination | Precision Origin | Recall Origin | Precision Other | Recall Other | Precision Transit | Recall Transit | Super Metric | Raw Super Metric |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|:---------------------:|:------------------:|:----------------:|:-------------:|:---------------:|:------------:|:-----------------:|:--------------:|:------------:|:----------------:|
+| 0.9832        | 1.0   | 185  | 0.8369          | 0.8580   | 0.2309   | 0.8580                | 1.0                | 0.0              | 0.0           | 0.0             | 0.0          | 0.0               | 0.0            | 0.98         | 1.0              |
+| 0.6996        | 2.0   | 370  | 0.5563          | 0.8349   | 0.3584   | 0.9281                | 0.8949             | 0.3889           | 0.7955        | 0.0             | 0.0          | 0.0               | 0.0            | 1.6903       | 1.6903           |
+| 0.5078        | 3.0   | 555  | 0.4766          | 0.8599   | 0.3710   | 0.9305                | 0.9284             | 0.44             | 0.75          | 0.0             | 0.0          | 0.0               | 0.0            | 1.6784       | 1.6784           |
+| 0.4209        | 4.0   | 740  | 0.4103          | 0.8714   | 0.3847   | 0.9332                | 0.9374             | 0.4861           | 0.7955        | 0.0             | 0.0          | 0.0               | 0.0            | 1.7328       | 1.7328           |
+| 0.3891        | 5.0   | 925  | 0.4396          | 0.8810   | 0.3957   | 0.9338                | 0.9463             | 0.5294           | 0.8182        | 0.0             | 0.0          | 0.0               | 0.0            | 1.7645       | 1.7645           |
+| 0.2696        | 6.0   | 1110 | 0.5042          | 0.8887   | 0.4026   | 0.9304                | 0.9575             | 0.5738           | 0.7955        | 0.0             | 0.0          | 0.0               | 0.0            | 1.7529       | 1.7529           |
+| 0.3568        | 7.0   | 1295 | 0.5009          | 0.8925   | 0.4093   | 0.9267                | 0.9620             | 0.6140           | 0.7955        | 0.0             | 0.0          | 0.0               | 0.0            | 1.7574       | 1.7574           |
+| 0.2611        | 8.0   | 1480 | 0.5011          | 0.8772   | 0.3907   | 0.9316                | 0.9441             | 0.5147           | 0.7955        | 0.0             | 0.0          | 0.0               | 0.0            | 1.7395       | 1.7395           |
+| 0.2724        | 9.0   | 1665 | 0.5354          | 0.8925   | 0.4093   | 0.9267                | 0.9620             | 0.6140           | 0.7955        | 0.0             | 0.0          | 0.0               | 0.0            | 1.7574       | 1.7574           |
+| 0.3317        | 10.0  | 1850 | 0.5391          | 0.8906   | 0.4073   | 0.9266                | 0.9597             | 0.6034           | 0.7955        | 0.0             | 0.0          | 0.0               | 0.0            | 1.7552       | 1.7552           |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c5ca3469401921f1c696547f2fdda81ff3d1cbd02cc3d63af29a7f3a2b7d41c3
 size 54223256

 version https://git-lfs.github.com/spec/v1
+oid sha256:f28915cdd3b81d18230475afc2d71fdf6309b5b7570a4420abce5548bb5e71e8
 size 54223256

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4a6432b46e82ea5477b496956f7d8aebbf7ed0559259acfc3d31030ae1cfecde
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:cd8a090393a0d11a1cd36df495ee3929f346ff3e3f55250edb1192de320695df
 size 5304