Model save

Browse files

Files changed (4) hide show

README.md +19 -17
all_results.json +5 -5
model.safetensors +1 -1
train_results.json +5 -5

README.md CHANGED Viewed

@@ -19,12 +19,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6168
-- Accuracy: 0.8304
-- F1: 0.8303
-- Precision: 0.8305
-- Recall: 0.8304
-- Roc Auc: 0.8836
 ## Model description
@@ -44,26 +44,28 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
-- train_batch_size: 20
-- eval_batch_size: 20
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 44
-- training_steps: 440
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch   | Step | Validation Loss | Accuracy | F1     | Precision | Recall | Roc Auc |
 |:-------------:|:-------:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|:-------:|
-| 0.6732        | 3.0182  | 44   | 0.6748          | 0.7321   | 0.7321 | 0.7324    | 0.7321 | 0.7564  |
-| 0.6421        | 7.0091  | 88   | 0.6404          | 0.8036   | 0.8035 | 0.8040    | 0.8036 | 0.8482  |
-| 0.6014        | 10.0273 | 132  | 0.6064          | 0.8036   | 0.8035 | 0.8040    | 0.8036 | 0.8673  |
-| 0.5425        | 14.0182 | 176  | 0.5740          | 0.8036   | 0.8035 | 0.8040    | 0.8036 | 0.8776  |
-| 0.4931        | 18.0091 | 220  | 0.5276          | 0.7857   | 0.7846 | 0.7917    | 0.7857 | 0.8852  |
-| 0.4091        | 21.0273 | 264  | 0.4804          | 0.8036   | 0.8030 | 0.8071    | 0.8036 | 0.9018  |
-| 0.3446        | 25.0182 | 308  | 0.4592          | 0.8036   | 0.8020 | 0.8136    | 0.8036 | 0.8967  |
 ### Framework versions

 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4191
+- Accuracy: 0.8929
+- F1: 0.8927
+- Precision: 0.8949
+- Recall: 0.8929
+- Roc Auc: 0.9783
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
+- train_batch_size: 23
+- eval_batch_size: 23
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 81
+- training_steps: 810
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch   | Step | Validation Loss | Accuracy | F1     | Precision | Recall | Roc Auc |
 |:-------------:|:-------:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|:-------:|
+| 0.6473        | 6.0111  | 81   | 0.6350          | 0.75     | 0.7497 | 0.7513    | 0.75   | 0.8418  |
+| 0.5438        | 13.0074 | 162  | 0.5559          | 0.7857   | 0.7857 | 0.7857    | 0.7857 | 0.8980  |
+| 0.4124        | 20.0037 | 243  | 0.4445          | 0.8571   | 0.8564 | 0.8646    | 0.8571 | 0.9439  |
+| 0.2958        | 26.0148 | 324  | 0.3501          | 0.8929   | 0.8927 | 0.8949    | 0.8929 | 0.9745  |
+| 0.2126        | 33.0111 | 405  | 0.2827          | 0.8929   | 0.8927 | 0.8949    | 0.8929 | 0.9745  |
+| 0.1469        | 40.0074 | 486  | 0.3615          | 0.875    | 0.8746 | 0.8794    | 0.875  | 0.9732  |
+| 0.1063        | 47.0037 | 567  | 0.3208          | 0.8929   | 0.8927 | 0.8949    | 0.8929 | 0.9783  |
+| 0.0883        | 53.0148 | 648  | 0.4270          | 0.875    | 0.8746 | 0.8794    | 0.875  | 0.9745  |
+| 0.0631        | 60.0111 | 729  | 0.4191          | 0.8929   | 0.8927 | 0.8949    | 0.8929 | 0.9783  |
 ### Framework versions

all_results.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-    "epoch": 25.01818181818182,
     "eval_accuracy": 0.8303571428571429,
     "eval_f1": 0.8303436179542373,
     "eval_loss": 0.6168394088745117,
@@ -10,8 +10,8 @@
     "eval_samples_per_second": 1.451,
     "eval_steps_per_second": 0.078,
     "total_flos": 0.0,
-    "train_loss": 0.5427347839652717,
-    "train_runtime": 4082.9807,
-    "train_samples_per_second": 2.155,
-    "train_steps_per_second": 0.108
 }

 {
+    "epoch": 60.01111111111111,
     "eval_accuracy": 0.8303571428571429,
     "eval_f1": 0.8303436179542373,
     "eval_loss": 0.6168394088745117,
     "eval_samples_per_second": 1.451,
     "eval_steps_per_second": 0.078,
     "total_flos": 0.0,
+    "train_loss": 0.2925308818516254,
+    "train_runtime": 9457.9204,
+    "train_samples_per_second": 1.97,
+    "train_steps_per_second": 0.086
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2ae84d1fe48109b3ca34d56c0a0b215aad6905f78dee594dc29670655a155cbf
 size 137270032

 version https://git-lfs.github.com/spec/v1
+oid sha256:23f4937332b094511b38eacee170c24e6f2d018b827f1855cc92f24e5b989bb0
 size 137270032

train_results.json CHANGED Viewed

@@ -1,8 +1,8 @@
 {
-    "epoch": 25.01818181818182,
     "total_flos": 0.0,
-    "train_loss": 0.5427347839652717,
-    "train_runtime": 4082.9807,
-    "train_samples_per_second": 2.155,
-    "train_steps_per_second": 0.108
 }

 {
+    "epoch": 60.01111111111111,
     "total_flos": 0.0,
+    "train_loss": 0.2925308818516254,
+    "train_runtime": 9457.9204,
+    "train_samples_per_second": 1.97,
+    "train_steps_per_second": 0.086
 }