End of training

Files changed (4)

README.md +23 -23
config.json +4 -4
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -14,13 +14,13 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.5583
-- F1 Macro: 0.2390
-- Precision Macro: 0.1576
-- Recall Macro: 0.5983
-- F1 Micro: 0.2661
-- Precision Micro: 0.1710
-- Recall Micro: 0.6
 ## Model description
@@ -45,28 +45,28 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 1
 - num_epochs: 15
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | F1 Macro | Precision Macro | Recall Macro | F1 Micro | Precision Micro | Recall Micro |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------------:|:------------:|:--------:|:---------------:|:------------:|
-| 1.4025        | 1.0   | 2    | 1.6025          | 0.1875   | 0.1116          | 0.7115       | 0.2386   | 0.1414          | 0.7636       |
-| 1.082         | 2.0   | 4    | 1.5860          | 0.1866   | 0.1109          | 0.7204       | 0.2353   | 0.1391          | 0.7636       |
-| 1.1491        | 3.0   | 6    | 1.5860          | 0.2050   | 0.1298          | 0.7134       | 0.2363   | 0.1404          | 0.7455       |
-| 1.134         | 4.0   | 8    | 1.5785          | 0.2097   | 0.1311          | 0.7134       | 0.2405   | 0.1434          | 0.7455       |
-| 1.1738        | 5.0   | 10   | 1.5680          | 0.2450   | 0.1561          | 0.7589       | 0.2567   | 0.1536          | 0.7818       |
-| 1.2036        | 6.0   | 12   | 1.5670          | 0.2407   | 0.1538          | 0.7257       | 0.2611   | 0.1583          | 0.7455       |
-| 1.0812        | 7.0   | 14   | 1.5687          | 0.2399   | 0.1537          | 0.7065       | 0.2614   | 0.1594          | 0.7273       |
-| 1.2164        | 8.0   | 16   | 1.5680          | 0.2403   | 0.1551          | 0.6769       | 0.2648   | 0.1638          | 0.6909       |
-| 1.0811        | 9.0   | 18   | 1.5678          | 0.2085   | 0.1359          | 0.5807       | 0.2357   | 0.1467          | 0.6          |
-| 0.999         | 10.0  | 20   | 1.5698          | 0.2155   | 0.1397          | 0.5895       | 0.2444   | 0.1535          | 0.6          |
-| 1.071         | 11.0  | 22   | 1.5737          | 0.2212   | 0.1450          | 0.5807       | 0.2578   | 0.1642          | 0.6          |
-| 1.2738        | 12.0  | 24   | 1.5721          | 0.2291   | 0.1501          | 0.5911       | 0.2688   | 0.1717          | 0.6182       |
-| 0.9731        | 13.0  | 26   | 1.5666          | 0.2339   | 0.1541          | 0.5895       | 0.2672   | 0.1719          | 0.6          |
-| 1.072         | 14.0  | 28   | 1.5598          | 0.2405   | 0.1591          | 0.5983       | 0.2672   | 0.1719          | 0.6          |
-| 1.0495        | 15.0  | 30   | 1.5583          | 0.2390   | 0.1576          | 0.5983       | 0.2661   | 0.1710          | 0.6          |
 ### Framework versions

 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5482
+- F1 Macro: 0.8380
+- Precision Macro: 0.8096
+- Recall Macro: 0.8688
+- F1 Micro: 0.8552
+- Precision Micro: 0.8252
+- Recall Micro: 0.8874
 ## Model description
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 212
 - num_epochs: 15
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | F1 Macro | Precision Macro | Recall Macro | F1 Micro | Precision Micro | Recall Micro |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------------:|:------------:|:--------:|:---------------:|:------------:|
+| 0.6444        | 1.0   | 213  | 0.5045          | 0.6683   | 0.5514          | 0.9416       | 0.6611   | 0.5087          | 0.9437       |
+| 0.3875        | 2.0   | 426  | 0.3121          | 0.8016   | 0.7045          | 0.9342       | 0.8214   | 0.7272          | 0.9437       |
+| 0.292         | 3.0   | 639  | 0.3003          | 0.8095   | 0.7256          | 0.9294       | 0.8265   | 0.7398          | 0.9361       |
+| 0.2172        | 4.0   | 852  | 0.3231          | 0.8340   | 0.7807          | 0.8982       | 0.8509   | 0.7973          | 0.9122       |
+| 0.1935        | 5.0   | 1065 | 0.3262          | 0.8262   | 0.7628          | 0.9073       | 0.8445   | 0.7826          | 0.9170       |
+| 0.154         | 6.0   | 1278 | 0.3807          | 0.8351   | 0.7975          | 0.8794       | 0.8506   | 0.8183          | 0.8855       |
+| 0.1007        | 7.0   | 1491 | 0.4045          | 0.8297   | 0.7774          | 0.8922       | 0.8456   | 0.7902          | 0.9094       |
+| 0.0866        | 8.0   | 1704 | 0.4100          | 0.8289   | 0.7706          | 0.9010       | 0.8434   | 0.7863          | 0.9094       |
+| 0.0671        | 9.0   | 1917 | 0.4667          | 0.8335   | 0.7981          | 0.8726       | 0.8497   | 0.8127          | 0.8903       |
+| 0.0544        | 10.0  | 2130 | 0.5062          | 0.8412   | 0.8139          | 0.8707       | 0.8557   | 0.8254          | 0.8884       |
+| 0.0482        | 11.0  | 2343 | 0.5141          | 0.8335   | 0.8076          | 0.8616       | 0.8521   | 0.8287          | 0.8769       |
+| 0.0377        | 12.0  | 2556 | 0.5217          | 0.8346   | 0.8022          | 0.8699       | 0.8520   | 0.8194          | 0.8874       |
+| 0.0304        | 13.0  | 2769 | 0.5419          | 0.8370   | 0.8104          | 0.8658       | 0.8537   | 0.8266          | 0.8826       |
+| 0.0307        | 14.0  | 2982 | 0.5397          | 0.8367   | 0.8043          | 0.8721       | 0.8533   | 0.8210          | 0.8884       |
+| 0.0238        | 15.0  | 3195 | 0.5482          | 0.8380   | 0.8096          | 0.8688       | 0.8552   | 0.8252          | 0.8874       |
 ### Framework versions
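
The results table reports macro and micro averages side by side. The sketch below shows how the two averages differ. It assumes a multi-label setup, which the per-class `pos_weight` list in `config.json` suggests but the card does not state, and the usual definitions: macro averages the per-label scores, micro pools the counts across labels first. The function names `prf` and `macro_micro` are illustrative and do not come from the training script.

```python
from typing import Dict, List, Tuple


def prf(tp: int, fp: int, fn: int) -> Tuple[float, float, float]:
    """Precision, recall and F1 from raw counts; 0.0 when a ratio is undefined."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    return precision, recall, f1


def macro_micro(y_true: List[List[int]], y_pred: List[List[int]]) -> Dict[str, Tuple[float, float, float]]:
    """Return (precision, recall, f1) under macro and micro averaging.

    y_true and y_pred are lists of rows, one 0/1 entry per label.
    """
    n_labels = len(y_true[0])
    counts = []
    for j in range(n_labels):
        tp = sum(1 for t, p in zip(y_true, y_pred) if t[j] == 1 and p[j] == 1)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t[j] == 0 and p[j] == 1)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t[j] == 1 and p[j] == 0)
        counts.append((tp, fp, fn))

    # Macro: score each label separately, then take the unweighted mean.
    per_label = [prf(*c) for c in counts]
    macro = tuple(sum(scores[i] for scores in per_label) / n_labels for i in range(3))

    # Micro: pool the counts over all labels, then score once.
    micro = prf(
        sum(c[0] for c in counts),
        sum(c[1] for c in counts),
        sum(c[2] for c in counts),
    )
    return {"macro": macro, "micro": micro}


if __name__ == "__main__":
    # Toy data with two labels, purely for illustration.
    truth = [[1, 0], [1, 0], [0, 1], [0, 0]]
    preds = [[1, 0], [0, 0], [0, 1], [0, 1]]
    print(macro_micro(truth, preds))
```

Macro averaging gives every label equal weight, so rare labels move it as much as common ones. Micro averaging is dominated by the frequent labels. That difference is one plausible reason the macro and micro columns diverge in the table.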

config.json CHANGED

@@ -5,10 +5,10 @@
   "model_type": "bert_model",
   "num_classes": 4,
   "pos_weight": [
-    24.0,
-    15.666666666666666,
-    4.555555555555555,
-    6.142857142857143
   ],
   "torch_dtype": "float32",
   "transformers_version": "4.47.0"

   "model_type": "bert_model",
   "num_classes": 4,
   "pos_weight": [
+    6.803561171740379,
+    7.997350993377483,
+    2.8530913216108904,
+    5.13086642599278
   ],
   "torch_dtype": "float32",
   "transformers_version": "4.47.0"
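
The diff does not say how `pos_weight` is derived. A common convention, assumed here, is the per-label ratio of negative to positive training examples, which is the form PyTorch's `BCEWithLogitsLoss` accepts as its `pos_weight` argument. The removed values match that ratio for a 50-example set with 2, 3, 9 and 7 positives per label (48/2, 47/3, 41/9, 43/7). That is an inference from the arithmetic, not something the commit confirms. The helper names below are hypothetical.

```python
from typing import List


def pos_weight_from_labels(labels: List[List[int]]) -> List[float]:
    """Per-label negatives/positives ratio from a 0/1 label matrix.

    Assumption: this is the convention behind `pos_weight`; the commit
    itself does not state how the values were computed.
    """
    n_examples = len(labels)
    n_labels = len(labels[0])
    weights = []
    for j in range(n_labels):
        positives = sum(row[j] for row in labels)
        if positives == 0:
            raise ValueError(f"label {j} has no positive examples")
        weights.append((n_examples - positives) / positives)
    return weights


def make_labels(n_examples: int, pos_counts: List[int]) -> List[List[int]]:
    """Build a synthetic label matrix with the given number of positives per label."""
    return [[1 if i < count else 0 for count in pos_counts] for i in range(n_examples)]


if __name__ == "__main__":
    # Synthetic data chosen to match the removed config values.
    print(pos_weight_from_labels(make_labels(50, [2, 3, 9, 7])))
```

If the assumption holds, the smaller values in the new config point to a larger training set with a less extreme class balance. The commit does not include the label counts needed to check this.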

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:191121d51980d0d8c5013bc3635cdc65881e296ec8b5234d24986f51e87c2267
 size 437964888

 version https://git-lfs.github.com/spec/v1
+oid sha256:a0a8d66f9bde3494d37e915b2988a90446b69b7fe4d49d6f8d7c62a6dff08272
 size 437964888
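
The `model.safetensors` entry in the repository is a Git LFS pointer, not the weights themselves: it records a SHA-256 `oid` and a byte `size`. The sketch below shows one way to check a downloaded file against such a pointer. The function names are illustrative, and the parser assumes the simple `key value` line layout shown in the diff.

```python
import hashlib
from pathlib import Path
from typing import Dict, Union


def parse_lfs_pointer(text: str) -> Dict[str, Union[str, int]]:
    """Parse the `version` / `oid` / `size` lines of a Git LFS pointer."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Union[str, Path], pointer: Dict[str, Union[str, int]]) -> bool:
    """Return True if the file's size and hash agree with the pointer."""
    file_path = Path(path)
    if file_path.stat().st_size != pointer["size"]:
        return False  # Cheap size check before hashing a large file.
    hasher = hashlib.new(str(pointer["algo"]))
    with file_path.open("rb") as handle:
        # Read in 1 MiB chunks so large weight files are not loaded at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer_text = (
        "version https://git-lfs.github.com/spec/v1\n"
        "oid sha256:a0a8d66f9bde3494d37e915b2988a90446b69b7fe4d49d6f8d7c62a6dff08272\n"
        "size 437964888\n"
    )
    print(parse_lfs_pointer(pointer_text))
```

The size is identical before and after the commit (437964888 bytes) while the `oid` changes. That is what retraining the same architecture would produce: the tensors keep their shapes and only their values differ.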

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4014918409db83589f1bbd62c7b44833686b2ff8b3bad5a08246ddc7f29f031a
 size 5368

 version https://git-lfs.github.com/spec/v1
+oid sha256:da5fdee7b06418ccb0926044f31cc7ddb86ef91a16318e6916dc80154b687146
 size 5368