End of training
Browse files- README.md +23 -23
- config.json +4 -4
- model.safetensors +1 -1
- training_args.bin +1 -1
README.md
CHANGED
|
@@ -14,13 +14,13 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 14 |
|
| 15 |
This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
|
| 16 |
It achieves the following results on the evaluation set:
|
| 17 |
-
- Loss:
|
| 18 |
-
- F1 Macro: 0.
|
| 19 |
-
- Precision Macro: 0.
|
| 20 |
-
- Recall Macro: 0.
|
| 21 |
-
- F1 Micro: 0.
|
| 22 |
-
- Precision Micro: 0.
|
| 23 |
-
- Recall Micro: 0.
|
| 24 |
|
| 25 |
## Model description
|
| 26 |
|
|
@@ -45,28 +45,28 @@ The following hyperparameters were used during training:
|
|
| 45 |
- seed: 42
|
| 46 |
- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
|
| 47 |
- lr_scheduler_type: linear
|
| 48 |
-
- lr_scheduler_warmup_steps:
|
| 49 |
- num_epochs: 15
|
| 50 |
|
| 51 |
### Training results
|
| 52 |
|
| 53 |
| Training Loss | Epoch | Step | Validation Loss | F1 Macro | Precision Macro | Recall Macro | F1 Micro | Precision Micro | Recall Micro |
|
| 54 |
|:-------------:|:-----:|:----:|:---------------:|:--------:|:---------------:|:------------:|:--------:|:---------------:|:------------:|
|
| 55 |
-
|
|
| 56 |
-
|
|
| 57 |
-
|
|
| 58 |
-
|
|
| 59 |
-
|
|
| 60 |
-
|
|
| 61 |
-
|
|
| 62 |
-
|
|
| 63 |
-
|
|
| 64 |
-
| 0.
|
| 65 |
-
|
|
| 66 |
-
|
|
| 67 |
-
| 0.
|
| 68 |
-
|
|
| 69 |
-
|
|
| 70 |
|
| 71 |
|
| 72 |
### Framework versions
|
|
|
|
| 14 |
|
| 15 |
This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
|
| 16 |
It achieves the following results on the evaluation set:
|
| 17 |
+
- Loss: 0.5482
|
| 18 |
+
- F1 Macro: 0.8380
|
| 19 |
+
- Precision Macro: 0.8096
|
| 20 |
+
- Recall Macro: 0.8688
|
| 21 |
+
- F1 Micro: 0.8552
|
| 22 |
+
- Precision Micro: 0.8252
|
| 23 |
+
- Recall Micro: 0.8874
|
| 24 |
|
| 25 |
## Model description
|
| 26 |
|
|
|
|
| 45 |
- seed: 42
|
| 46 |
- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
|
| 47 |
- lr_scheduler_type: linear
|
| 48 |
+
- lr_scheduler_warmup_steps: 212
|
| 49 |
- num_epochs: 15
|
| 50 |
|
| 51 |
### Training results
|
| 52 |
|
| 53 |
| Training Loss | Epoch | Step | Validation Loss | F1 Macro | Precision Macro | Recall Macro | F1 Micro | Precision Micro | Recall Micro |
|
| 54 |
|:-------------:|:-----:|:----:|:---------------:|:--------:|:---------------:|:------------:|:--------:|:---------------:|:------------:|
|
| 55 |
+
| 0.6444 | 1.0 | 213 | 0.5045 | 0.6683 | 0.5514 | 0.9416 | 0.6611 | 0.5087 | 0.9437 |
|
| 56 |
+
| 0.3875 | 2.0 | 426 | 0.3121 | 0.8016 | 0.7045 | 0.9342 | 0.8214 | 0.7272 | 0.9437 |
|
| 57 |
+
| 0.292 | 3.0 | 639 | 0.3003 | 0.8095 | 0.7256 | 0.9294 | 0.8265 | 0.7398 | 0.9361 |
|
| 58 |
+
| 0.2172 | 4.0 | 852 | 0.3231 | 0.8340 | 0.7807 | 0.8982 | 0.8509 | 0.7973 | 0.9122 |
|
| 59 |
+
| 0.1935 | 5.0 | 1065 | 0.3262 | 0.8262 | 0.7628 | 0.9073 | 0.8445 | 0.7826 | 0.9170 |
|
| 60 |
+
| 0.154 | 6.0 | 1278 | 0.3807 | 0.8351 | 0.7975 | 0.8794 | 0.8506 | 0.8183 | 0.8855 |
|
| 61 |
+
| 0.1007 | 7.0 | 1491 | 0.4045 | 0.8297 | 0.7774 | 0.8922 | 0.8456 | 0.7902 | 0.9094 |
|
| 62 |
+
| 0.0866 | 8.0 | 1704 | 0.4100 | 0.8289 | 0.7706 | 0.9010 | 0.8434 | 0.7863 | 0.9094 |
|
| 63 |
+
| 0.0671 | 9.0 | 1917 | 0.4667 | 0.8335 | 0.7981 | 0.8726 | 0.8497 | 0.8127 | 0.8903 |
|
| 64 |
+
| 0.0544 | 10.0 | 2130 | 0.5062 | 0.8412 | 0.8139 | 0.8707 | 0.8557 | 0.8254 | 0.8884 |
|
| 65 |
+
| 0.0482 | 11.0 | 2343 | 0.5141 | 0.8335 | 0.8076 | 0.8616 | 0.8521 | 0.8287 | 0.8769 |
|
| 66 |
+
| 0.0377 | 12.0 | 2556 | 0.5217 | 0.8346 | 0.8022 | 0.8699 | 0.8520 | 0.8194 | 0.8874 |
|
| 67 |
+
| 0.0304 | 13.0 | 2769 | 0.5419 | 0.8370 | 0.8104 | 0.8658 | 0.8537 | 0.8266 | 0.8826 |
|
| 68 |
+
| 0.0307 | 14.0 | 2982 | 0.5397 | 0.8367 | 0.8043 | 0.8721 | 0.8533 | 0.8210 | 0.8884 |
|
| 69 |
+
| 0.0238 | 15.0 | 3195 | 0.5482 | 0.8380 | 0.8096 | 0.8688 | 0.8552 | 0.8252 | 0.8874 |
|
| 70 |
|
| 71 |
|
| 72 |
### Framework versions
|
config.json
CHANGED
|
@@ -5,10 +5,10 @@
|
|
| 5 |
"model_type": "bert_model",
|
| 6 |
"num_classes": 4,
|
| 7 |
"pos_weight": [
|
| 8 |
-
|
| 9 |
-
|
| 10 |
-
|
| 11 |
-
|
| 12 |
],
|
| 13 |
"torch_dtype": "float32",
|
| 14 |
"transformers_version": "4.47.0"
|
|
|
|
| 5 |
"model_type": "bert_model",
|
| 6 |
"num_classes": 4,
|
| 7 |
"pos_weight": [
|
| 8 |
+
6.803561171740379,
|
| 9 |
+
7.997350993377483,
|
| 10 |
+
2.8530913216108904,
|
| 11 |
+
5.13086642599278
|
| 12 |
],
|
| 13 |
"torch_dtype": "float32",
|
| 14 |
"transformers_version": "4.47.0"
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 437964888
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a0a8d66f9bde3494d37e915b2988a90446b69b7fe4d49d6f8d7c62a6dff08272
|
| 3 |
size 437964888
|
training_args.bin
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 5368
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:da5fdee7b06418ccb0926044f31cc7ddb86ef91a16318e6916dc80154b687146
|
| 3 |
size 5368
|