trungpq commited on
Commit
b68ca9a
·
verified ·
1 Parent(s): 1e6f7ad

End of training

Browse files
Files changed (4) hide show
  1. README.md +80 -0
  2. config.json +12 -0
  3. model.safetensors +3 -0
  4. training_args.bin +3 -0
README.md ADDED
@@ -0,0 +1,80 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ tags:
4
+ - generated_from_trainer
5
+ metrics:
6
+ - accuracy
7
+ model-index:
8
+ - name: rlcc-new-appearance-upsample_replacement-absa-max
9
+ results: []
10
+ ---
11
+
12
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
+ should probably proofread and complete it, then remove this comment. -->
14
+
15
+ # rlcc-new-appearance-upsample_replacement-absa-max
16
+
17
+ This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
18
+ It achieves the following results on the evaluation set:
19
+ - Loss: 2.6950
20
+ - Accuracy: 0.6340
21
+ - F1 Macro: 0.5522
22
+ - Precision Macro: 0.6107
23
+ - Recall Macro: 0.5627
24
+ - Total Tf: [265, 153, 1101, 153]
25
+
26
+ ## Model description
27
+
28
+ More information needed
29
+
30
+ ## Intended uses & limitations
31
+
32
+ More information needed
33
+
34
+ ## Training and evaluation data
35
+
36
+ More information needed
37
+
38
+ ## Training procedure
39
+
40
+ ### Training hyperparameters
41
+
42
+ The following hyperparameters were used during training:
43
+ - learning_rate: 2e-05
44
+ - train_batch_size: 64
45
+ - eval_batch_size: 64
46
+ - seed: 42
47
+ - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
+ - lr_scheduler_type: linear
49
+ - lr_scheduler_warmup_steps: 44
50
+ - num_epochs: 25
51
+
52
+ ### Training results
53
+
54
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro | Precision Macro | Recall Macro | Total Tf |
55
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|:---------------:|:------------:|:---------------------:|
56
+ | 1.1211 | 1.0 | 45 | 1.1177 | 0.5359 | 0.3656 | 0.3252 | 0.5 | [224, 194, 1060, 194] |
57
+ | 1.1137 | 2.0 | 90 | 1.1100 | 0.5335 | 0.3706 | 0.3610 | 0.4970 | [223, 195, 1059, 195] |
58
+ | 0.9748 | 3.0 | 135 | 1.1131 | 0.6220 | 0.5222 | 0.5541 | 0.5492 | [260, 158, 1096, 158] |
59
+ | 0.7356 | 4.0 | 180 | 1.2100 | 0.6005 | 0.5350 | 0.5590 | 0.5661 | [251, 167, 1087, 167] |
60
+ | 0.6668 | 5.0 | 225 | 1.2673 | 0.6124 | 0.5507 | 0.5629 | 0.5569 | [256, 162, 1092, 162] |
61
+ | 0.4741 | 6.0 | 270 | 1.4287 | 0.6077 | 0.5256 | 0.5559 | 0.5372 | [254, 164, 1090, 164] |
62
+ | 0.43 | 7.0 | 315 | 1.5078 | 0.6172 | 0.5497 | 0.5736 | 0.5523 | [258, 160, 1094, 160] |
63
+ | 0.3213 | 8.0 | 360 | 1.6583 | 0.6364 | 0.5492 | 0.6162 | 0.5612 | [266, 152, 1102, 152] |
64
+ | 0.2496 | 9.0 | 405 | 1.6353 | 0.6364 | 0.5850 | 0.5959 | 0.5862 | [266, 152, 1102, 152] |
65
+ | 0.1908 | 10.0 | 450 | 1.8595 | 0.6364 | 0.5635 | 0.6157 | 0.5688 | [266, 152, 1102, 152] |
66
+ | 0.1383 | 11.0 | 495 | 2.0273 | 0.6292 | 0.5662 | 0.5924 | 0.5734 | [263, 155, 1099, 155] |
67
+ | 0.1125 | 12.0 | 540 | 2.0201 | 0.6555 | 0.6054 | 0.6258 | 0.6020 | [274, 144, 1110, 144] |
68
+ | 0.1046 | 13.0 | 585 | 2.3728 | 0.6411 | 0.5737 | 0.6257 | 0.5854 | [268, 150, 1104, 150] |
69
+ | 0.0897 | 14.0 | 630 | 2.4554 | 0.6459 | 0.5712 | 0.6292 | 0.5861 | [270, 148, 1106, 148] |
70
+ | 0.0518 | 15.0 | 675 | 2.2957 | 0.6531 | 0.5947 | 0.6288 | 0.5921 | [273, 145, 1109, 145] |
71
+ | 0.0587 | 16.0 | 720 | 2.4788 | 0.6411 | 0.5814 | 0.6075 | 0.5801 | [268, 150, 1104, 150] |
72
+ | 0.0445 | 17.0 | 765 | 2.6950 | 0.6340 | 0.5522 | 0.6107 | 0.5627 | [265, 153, 1101, 153] |
73
+
74
+
75
+ ### Framework versions
76
+
77
+ - Transformers 4.52.4
78
+ - Pytorch 2.6.0+cu124
79
+ - Datasets 3.6.0
80
+ - Tokenizers 0.21.2
config.json ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "absa_method": "max",
3
+ "architectures": [
4
+ "BERTModel"
5
+ ],
6
+ "class_weight": null,
7
+ "class_weights": null,
8
+ "model_type": "bert_with_absa",
9
+ "num_classes": 3,
10
+ "torch_dtype": "float32",
11
+ "transformers_version": "4.52.4"
12
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:90b22d2a4bf7c4a0b78f57fa654466b53ef9497b84e5bc5ea8c0c1b3d4dcc1e8
3
+ size 875933728
training_args.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4acb6cbc73ded91d62836eb00da7c17e814ac92e40e08aa6f99386afe8f475ed
3
+ size 5368