trungpq
/

slac-new-appearance-upsample_replacement

@@ -16,12 +16,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3018
-- Accuracy: 0.91
-- F1 Macro: 0.7789
-- Precision Macro: 0.7901
-- Recall Macro: 0.7689
-- Total Tf: [91, 9, 91, 9]
 ## Model description
@@ -46,28 +46,28 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 2
 - num_epochs: 15
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro | Precision Macro | Recall Macro | Total Tf         |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|:---------------:|:------------:|:----------------:|
-| 0.683         | 1.0   | 3    | 0.5989          | 0.87     | 0.5315   | 0.6100          | 0.5303       | [87, 13, 87, 13] |
-| 0.6398        | 2.0   | 6    | 0.5865          | 0.87     | 0.6807   | 0.6879          | 0.6742       | [87, 13, 87, 13] |
-| 0.5988        | 3.0   | 9    | 0.5538          | 0.89     | 0.6801   | 0.7427          | 0.6496       | [89, 11, 89, 11] |
-| 0.5735        | 4.0   | 12   | 0.5253          | 0.89     | 0.6801   | 0.7427          | 0.6496       | [89, 11, 89, 11] |
-| 0.5122        | 5.0   | 15   | 0.4897          | 0.89     | 0.7298   | 0.7390          | 0.7216       | [89, 11, 89, 11] |
-| 0.3797        | 6.0   | 18   | 0.4573          | 0.86     | 0.7255   | 0.6978          | 0.7765       | [86, 14, 86, 14] |
-| 0.3398        | 7.0   | 21   | 0.4187          | 0.88     | 0.7347   | 0.7209          | 0.7519       | [88, 12, 88, 12] |
-| 0.3337        | 8.0   | 24   | 0.3798          | 0.91     | 0.7789   | 0.7901          | 0.7689       | [91, 9, 91, 9]   |
-| 0.2884        | 9.0   | 27   | 0.3550          | 0.91     | 0.7606   | 0.8004          | 0.7330       | [91, 9, 91, 9]   |
-| 0.2525        | 10.0  | 30   | 0.3381          | 0.91     | 0.7606   | 0.8004          | 0.7330       | [91, 9, 91, 9]   |
-| 0.2639        | 11.0  | 33   | 0.3255          | 0.91     | 0.7606   | 0.8004          | 0.7330       | [91, 9, 91, 9]   |
-| 0.2462        | 12.0  | 36   | 0.3144          | 0.91     | 0.7606   | 0.8004          | 0.7330       | [91, 9, 91, 9]   |
-| 0.2171        | 13.0  | 39   | 0.3073          | 0.91     | 0.7789   | 0.7901          | 0.7689       | [91, 9, 91, 9]   |
-| 0.211         | 14.0  | 42   | 0.3035          | 0.91     | 0.7789   | 0.7901          | 0.7689       | [91, 9, 91, 9]   |
-| 0.2252        | 15.0  | 45   | 0.3018          | 0.91     | 0.7789   | 0.7901          | 0.7689       | [91, 9, 91, 9]   |
 ### Framework versions

 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2258
+- Accuracy: 0.9735
+- F1 Macro: 0.9525
+- Precision Macro: 0.9561
+- Recall Macro: 0.9491
+- Total Tf: [1506, 41, 1506, 41]
 ## Model description
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 313
 - num_epochs: 15
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro | Precision Macro | Recall Macro | Total Tf             |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|:---------------:|:------------:|:--------------------:|
+| 0.1063        | 1.0   | 314  | 0.1240          | 0.9619   | 0.9354   | 0.9168          | 0.9573       | [1488, 59, 1488, 59] |
+| 0.0273        | 2.0   | 628  | 0.1173          | 0.9741   | 0.9545   | 0.9504          | 0.9586       | [1507, 40, 1507, 40] |
+| 0.0209        | 3.0   | 942  | 0.1238          | 0.9735   | 0.9528   | 0.9535          | 0.9521       | [1506, 41, 1506, 41] |
+| 0.0153        | 4.0   | 1256 | 0.1414          | 0.9716   | 0.9493   | 0.9507          | 0.9479       | [1503, 44, 1503, 44] |
+| 0.0034        | 5.0   | 1570 | 0.1748          | 0.9741   | 0.9533   | 0.9606          | 0.9465       | [1507, 40, 1507, 40] |
+| 0.0144        | 6.0   | 1884 | 0.1686          | 0.9709   | 0.9482   | 0.9489          | 0.9475       | [1502, 45, 1502, 45] |
+| 0.0033        | 7.0   | 2198 | 0.2072          | 0.9677   | 0.9420   | 0.9462          | 0.9380       | [1497, 50, 1497, 50] |
+| 0.0065        | 8.0   | 2512 | 0.1792          | 0.9741   | 0.9535   | 0.9592          | 0.9480       | [1507, 40, 1507, 40] |
+| 0.0024        | 9.0   | 2826 | 0.1889          | 0.9741   | 0.9540   | 0.9540          | 0.9540       | [1507, 40, 1507, 40] |
+| 0.0031        | 10.0  | 3140 | 0.2047          | 0.9729   | 0.9522   | 0.9482          | 0.9563       | [1505, 42, 1505, 42] |
+| 0.0006        | 11.0  | 3454 | 0.2151          | 0.9735   | 0.9521   | 0.9601          | 0.9445       | [1506, 41, 1506, 41] |
+| 0.0011        | 12.0  | 3768 | 0.2255          | 0.9722   | 0.9504   | 0.9525          | 0.9483       | [1504, 43, 1504, 43] |
+| 0.0011        | 13.0  | 4082 | 0.2239          | 0.9722   | 0.9505   | 0.9512          | 0.9498       | [1504, 43, 1504, 43] |
+| 0.0004        | 14.0  | 4396 | 0.2233          | 0.9735   | 0.9525   | 0.9561          | 0.9491       | [1506, 41, 1506, 41] |
+| 0.0011        | 15.0  | 4710 | 0.2258          | 0.9735   | 0.9525   | 0.9561          | 0.9491       | [1506, 41, 1506, 41] |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:77285879e57db05bbac9fac685cb6183f1801b87d735c2b5e3497be33d240fe2
 size 437955556

 version https://git-lfs.github.com/spec/v1
+oid sha256:f732f5f20335837ac7b72dcf8b58d785ecfcbd8691b91edb060398e282797f3d
 size 437955556

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5e3145ad148b899304801ab09f2d8084de9fd069bf958d25caf640e7bb199884
 size 5368

 version https://git-lfs.github.com/spec/v1
+oid sha256:a2123b7828c73941c5ccb6d149825dfa773f42f2778d7824ec7748ffdc27d1a6
 size 5368