End of training
README.md CHANGED

@@ -20,14 +20,14 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [ufal/robeczech-base](https://huggingface.co/ufal/robeczech-base) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.
-- Accuracy: 0.
-- Micro Precision: 0.
-- Micro Recall: 0.
-- Micro F1: 0.
-- Macro Precision: 0.
-- Macro Recall: 0.
-- Macro F1: 0.
+- Loss: 0.9004
+- Accuracy: 0.8819
+- Micro Precision: 0.8819
+- Micro Recall: 0.8819
+- Micro F1: 0.8819
+- Macro Precision: 0.8488
+- Macro Recall: 0.8326
+- Macro F1: 0.8379
 
 ## Model description
 
@@ -46,14 +46,14 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate:
+- learning_rate: 0.0001
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 - gradient_accumulation_steps: 2
 - total_train_batch_size: 32
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
-- lr_scheduler_type:
+- lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 1000
 - num_epochs: 10
@@ -61,16 +61,16 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | Micro Precision | Micro Recall | Micro F1 | Macro Precision | Macro Recall | Macro F1 |
 |:-------------:|:------:|:------:|:---------------:|:--------:|:---------------:|:------------:|:--------:|:---------------:|:------------:|:--------:|
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
+| 0.5518 | 1.0000 | 11305 | 0.5227 | 0.8496 | 0.8496 | 0.8496 | 0.8496 | 0.8293 | 0.7648 | 0.7779 |
+| 0.4797 | 2.0 | 22611 | 0.4742 | 0.8623 | 0.8623 | 0.8623 | 0.8623 | 0.8191 | 0.8141 | 0.8052 |
+| 0.369 | 3.0000 | 33916 | 0.4886 | 0.8684 | 0.8684 | 0.8684 | 0.8684 | 0.8493 | 0.8094 | 0.8198 |
+| 0.308 | 4.0 | 45222 | 0.4829 | 0.8685 | 0.8685 | 0.8685 | 0.8685 | 0.8347 | 0.8231 | 0.8228 |
+| 0.2395 | 5.0000 | 56527 | 0.4928 | 0.8755 | 0.8755 | 0.8755 | 0.8755 | 0.8300 | 0.8326 | 0.8265 |
+| 0.1852 | 6.0 | 67833 | 0.5186 | 0.8799 | 0.8799 | 0.8799 | 0.8799 | 0.8528 | 0.8385 | 0.8401 |
+| 0.1353 | 7.0000 | 79138 | 0.5951 | 0.8809 | 0.8809 | 0.8809 | 0.8809 | 0.8419 | 0.8419 | 0.8377 |
+| 0.0945 | 8.0 | 90444 | 0.6848 | 0.8847 | 0.8847 | 0.8847 | 0.8847 | 0.8510 | 0.8478 | 0.8438 |
+| 0.0551 | 9.0000 | 101749 | 0.7723 | 0.8867 | 0.8867 | 0.8867 | 0.8867 | 0.8469 | 0.8440 | 0.8405 |
+| 0.0319 | 9.9996 | 113050 | 0.8430 | 0.8882 | 0.8882 | 0.8882 | 0.8882 | 0.8492 | 0.8487 | 0.8448 |
 
 
 ### Framework versions
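The card reports both micro- and macro-averaged metrics, and the four micro columns are identical to accuracy in every row. That is expected for single-label multiclass classification: each prediction contributes exactly one pooled false positive and one false negative when wrong, so micro precision, recall, and F1 all collapse to accuracy, while macro averaging weights each class equally. A minimal sketch of the two averaging schemes (the labels below are illustrative, not taken from this model's dataset):

```python
from collections import Counter

def micro_macro_f1(y_true, y_pred):
    """Compute micro- and macro-averaged F1 for single-label predictions."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class gains a false positive
            fn[t] += 1  # true class gains a false negative

    def f1(tp_, fp_, fn_):
        prec = tp_ / (tp_ + fp_) if tp_ + fp_ else 0.0
        rec = tp_ / (tp_ + fn_) if tp_ + fn_ else 0.0
        return 2 * prec * rec / (prec + rec) if prec + rec else 0.0

    # Micro: pool counts across classes -> equals accuracy for single-label tasks.
    micro = f1(sum(tp.values()), sum(fp.values()), sum(fn.values()))
    # Macro: unweighted mean of per-class F1 scores.
    macro = sum(f1(tp[c], fp[c], fn[c]) for c in labels) / len(labels)
    return micro, macro

# Hypothetical tags, just to exercise the two averages.
y_true = ["NOUN", "VERB", "NOUN", "ADJ", "VERB", "NOUN"]
y_pred = ["NOUN", "VERB", "VERB", "ADJ", "VERB", "NOUN"]
micro, macro = micro_macro_f1(y_true, y_pred)
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
```

Here `micro` equals `accuracy` exactly, which mirrors the identical Accuracy/Micro columns in the table above, while `macro` differs because the rarer class is weighted the same as the frequent ones.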
model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:8247b40a616cc12f40ffab8dc6e14923b33d50ba56b479a9e183903a576c0ddc
 size 504532408
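The weights themselves live in Git LFS; the commit only rewrites the pointer file, a three-line text stub recording the SHA-256 and byte size of the real content. A small sketch of building (or verifying) such a pointer for a local file, assuming the spec v1 format shown above (`demo.bin` is a throwaway example, not the actual checkpoint):

```python
import hashlib
import os

def lfs_pointer(path):
    """Build a Git LFS pointer (spec v1) for the file at `path`."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        # Hash in 1 MiB chunks so large checkpoints don't load into memory at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return (
        "version https://git-lfs.github.com/spec/v1\n"
        f"oid sha256:{h.hexdigest()}\n"
        f"size {os.path.getsize(path)}\n"
    )

# Demo with a tiny throwaway file.
with open("demo.bin", "wb") as f:
    f.write(b"hello")
print(lfs_pointer("demo.bin"))
```

Comparing the generated `oid` against the pointer stored in the repository is enough to confirm a downloaded weight file is intact.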
|
runs/Mar31_16-42-38_dgx10/events.out.tfevents.1743432162.dgx10.924525.2 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a85ba3ae830ba8c676057e0c7c93c3fb96b83a7debdb3b3a8c5e5f1b5b985ebf
+size 65367
runs/Mar31_16-42-38_dgx10/events.out.tfevents.1743446781.dgx10.924525.3 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b4da223ede9b2d4547784384aa3ca787f5f31523fa11f7c2481357dc27762ef7
+size 757
training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:d586a8ca17d4e6ed9c5439e93cccba6929c21065519d7962a729e2c00e5b7242
+size 5304
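`training_args.bin` is the serialized training configuration behind the hyperparameters in the README above: a linear scheduler with 1000 warmup steps, a peak learning rate of 0.0001, and roughly 113050 optimizer steps in total (per the last row of the eval table). A minimal sketch of what that schedule looks like, assuming the usual linear warmup followed by linear decay to zero (the helper name and step counts are illustrative, not read from the file):

```python
def linear_schedule_lr(step, peak_lr=1e-4, warmup_steps=1000, total_steps=113050):
    """Linear warmup to peak_lr over warmup_steps, then linear decay to 0."""
    if step < warmup_steps:
        return peak_lr * step / warmup_steps  # ramp up from 0
    remaining = max(0, total_steps - step)    # fraction of decay phase left
    return peak_lr * remaining / (total_steps - warmup_steps)

# Effective batch size reported in the card: per-device batch * accumulation.
train_batch_size = 16
gradient_accumulation_steps = 2
total_train_batch_size = train_batch_size * gradient_accumulation_steps  # 32
```

With warmup this short relative to the run, the schedule is near its peak for most of epoch 1 and then decays almost linearly across the remaining nine epochs.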