stulcrad committed
Commit a780843 · verified · 1 Parent(s): 0aa918b

End of training

README.md CHANGED
@@ -5,16 +5,12 @@ base_model: ufal/robeczech-base
 tags:
 - generated_from_trainer
 datasets:
-- stulcrad/CERED-2
+- generator
 metrics:
 - accuracy
-- f1
-- recall
 model-index:
 - name: Robeczech-CERED2
   results: []
-language:
-- cs
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -24,14 +20,14 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [ufal/robeczech-base](https://huggingface.co/ufal/robeczech-base) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.9004
-- Accuracy: 0.8819
-- Micro Precision: 0.8819
-- Micro Recall: 0.8819
-- Micro F1: 0.8819
-- Macro Precision: 0.8488
-- Macro Recall: 0.8326
-- Macro F1: 0.8379
+- Loss: 1.1300
+- Accuracy: 0.8985
+- Micro Precision: 0.8985
+- Micro Recall: 0.8985
+- Micro F1: 0.8985
+- Macro Precision: 0.8711
+- Macro Recall: 0.8608
+- Macro F1: 0.8632
 
 ## Model description
 
@@ -50,31 +46,32 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 0.0001
+- learning_rate: 1e-05
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 - gradient_accumulation_steps: 2
 - total_train_batch_size: 32
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
-- lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 1000
+- lr_scheduler_type: cosine
+- lr_scheduler_warmup_steps: 1500
 - num_epochs: 10
+- label_smoothing_factor: 0.1
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | Micro Precision | Micro Recall | Micro F1 | Macro Precision | Macro Recall | Macro F1 |
 |:-------------:|:------:|:------:|:---------------:|:--------:|:---------------:|:------------:|:--------:|:---------------:|:------------:|:--------:|
-| 0.5518 | 1.0000 | 11305 | 0.5227 | 0.8496 | 0.8496 | 0.8496 | 0.8496 | 0.8293 | 0.7648 | 0.7779 |
-| 0.4797 | 2.0 | 22611 | 0.4742 | 0.8623 | 0.8623 | 0.8623 | 0.8623 | 0.8191 | 0.8141 | 0.8052 |
-| 0.369 | 3.0000 | 33916 | 0.4886 | 0.8684 | 0.8684 | 0.8684 | 0.8684 | 0.8493 | 0.8094 | 0.8198 |
-| 0.308 | 4.0 | 45222 | 0.4829 | 0.8685 | 0.8685 | 0.8685 | 0.8685 | 0.8347 | 0.8231 | 0.8228 |
-| 0.2395 | 5.0000 | 56527 | 0.4928 | 0.8755 | 0.8755 | 0.8755 | 0.8755 | 0.8300 | 0.8326 | 0.8265 |
-| 0.1852 | 6.0 | 67833 | 0.5186 | 0.8799 | 0.8799 | 0.8799 | 0.8799 | 0.8528 | 0.8385 | 0.8401 |
-| 0.1353 | 7.0000 | 79138 | 0.5951 | 0.8809 | 0.8809 | 0.8809 | 0.8809 | 0.8419 | 0.8419 | 0.8377 |
-| 0.0945 | 8.0 | 90444 | 0.6848 | 0.8847 | 0.8847 | 0.8847 | 0.8847 | 0.8510 | 0.8478 | 0.8438 |
-| 0.0551 | 9.0000 | 101749 | 0.7723 | 0.8867 | 0.8867 | 0.8867 | 0.8867 | 0.8469 | 0.8440 | 0.8405 |
-| 0.0319 | 9.9996 | 113050 | 0.8430 | 0.8882 | 0.8882 | 0.8882 | 0.8882 | 0.8492 | 0.8487 | 0.8448 |
+| 1.1585 | 1.0000 | 11305 | 1.1208 | 0.8608 | 0.8608 | 0.8608 | 0.8608 | 0.8155 | 0.7878 | 0.7914 |
+| 1.0617 | 2.0 | 22611 | 1.0567 | 0.8873 | 0.8873 | 0.8873 | 0.8873 | 0.8547 | 0.8428 | 0.8430 |
+| 0.9804 | 3.0000 | 33916 | 1.0558 | 0.8900 | 0.8900 | 0.8900 | 0.8900 | 0.8546 | 0.8414 | 0.8438 |
+| 0.9327 | 4.0 | 45222 | 1.0585 | 0.8920 | 0.8920 | 0.8920 | 0.8920 | 0.8557 | 0.8475 | 0.8483 |
+| 0.8927 | 5.0000 | 56527 | 1.0820 | 0.8917 | 0.8917 | 0.8917 | 0.8917 | 0.8484 | 0.8499 | 0.8455 |
+| 0.861 | 6.0 | 67833 | 1.0774 | 0.8982 | 0.8982 | 0.8982 | 0.8982 | 0.8596 | 0.8567 | 0.8545 |
+| 0.8344 | 7.0000 | 79138 | 1.0987 | 0.8979 | 0.8979 | 0.8979 | 0.8979 | 0.8641 | 0.8558 | 0.8567 |
+| 0.8222 | 8.0 | 90444 | 1.1113 | 0.8991 | 0.8991 | 0.8991 | 0.8991 | 0.8639 | 0.8544 | 0.8558 |
+| 0.8096 | 9.0000 | 101749 | 1.1159 | 0.9001 | 0.9001 | 0.9001 | 0.9001 | 0.8584 | 0.8589 | 0.8552 |
+| 0.8071 | 9.9996 | 113050 | 1.1176 | 0.8994 | 0.8994 | 0.8994 | 0.8994 | 0.8561 | 0.8577 | 0.8539 |
 
 
 ### Framework versions
@@ -82,4 +79,4 @@ The following hyperparameters were used during training:
 - Transformers 4.46.2
 - Pytorch 2.5.1+cu124
 - Datasets 3.1.0
-- Tokenizers 0.20.3
+- Tokenizers 0.20.3
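As a sanity check on the diff above, the step counts in the results table follow from the listed batch settings. A minimal sketch in plain Python; the training-set size is inferred from the table, not stated in the card, so treat it as an approximation:

```python
# Relate the listed hyperparameters to the step counts in the results table.
train_batch_size = 16
gradient_accumulation_steps = 2
total_train_batch_size = train_batch_size * gradient_accumulation_steps  # 32

steps_per_epoch = 11305          # table: epoch 1.0 ends at step 11305
num_epochs = 10
total_steps = steps_per_epoch * num_epochs  # 113050, the final row's step

# Implied training-set size (approximate; the last batch may be partial,
# hence the fractional final epoch of 9.9996 in the table).
approx_train_examples = steps_per_epoch * total_train_batch_size
```

With a linear-to-cosine scheduler change and warmup of 1500 steps, warmup covers only about 1.3% of the 113,050 total optimization steps.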
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8247b40a616cc12f40ffab8dc6e14923b33d50ba56b479a9e183903a576c0ddc
+oid sha256:d3e0bdb9de9c80cb0df8773197590fd1aee7bb8ba1464e4649b248b791a39f34
 size 504532408
runs/Jul01_11-28-52_n30/events.out.tfevents.1751362137.n30.3629418.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:387b61d1e8fa2b3b280f4061f0ac2fb6c7bcf5c61fcb73076db5cef623e07fd7
+size 65364
runs/Jul01_11-28-52_n30/events.out.tfevents.1751380515.n30.3629418.1 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:17eab353e581726fa9bf9f36cfb0432c2f6e29436d64ec6b64fdf38c965fb811
+size 757
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d586a8ca17d4e6ed9c5439e93cccba6929c21065519d7962a729e2c00e5b7242
+oid sha256:05a75e87d1a13becf36da38f50338924ea1a55b1e73114f468d8f01f0d4c7de1
 size 5304
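One detail of the README metrics worth noting: in both the old and new versions of the card, Accuracy, Micro Precision, Micro Recall, and Micro F1 are identical (e.g. all 0.8985). That is expected for single-label multiclass classification, where micro-averaging reduces to plain accuracy. A small self-contained check on toy labels (hypothetical, for illustration only):

```python
# In single-label multiclass classification every prediction contributes
# one TP (if correct), or one FP for the predicted class plus one FN for
# the true class, so the micro-averaged totals all reduce to accuracy.
y_true = [0, 1, 2, 2, 1, 0, 2]
y_pred = [0, 2, 2, 2, 1, 0, 1]

labels = sorted(set(y_true) | set(y_pred))
tp = {c: 0 for c in labels}
fp = {c: 0 for c in labels}
fn = {c: 0 for c in labels}
for t, p in zip(y_true, y_pred):
    if t == p:
        tp[p] += 1
    else:
        fp[p] += 1
        fn[t] += 1

TP, FP, FN = sum(tp.values()), sum(fp.values()), sum(fn.values())
micro_precision = TP / (TP + FP)
micro_recall = TP / (TP + FN)
micro_f1 = 2 * micro_precision * micro_recall / (micro_precision + micro_recall)
accuracy = TP / len(y_true)
# micro_precision == micro_recall == micro_f1 == accuracy
```

The macro-averaged metrics do carry extra information, since they weight each relation class equally regardless of frequency.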