End of training

Files changed (9) hide show

README.md CHANGED Viewed

@@ -20,12 +20,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/m2m100_418M](https://huggingface.co/facebook/m2m100_418M) on the None dataset.
 It achieves the following results on the evaluation set:
-- Bleu: 0.8249
-- F1: 0.9232
-- Wer: 0.0832
-- Cer: 0.0268
 - Meteor: 0.9148
-- Loss: 6.1040
 ## Model description
@@ -57,9 +57,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Bleu   | F1     | Wer    | Cer    | Meteor | Validation Loss |
 |:-------------:|:-----:|:-----:|:------:|:------:|:------:|:------:|:------:|:---------------:|
-| 6.1256        | 1.0   | 12500 | 0.7995 | 0.9118 | 0.0952 | 0.0304 | 0.9020 | 6.1150          |
-| 6.1178        | 2.0   | 25000 | 0.8172 | 0.9199 | 0.0870 | 0.0282 | 0.9109 | 6.1073          |
-| 6.1012        | 3.0   | 37500 | 0.8249 | 0.9232 | 0.0832 | 0.0268 | 0.9148 | 6.1040          |
 ### Framework versions

 This model is a fine-tuned version of [facebook/m2m100_418M](https://huggingface.co/facebook/m2m100_418M) on the None dataset.
 It achieves the following results on the evaluation set:
+- Bleu: 0.8239
+- F1: 0.9229
+- Wer: 0.0824
+- Cer: 0.0262
 - Meteor: 0.9148
+- Loss: 6.1042
 ## Model description
 | Training Loss | Epoch | Step  | Bleu   | F1     | Wer    | Cer    | Meteor | Validation Loss |
 |:-------------:|:-----:|:-----:|:------:|:------:|:------:|:------:|:------:|:---------------:|
+| 6.1256        | 1.0   | 12500 | 0.7992 | 0.9121 | 0.0950 | 0.0308 | 0.9022 | 6.1147          |
+| 6.1187        | 2.0   | 25000 | 0.8172 | 0.9198 | 0.0868 | 0.0281 | 0.9112 | 6.1067          |
+| 6.0999        | 3.0   | 37500 | 0.8239 | 0.9229 | 0.0824 | 0.0262 | 0.9148 | 6.1042          |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:df925e911da24f615a4d68b215c318fa2965e08757af878cf2d246e3c44f46fc
 size 9457520

 version https://git-lfs.github.com/spec/v1
+oid sha256:623178331108dd55759c25e0b0c82afe9518f980c8d9591c61f9af36582850cc
 size 9457520

all_results.json CHANGED Viewed

@@ -1,17 +1,17 @@
 {
     "epoch": 3.0,
-    "eval_bleu": 0.824917705791878,
-    "eval_cer": 0.02681490368523881,
-    "eval_f1": 0.923196495238835,
-    "eval_loss": 6.103950500488281,
-    "eval_meteor": 0.9148204705294067,
-    "eval_runtime": 588.9014,
-    "eval_samples_per_second": 8.49,
-    "eval_steps_per_second": 0.267,
-    "eval_wer": 0.08322672667667196,
     "total_flos": 6.544636730199245e+17,
-    "train_loss": 6.1357289762369795,
-    "train_runtime": 6359.0647,
-    "train_samples_per_second": 188.702,
-    "train_steps_per_second": 5.897
 }

 {
     "epoch": 3.0,
+    "eval_bleu": 0.823943051750742,
+    "eval_cer": 0.02620693733959594,
+    "eval_f1": 0.9229451426010926,
+    "eval_loss": 6.104158401489258,
+    "eval_meteor": 0.914842353945872,
+    "eval_runtime": 585.9186,
+    "eval_samples_per_second": 8.534,
+    "eval_steps_per_second": 0.268,
+    "eval_wer": 0.08241131435383867,
     "total_flos": 6.544636730199245e+17,
+    "train_loss": 6.135559250488281,
+    "train_runtime": 6379.9802,
+    "train_samples_per_second": 188.084,
+    "train_steps_per_second": 5.878
 }

eval_final_results.json CHANGED Viewed

@@ -1,12 +1,12 @@
 {
     "epoch": 3.0,
-    "eval_bleu": 0.824917705791878,
-    "eval_cer": 0.02681490368523881,
-    "eval_f1": 0.923196495238835,
-    "eval_loss": 6.103950500488281,
-    "eval_meteor": 0.9148204705294067,
-    "eval_runtime": 588.9014,
-    "eval_samples_per_second": 8.49,
-    "eval_steps_per_second": 0.267,
-    "eval_wer": 0.08322672667667196
 }

 {
     "epoch": 3.0,
+    "eval_bleu": 0.823943051750742,
+    "eval_cer": 0.02620693733959594,
+    "eval_f1": 0.9229451426010926,
+    "eval_loss": 6.104158401489258,
+    "eval_meteor": 0.914842353945872,
+    "eval_runtime": 585.9186,
+    "eval_samples_per_second": 8.534,
+    "eval_steps_per_second": 0.268,
+    "eval_wer": 0.08241131435383867
 }

logs/events.out.tfevents.1748594726.c1279aa5eb8f.2392797.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:8dfdd73098d373d0a0a80419a87b84bd5fede6bdbe4d461c073d47b77297d5ab
+size 169393

logs/events.out.tfevents.1748601691.c1279aa5eb8f.2392797.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:73a5035d517561a4f79c0b6359c933fc48f844d074ceb09feb278c85c55a0ac5
+size 607

train_results.json CHANGED Viewed

@@ -1,8 +1,8 @@
 {
     "epoch": 3.0,
     "total_flos": 6.544636730199245e+17,
-    "train_loss": 6.1357289762369795,
-    "train_runtime": 6359.0647,
-    "train_samples_per_second": 188.702,
-    "train_steps_per_second": 5.897
 }

 {
     "epoch": 3.0,
     "total_flos": 6.544636730199245e+17,
+    "train_loss": 6.135559250488281,
+    "train_runtime": 6379.9802,
+    "train_samples_per_second": 188.084,
+    "train_steps_per_second": 5.878
 }

trainer_state.json CHANGED Viewed

The diff for this file is too large to render. See raw diff

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6c4657d63dc5ae7140d37135453e97b70890bf594b38d44b5f03bd7e75012e4c
 size 7864

 version https://git-lfs.github.com/spec/v1
+oid sha256:36abf2cd69f7eda3a79614183ba40bf29c0001c076aae486529f711fd65d9ab9
 size 7864