rbcurzon committed
Commit f7ffa90 · verified · 1 Parent(s): 2f7cc5f

End of training

Files changed (4):
  1. README.md +17 -22
  2. model.safetensors +1 -1
  3. special_tokens_map.json +10 -22
  4. training_args.bin +1 -1
README.md CHANGED
@@ -1,5 +1,7 @@
 ---
 library_name: transformers
+license: apache-2.0
+base_model: Helsinki-NLP/opus-mt-tc-bible-big-mul-mul
 tags:
 - generated_from_trainer
 model-index:
@@ -12,11 +14,10 @@ should probably proofread and complete it, then remove this comment. -->
 
 # opus-ph-ph
 
-This model was trained from scratch on an unknown dataset.
+This model is a fine-tuned version of [Helsinki-NLP/opus-mt-tc-bible-big-mul-mul](https://huggingface.co/Helsinki-NLP/opus-mt-tc-bible-big-mul-mul) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.9171
-- Bleu Global: 28.0878
-- Gen Len: 7.4973
+- Loss: 2.7528
+- Bleu Global: 28.4879
 
 ## Model description
 
@@ -35,31 +36,25 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 5e-06
-- train_batch_size: 64
-- eval_batch_size: 64
+- learning_rate: 2e-05
+- train_batch_size: 32
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 15
+- num_epochs: 10
 - mixed_precision_training: Native AMP
 
 ### Training results
 
-| Training Loss | Epoch | Step | Bleu Global | Gen Len | Validation Loss |
-|:-------------:|:-----:|:----:|:-----------:|:-------:|:---------------:|
-| 0.1176 | 1.0 | 634 | 27.2355 | 7.6117 | 2.3737 |
-| 0.1024 | 2.0 | 1268 | 28.4868 | 7.5173 | 2.4315 |
-| 0.0788 | 3.0 | 1902 | 28.2414 | 7.5385 | 2.5622 |
-| 0.0438 | 4.0 | 2536 | 27.6541 | 7.4658 | 2.6708 |
-| 0.0344 | 5.0 | 3170 | 28.4412 | 7.5115 | 2.6867 |
-| 0.0294 | 6.0 | 3804 | 28.9421 | 7.5008 | 2.7144 |
-| 0.0253 | 7.0 | 4438 | 28.5901 | 7.5542 | 2.8013 |
-| 0.0176 | 8.0 | 5072 | 28.4891 | 7.5348 | 2.8497 |
-| 0.0155 | 9.0 | 5706 | 28.5233 | 7.5419 | 2.8761 |
-| 0.014 | 10.0 | 6340 | 28.3278 | 7.5328 | 2.8908 |
-| 0.0167 | 11.0 | 6974 | 2.8892 | 28.2921 | 7.502 |
-| 0.0161 | 12.0 | 7608 | 2.9171 | 28.0878 | 7.4973 |
+| Training Loss | Epoch | Step | Validation Loss | Bleu Global |
+|:-------------:|:-----:|:----:|:---------------:|:-----------:|
+| 0.6364 | 1.0 | 1268 | 1.9412 | 25.9524 |
+| 0.1826 | 2.0 | 2536 | 2.2076 | 25.5630 |
+| 0.1035 | 3.0 | 3804 | 2.3956 | 28.8255 |
+| 0.06 | 4.0 | 5072 | 2.5487 | 28.1852 |
+| 0.0436 | 5.0 | 6340 | 2.6189 | 28.7365 |
+| 0.0298 | 6.0 | 7608 | 2.7528 | 28.4879 |
 
 
 ### Framework versions
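
The new card sets `lr_scheduler_type: linear` with a base learning rate of 2e-05 and, per the results table, 1268 optimizer steps per epoch over `num_epochs: 10`. Assuming the Trainer default of zero warmup steps (the card does not state a warmup), the schedule can be sketched as:

```python
def linear_lr(step: int, total_steps: int, base_lr: float = 2e-05,
              warmup_steps: int = 0) -> float:
    """Linear warmup then linear decay to zero, mirroring the shape of
    transformers' linear schedule (warmup assumed to be 0 here)."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0.0, float(total_steps - step))
    return base_lr * remaining / max(1.0, float(total_steps - warmup_steps))

# 1268 optimizer steps per epoch (from the results table) x 10 epochs
TOTAL_STEPS = 1268 * 10
```

At the midpoint (step 6340) the rate has halved to 1e-05, and it reaches zero at step 12680; the logged run stops at step 7608 (epoch 6), so the final epochs of the schedule were never reached.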
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7c668294176d0e54c71a9fed52fb651c63de9cb1fd76e09905257f23666927fb
+oid sha256:780483067b493ccfafec5ad0af65ad1b85e705df692d9d28dc3002eb18984e00
 size 991093820
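
The entries above are git-lfs pointer files, not the weights themselves: `oid sha256:` records the SHA-256 digest of the actual blob and `size` its byte length. A minimal stdlib sketch (the helper name is mine) for checking a downloaded file against the pointer's `oid`:

```python
import hashlib
import tempfile

def sha256_hex(path: str, chunk_size: int = 1 << 20) -> str:
    """Stream a file through SHA-256; git-lfs stores this digest as 'oid'."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

# Demo against a throwaway file (stand-in for model.safetensors):
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"hello")
digest = sha256_hex(tmp.name)
```

A digest that does not match the pointer's `oid` (or a size mismatch) indicates a truncated or corrupted download of the LFS object.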
special_tokens_map.json CHANGED
@@ -1,26 +1,14 @@
 {
   "additional_special_tokens": [
-    ">>mdh<<"
+    {
+      "content": ">>mdh<<",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false
+    }
   ],
-  "eos_token": {
-    "content": "</s>",
-    "lstrip": false,
-    "normalized": false,
-    "rstrip": false,
-    "single_word": false
-  },
-  "pad_token": {
-    "content": "<pad>",
-    "lstrip": false,
-    "normalized": false,
-    "rstrip": false,
-    "single_word": false
-  },
-  "unk_token": {
-    "content": "<unk>",
-    "lstrip": false,
-    "normalized": false,
-    "rstrip": false,
-    "single_word": false
-  }
+  "eos_token": "</s>",
+  "pad_token": "<pad>",
+  "unk_token": "<unk>"
 }
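
The diff above swaps the verbose `AddedToken`-style dicts for `eos_token`/`pad_token`/`unk_token` down to bare strings, while the `>>mdh<<` language token gains the dict form; transformers accepts either shape when loading a tokenizer. A small stdlib sketch (the `token_content` helper is illustrative) of reading either serialization:

```python
import json

# The post-commit shape of special_tokens_map.json, as shown in the diff.
new_form = json.loads("""
{
  "additional_special_tokens": [
    {"content": ">>mdh<<", "lstrip": false, "normalized": false,
     "rstrip": false, "single_word": false}
  ],
  "eos_token": "</s>",
  "pad_token": "<pad>",
  "unk_token": "<unk>"
}
""")

def token_content(entry):
    """A special token may be serialized as a bare string or as an
    AddedToken-style dict; return the token text in either case."""
    return entry if isinstance(entry, str) else entry["content"]
```

The dict form additionally carries matching flags (`lstrip`, `normalized`, `rstrip`, `single_word`); the bare-string form leaves those at the tokenizer's defaults.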
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:73eda5ee9cf32ff01a29e8c1552a48af8854603707302725e40ddbf82d6070d5
+oid sha256:b9a4da7d1f7805ac730e39a13e1e0ee863442f98930e123242538858ecf99ebf
 size 5905