Training complete

Files changed (5)

README.md +78 -0
generation_config.json +16 -0
model.safetensors +1 -1
runs/Apr24_20-54-33_Kabelo/events.out.tfevents.1745520951.Kabelo.23792.0 +2 -2
runs/Apr24_20-54-33_Kabelo/events.out.tfevents.1745526196.Kabelo.23792.1 +3 -0

README.md ADDED

@@ -0,0 +1,78 @@

+---
+library_name: transformers
+license: apache-2.0
+base_model: Helsinki-NLP/opus-mt-af-en
+tags:
+- translation
+- generated_from_trainer
+metrics:
+- bleu
+model-index:
+- name: Af-En_update_pc
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# Af-En_update_pc
+This model is a fine-tuned version of [Helsinki-NLP/opus-mt-af-en](https://huggingface.co/Helsinki-NLP/opus-mt-af-en) on an unspecified dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.8319
+- Model Preparation Time: 0.0
+- Bleu: 50.6655
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 16
+- seed: 42
+- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: linear
+- num_epochs: 15
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Model Preparation Time | Bleu    |
+|:-------------:|:-----:|:-----:|:---------------:|:----------------------:|:-------:|
+| 1.5372        | 1.0   | 5105  | 1.9631          | 0.0                    | 36.5658 |
+| 1.2529        | 2.0   | 10210 | 1.8745          | 0.0                    | 46.3922 |
+| 1.0324        | 3.0   | 15315 | 1.7872          | 0.0                    | 47.6135 |
+| 0.8459        | 4.0   | 20420 | 1.6956          | 0.0                    | 49.7908 |
+| 0.6545        | 5.0   | 25525 | 1.6746          | 0.0                    | 50.3903 |
+| 0.5743        | 6.0   | 30630 | 1.6951          | 0.0                    | 50.8569 |
+| 0.4897        | 7.0   | 35735 | 1.6863          | 0.0                    | 50.6211 |
+| 0.4328        | 8.0   | 40840 | 1.7113          | 0.0                    | 50.6402 |
+| 0.3955        | 9.0   | 45945 | 1.7482          | 0.0                    | 50.7567 |
+| 0.3565        | 10.0  | 51050 | 1.7655          | 0.0                    | 50.8257 |
+| 0.3275        | 11.0  | 56155 | 1.7988          | 0.0                    | 50.5494 |
+| 0.2696        | 12.0  | 61260 | 1.8035          | 0.0                    | 50.6981 |
+| 0.2635        | 13.0  | 66365 | 1.8194          | 0.0                    | 50.6104 |
+| 0.2526        | 14.0  | 71470 | 1.8190          | 0.0                    | 50.6222 |
+| 0.2338        | 15.0  | 76575 | 1.8319          | 0.0                    | 50.6658 |
+### Framework versions
+- Transformers 4.51.3
+- Pytorch 2.6.0
+- Datasets 3.5.0
+- Tokenizers 0.21.1
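The training results table in this README can be checked with a few lines of arithmetic. The sketch below is not part of the commit. It copies the per-epoch values from the table and, assuming a single device with no gradient accumulation (the card states neither), estimates an upper bound on the training-set size from the step count. The names `ROWS`, `best_by_bleu`, and `best_by_val_loss` are illustrative.

```python
# Per-epoch values copied from the "Training results" table:
# (epoch, step, training loss, validation loss, BLEU)
ROWS = [
    (1, 5105, 1.5372, 1.9631, 36.5658),
    (2, 10210, 1.2529, 1.8745, 46.3922),
    (3, 15315, 1.0324, 1.7872, 47.6135),
    (4, 20420, 0.8459, 1.6956, 49.7908),
    (5, 25525, 0.6545, 1.6746, 50.3903),
    (6, 30630, 0.5743, 1.6951, 50.8569),
    (7, 35735, 0.4897, 1.6863, 50.6211),
    (8, 40840, 0.4328, 1.7113, 50.6402),
    (9, 45945, 0.3955, 1.7482, 50.7567),
    (10, 51050, 0.3565, 1.7655, 50.8257),
    (11, 56155, 0.3275, 1.7988, 50.5494),
    (12, 61260, 0.2696, 1.8035, 50.6981),
    (13, 66365, 0.2635, 1.8194, 50.6104),
    (14, 71470, 0.2526, 1.8190, 50.6222),
    (15, 76575, 0.2338, 1.8319, 50.6658),
]

TRAIN_BATCH_SIZE = 8  # train_batch_size from the hyperparameter list

steps_per_epoch = ROWS[0][1]

# Assumption: one device and no gradient accumulation, so one optimizer
# step consumes one batch. The last batch of an epoch may be partial,
# which makes this an upper bound rather than an exact count.
max_train_examples = steps_per_epoch * TRAIN_BATCH_SIZE

# Epochs with the highest BLEU and the lowest validation loss.
best_by_bleu = max(ROWS, key=lambda row: row[4])
best_by_val_loss = min(ROWS, key=lambda row: row[3])

print("steps per epoch:", steps_per_epoch)
print("max training examples:", max_train_examples)
print("best BLEU at epoch:", best_by_bleu[0])
print("lowest validation loss at epoch:", best_by_val_loss[0])
```

Under these assumptions the training set holds at most 40,840 sentence pairs. Validation loss is lowest at epoch 5 and BLEU peaks at epoch 6, while training loss keeps falling through epoch 15, so the final-epoch figures reported at the top of the card are not the best by either metric.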

generation_config.json ADDED

@@ -0,0 +1,16 @@

+{
+  "bad_words_ids": [
+    [
+      57444
+    ]
+  ],
+  "bos_token_id": 0,
+  "decoder_start_token_id": 57444,
+  "eos_token_id": 0,
+  "forced_eos_token_id": 0,
+  "max_length": 512,
+  "num_beams": 4,
+  "pad_token_id": 57444,
+  "renormalize_logits": true,
+  "transformers_version": "4.51.3"
+}
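The token IDs in this file are related to one another, which is easy to miss when reading the raw JSON. The sketch below is not part of the commit and does not load the model. It parses a copy of the file with Python's standard library and checks three properties visible in it: the padding token doubles as the decoder start token, that same token is banned from the output via `bad_words_ids`, and the forced end-of-sequence token equals the regular one. `CONFIG_TEXT`, `checks`, and `decoding_kwargs` are illustrative names.

```python
import json

# Copy of generation_config.json as added in this commit.
CONFIG_TEXT = """
{
  "bad_words_ids": [[57444]],
  "bos_token_id": 0,
  "decoder_start_token_id": 57444,
  "eos_token_id": 0,
  "forced_eos_token_id": 0,
  "max_length": 512,
  "num_beams": 4,
  "pad_token_id": 57444,
  "renormalize_logits": true,
  "transformers_version": "4.51.3"
}
"""

cfg = json.loads(CONFIG_TEXT)

checks = {
    # The decoder is started with the padding token.
    "pad_is_decoder_start": cfg["decoder_start_token_id"] == cfg["pad_token_id"],
    # The padding token is listed as a banned output sequence.
    "pad_is_banned": [cfg["pad_token_id"]] in cfg["bad_words_ids"],
    # The forced end-of-sequence token matches the regular one.
    "forced_eos_matches_eos": cfg["forced_eos_token_id"] == cfg["eos_token_id"],
}

# The settings that control decoding itself: beam width, length cap,
# and whether logits are renormalized after processing.
decoding_kwargs = {
    key: cfg[key] for key in ("num_beams", "max_length", "renormalize_logits")
}

print(checks)
print(decoding_kwargs)
```

The three decoding keys share their names with generation parameters in `transformers`, so a dictionary like `decoding_kwargs` could in principle be passed to `generate()`. When the file sits next to the model weights, the library normally picks these defaults up without extra code.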

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:735b6d6ca060fd6819a3ca778d98dd8d81f4c9d4c4e2ca58949527f96a4c77e2
+oid sha256:8cf66af7644bd4ac9e7bcd4eaaf56c465eaa0bed83507c52dc44ea56358b65b6
 size 299575816
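The `model.safetensors` entry is a Git LFS pointer, not the weights themselves: three `key value` lines giving the spec version, a SHA-256 object ID, and the size in bytes. As a minimal sketch (the helper name `parse_lfs_pointer` is hypothetical, and the two pointer texts are copied from the diff above), the old and new pointers can be compared like this:

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a small dictionary."""
    # Each line is "<key> <value>"; split on the first space only.
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    hash_algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


OLD_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:735b6d6ca060fd6819a3ca778d98dd8d81f4c9d4c4e2ca58949527f96a4c77e2
size 299575816
"""

NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:8cf66af7644bd4ac9e7bcd4eaaf56c465eaa0bed83507c52dc44ea56358b65b6
size 299575816
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)

# Same byte size but a different digest: the file contents changed
# while the file layout stayed the same size.
print("size unchanged:", old["size"] == new["size"])
print("digest changed:", old["digest"] != new["digest"])
```

The size is identical (299,575,816 bytes, about 299.6 MB) while the digest differs, which is what one would expect when fine-tuning changes the weight values but not the tensor shapes.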

runs/Apr24_20-54-33_Kabelo/events.out.tfevents.1745520951.Kabelo.23792.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:daf770155683065d19a9d304418f899ca1d22ef54bdca90dbbc16c4a51f02e36
-size 39466
+oid sha256:7954d3f46ca147921c90fa40fd22d03dbcfe8028caf57dfd7831f19cc041eb5c
+size 45125

runs/Apr24_20-54-33_Kabelo/events.out.tfevents.1745526196.Kabelo.23792.1 ADDED

@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:48292fd61cf94312ef3a2b3166375b21b5c18b90d104e2c3154217cc6e4344d3
+size 480