BoghdadyJR committed · Commit 1127713 · verified · 1 parent: e0529ca

Final implementation

README.md CHANGED
@@ -13,13 +13,16 @@ model-index:
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->

+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/boghdady95/huggingface/runs/i9kdffyd)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/boghdady95/huggingface/runs/i9kdffyd)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/boghdady95/huggingface/runs/i9kdffyd)
  # ar-en-model

  This model is a fine-tuned version of [Helsinki-NLP/opus-mt-ar-en](https://huggingface.co/Helsinki-NLP/opus-mt-ar-en) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.2988
- - Bleu: 31.7475
- - Gen Len: 33.5269
+ - Loss: 0.1237
+ - Bleu: 61.7434
+ - Gen Len: 31.2413

  ## Model description

@@ -44,28 +47,21 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 10
+ - num_epochs: 3
  - mixed_precision_training: Native AMP

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
- | No log | 1.0 | 312 | 0.3562 | 25.2145 | 33.397 |
- | 0.6311 | 2.0 | 624 | 0.3311 | 27.4389 | 32.9487 |
- | 0.6311 | 3.0 | 936 | 0.3178 | 28.887 | 33.6215 |
- | 0.2523 | 4.0 | 1248 | 0.3102 | 29.9143 | 33.648 |
- | 0.219 | 5.0 | 1560 | 0.3057 | 30.6256 | 33.4651 |
- | 0.219 | 6.0 | 1872 | 0.3028 | 30.9421 | 33.4459 |
- | 0.2025 | 7.0 | 2184 | 0.3008 | 31.5297 | 33.8605 |
- | 0.2025 | 8.0 | 2496 | 0.2996 | 31.8919 | 33.8492 |
- | 0.1895 | 9.0 | 2808 | 0.2989 | 31.7071 | 33.854 |
- | 0.182 | 10.0 | 3120 | 0.2988 | 31.7475 | 33.5269 |
+ | 0.1394 | 1.0 | 1250 | 0.1527 | 63.1436 | 30.9039 |
+ | 0.1215 | 2.0 | 2500 | 0.1507 | 63.0988 | 30.8535 |
+ | 0.1122 | 3.0 | 3750 | 0.1508 | 63.1177 | 30.8768 |


  ### Framework versions

  - Transformers 4.42.3
- - Pytorch 2.2.1+cu121
+ - Pytorch 2.1.2
  - Datasets 2.20.0
  - Tokenizers 0.19.1
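
The three-epoch results table added to README.md can be read programmatically, for example to check which epoch scored best. The following is a minimal sketch and not part of the commit: the helper name `parse_results_table` is hypothetical, the rows are copied from the added table, and the parser assumes every cell is numeric (so it would not handle the `No log` entry in the removed table).

```python
def parse_results_table(markdown: str) -> list[dict[str, float]]:
    """Parse a pipe-delimited Markdown table into a list of row dicts.

    Hypothetical helper for illustration; assumes all cells are numeric.
    """
    lines = [ln.strip() for ln in markdown.strip().splitlines() if ln.strip()]
    header = [cell.strip() for cell in lines[0].strip("|").split("|")]
    rows = []
    for line in lines[2:]:  # skip the header row and the alignment row
        cells = [cell.strip() for cell in line.strip("|").split("|")]
        rows.append({name: float(value) for name, value in zip(header, cells)})
    return rows


# Rows copied from the table added in this commit.
TABLE = """
| Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
| 0.1394 | 1.0 | 1250 | 0.1527 | 63.1436 | 30.9039 |
| 0.1215 | 2.0 | 2500 | 0.1507 | 63.0988 | 30.8535 |
| 0.1122 | 3.0 | 3750 | 0.1508 | 63.1177 | 30.8768 |
"""

rows = parse_results_table(TABLE)
best_bleu = max(rows, key=lambda r: r["Bleu"])
lowest_loss = min(rows, key=lambda r: r["Validation Loss"])
print(best_bleu["Epoch"], lowest_loss["Epoch"])  # prints: 1.0 2.0
```

On these rows BLEU peaks at epoch 1 and validation loss is lowest at epoch 2, with all three epochs within 0.05 BLEU of each other.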
runs/Aug04_21-48-17_6a18a6ce128c/events.out.tfevents.1722808099.6a18a6ce128c.34.2 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:b9feb08ca02df29593811f93a94a8d9f7ee31a0be68289a74c18480012426898
- size 7973
+ oid sha256:ababcdfd78156350d82fcbd70c54c6a33f4d7690d27978ac2786dcd00e9149b0
+ size 8697
runs/Aug04_21-48-17_6a18a6ce128c/events.out.tfevents.1722813803.6a18a6ce128c.34.3 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:68906d4ce330508eab166940c072773aa2df8a8ca753d449c358c03e691db9cc
+ size 458
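
The TensorBoard event files in this commit are stored as Git LFS pointers: three `key value` lines giving the spec version, a SHA-256 object ID, and the object size in bytes. A minimal sketch of reading such a pointer follows; the function name `parse_lfs_pointer` is an illustrative assumption and not a Git LFS API, and the pointer text is the one added for the new event file.

```python
def parse_lfs_pointer(text: str) -> dict[str, object]:
    """Split a Git LFS pointer file into its fields.

    Hypothetical helper for illustration; expects `version`, `oid`, and `size` lines.
    """
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")  # each line is "key value"
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")  # e.g. "sha256:<hex digest>"
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "oid": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the added event file in this commit.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:68906d4ce330508eab166940c072773aa2df8a8ca753d449c358c03e691db9cc
size 458
"""

info = parse_lfs_pointer(POINTER)
print(info["hash_algo"], info["size"])  # prints: sha256 458
```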