nails-tyobs commited on
Commit
4355328
·
verified ·
1 Parent(s): f91b967

Training complete

Browse files
README.md CHANGED
@@ -1,8 +1,6 @@
1
  ---
2
- license: cc-by-nc-4.0
3
- base_model: facebook/nllb-200-distilled-600M
4
  tags:
5
- - translation5
6
  - generated_from_trainer
7
  model-index:
8
  - name: checkpoint-2176
@@ -14,7 +12,7 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  # checkpoint-2176
16
 
17
- This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on the None dataset.
18
 
19
  ## Model description
20
 
@@ -41,11 +39,16 @@ The following hyperparameters were used during training:
41
  - total_train_batch_size: 32
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
44
- - num_epochs: 20
 
 
 
 
 
45
 
46
  ### Framework versions
47
 
48
  - Transformers 4.43.1
49
- - Pytorch 1.13.0
50
  - Datasets 2.16.1
51
  - Tokenizers 0.19.1
 
1
  ---
 
 
2
  tags:
3
+ - translation6
4
  - generated_from_trainer
5
  model-index:
6
  - name: checkpoint-2176
 
12
 
13
  # checkpoint-2176
14
 
15
+ This model was trained from scratch on the None dataset.
16
 
17
  ## Model description
18
 
 
39
  - total_train_batch_size: 32
40
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
41
  - lr_scheduler_type: linear
42
+ - num_epochs: 5
43
+ - mixed_precision_training: Native AMP
44
+
45
+ ### Training results
46
+
47
+
48
 
49
  ### Framework versions
50
 
51
  - Transformers 4.43.1
52
+ - Pytorch 2.3.0+cu121
53
  - Datasets 2.16.1
54
  - Tokenizers 0.19.1
runs/Sep06_12-50-57_3c1cff3d9c4c/events.out.tfevents.1725627073.3c1cff3d9c4c.9844.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9a15f84406dc9cef465560c4a1ac72574c2d956142aeec0322379ce34f09128e
3
- size 5542
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:653b0df4e095dbf8e37328f1c869fb6224eee0d8124d30b6adc279a791c6a187
3
+ size 5896