alvations commited on
Commit
1a3f788
·
1 Parent(s): ad24e7f

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -11
README.md CHANGED
@@ -2,6 +2,8 @@
2
  license: apache-2.0
3
  tags:
4
  - generated_from_trainer
 
 
5
  model-index:
6
  - name: mt5-aym-zero
7
  results: []
@@ -12,17 +14,12 @@ should probably proofread and complete it, then remove this comment. -->
12
 
13
  # mt5-aym-zero
14
 
15
- This model is a fine-tuned version of [google/mt5-base](https://huggingface.co/google/mt5-base) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
- - eval_loss: 0.1782
18
- - eval_chrf: 24.0328
19
- - eval_bleu: 2.744
20
- - eval_gen_len: 17.2126
21
- - eval_runtime: 84.865
22
- - eval_samples_per_second: 11.748
23
- - eval_steps_per_second: 1.473
24
- - epoch: 41.05
25
- - step: 160000
26
 
27
  ## Model description
28
 
@@ -48,7 +45,14 @@ The following hyperparameters were used during training:
48
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
  - lr_scheduler_type: linear
50
  - lr_scheduler_warmup_steps: 500
51
- - training_steps: 200000
 
 
 
 
 
 
 
52
 
53
  ### Framework versions
54
 
 
2
  license: apache-2.0
3
  tags:
4
  - generated_from_trainer
5
+ metrics:
6
+ - bleu
7
  model-index:
8
  - name: mt5-aym-zero
9
  results: []
 
14
 
15
  # mt5-aym-zero
16
 
17
+ This model is a fine-tuned version of [alvations/mt5-aym-zero](https://huggingface.co/alvations/mt5-aym-zero) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 0.1892
20
+ - Chrf: 23.9471
21
+ - Bleu: 2.9963
22
+ - Gen Len: 17.2839
 
 
 
 
 
23
 
24
  ## Model description
25
 
 
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: linear
47
  - lr_scheduler_warmup_steps: 500
48
+ - training_steps: 20000
49
+
50
+ ### Training results
51
+
52
+ | Training Loss | Epoch | Step | Validation Loss | Chrf | Bleu | Gen Len |
53
+ |:-------------:|:-----:|:-----:|:---------------:|:-------:|:------:|:-------:|
54
+ | 0.0658 | 5.13 | 20000 | 0.1892 | 23.9471 | 2.9963 | 17.2839 |
55
+
56
 
57
  ### Framework versions
58