Floyd93 commited on
Commit
b4cdb2b
·
verified ·
1 Parent(s): 3fc751f

End of training

Browse files
README.md CHANGED
@@ -15,9 +15,9 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model was trained from scratch on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 0.0440
19
- - Bleu: 20.0
20
- - Gen Len: 18.2937
21
 
22
  ## Model description
23
 
@@ -47,9 +47,9 @@ The following hyperparameters were used during training:
47
 
48
  ### Training results
49
 
50
- | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
51
- |:-------------:|:-----:|:-----:|:---------------:|:----:|:-------:|
52
- | 0.0618 | 1.0 | 12530 | 0.0440 | 20.0 | 18.2937 |
53
 
54
 
55
  ### Framework versions
 
15
 
16
  This model was trained from scratch on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 0.0380
19
+ - Bleu: 20.2965
20
+ - Gen Len: 18.2799
21
 
22
  ## Model description
23
 
 
47
 
48
  ### Training results
49
 
50
+ | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
51
+ |:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|
52
+ | 0.0527 | 1.0 | 12530 | 0.0380 | 20.2965 | 18.2799 |
53
 
54
 
55
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:973195e57167fa1f5f13aaff8a5fba272db261ccad6cf056f44591e6184a80b5
3
  size 242041896
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9153a53889c6473d454a88b0e57bac0989c86a05997610f7493b183a6fe06e7d
3
  size 242041896
runs/Jan31_11-07-07_gemini-2.lyon.grid5000.fr/events.out.tfevents.1706695628.gemini-2.lyon.grid5000.fr.9392.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f6eeca4e5b55b029811555bbc789b9ce5d3caca358451a1af2a9a8631cea99a2
3
- size 9421
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:50cfb52111bfcde840b73e77490cfe20d9d50a3df8bcbf8286b5012d87badf83
3
+ size 10145