Yanmife committed on
Commit fa8fadf · verified · 1 Parent(s): 4a2073f

Yanmife/nllb-menyo

README.md CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 2.0255
+ - Loss: 2.0240
 
  ## Model description
 
@@ -53,70 +53,70 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-------:|:-----:|:---------------:|
- | 4.7665 | 0.7663 | 200 | 4.0682 |
- | 3.428 | 1.5326 | 400 | 2.8849 |
- | 2.5029 | 2.2989 | 600 | 2.3294 |
- | 2.287 | 3.0651 | 800 | 2.2594 |
- | 2.2178 | 3.8314 | 1000 | 2.2211 |
- | 2.1698 | 4.5977 | 1200 | 2.1918 |
- | 2.143 | 5.3640 | 1400 | 2.1696 |
- | 2.1153 | 6.1303 | 1600 | 2.1555 |
- | 2.086 | 6.8966 | 1800 | 2.1394 |
- | 2.0733 | 7.6628 | 2000 | 2.1277 |
- | 2.0539 | 8.4291 | 2200 | 2.1257 |
- | 2.0394 | 9.1954 | 2400 | 2.1141 |
- | 2.026 | 9.9617 | 2600 | 2.1131 |
- | 2.0133 | 10.7280 | 2800 | 2.1080 |
- | 1.9941 | 11.4943 | 3000 | 2.1034 |
- | 2.0018 | 12.2605 | 3200 | 2.0965 |
- | 1.9825 | 13.0268 | 3400 | 2.0905 |
- | 1.9702 | 13.7931 | 3600 | 2.0859 |
- | 1.9621 | 14.5594 | 3800 | 2.0831 |
- | 1.9546 | 15.3257 | 4000 | 2.0820 |
- | 1.9411 | 16.0920 | 4200 | 2.0751 |
- | 1.9371 | 16.8582 | 4400 | 2.0721 |
- | 1.9299 | 17.6245 | 4600 | 2.0687 |
- | 1.9185 | 18.3908 | 4800 | 2.0665 |
- | 1.9083 | 19.1571 | 5000 | 2.0596 |
- | 1.911 | 19.9234 | 5200 | 2.0590 |
- | 1.906 | 20.6897 | 5400 | 2.0557 |
- | 1.8773 | 21.4559 | 5600 | 2.0578 |
- | 1.8932 | 22.2222 | 5800 | 2.0531 |
- | 1.8766 | 22.9885 | 6000 | 2.0487 |
- | 1.874 | 23.7548 | 6200 | 2.0501 |
- | 1.8799 | 24.5211 | 6400 | 2.0454 |
- | 1.862 | 25.2874 | 6600 | 2.0439 |
- | 1.855 | 26.0536 | 6800 | 2.0429 |
- | 1.8544 | 26.8199 | 7000 | 2.0412 |
- | 1.8553 | 27.5862 | 7200 | 2.0397 |
- | 1.8517 | 28.3525 | 7400 | 2.0378 |
- | 1.8466 | 29.1188 | 7600 | 2.0366 |
- | 1.8409 | 29.8851 | 7800 | 2.0354 |
- | 1.8431 | 30.6513 | 8000 | 2.0348 |
- | 1.8234 | 31.4176 | 8200 | 2.0375 |
- | 1.8375 | 32.1839 | 8400 | 2.0318 |
- | 1.8263 | 32.9502 | 8600 | 2.0307 |
- | 1.8261 | 33.7165 | 8800 | 2.0302 |
- | 1.8223 | 34.4828 | 9000 | 2.0298 |
- | 1.8135 | 35.2490 | 9200 | 2.0305 |
- | 1.8251 | 36.0153 | 9400 | 2.0286 |
- | 1.8154 | 36.7816 | 9600 | 2.0285 |
- | 1.8151 | 37.5479 | 9800 | 2.0281 |
- | 1.813 | 38.3142 | 10000 | 2.0275 |
- | 1.8185 | 39.0805 | 10200 | 2.0266 |
- | 1.8147 | 39.8467 | 10400 | 2.0259 |
- | 1.8104 | 40.6130 | 10600 | 2.0255 |
- | 1.8067 | 41.3793 | 10800 | 2.0246 |
- | 1.8095 | 42.1456 | 11000 | 2.0274 |
- | 1.8042 | 42.9119 | 11200 | 2.0249 |
- | 1.8021 | 43.6782 | 11400 | 2.0257 |
- | 1.8076 | 44.4444 | 11600 | 2.0250 |
- | 1.8049 | 45.2107 | 11800 | 2.0260 |
- | 1.8071 | 45.9770 | 12000 | 2.0251 |
- | 1.8091 | 46.7433 | 12200 | 2.0250 |
- | 1.8027 | 47.5096 | 12400 | 2.0254 |
- | 1.8004 | 48.2759 | 12600 | 2.0254 |
- | 1.8087 | 49.0421 | 12800 | 2.0255 |
+ | 4.7656 | 0.7663 | 200 | 4.0671 |
+ | 3.4231 | 1.5326 | 400 | 2.8752 |
+ | 2.4958 | 2.2989 | 600 | 2.3280 |
+ | 2.2866 | 3.0651 | 800 | 2.2600 |
+ | 2.2179 | 3.8314 | 1000 | 2.2216 |
+ | 2.1699 | 4.5977 | 1200 | 2.1922 |
+ | 2.1431 | 5.3640 | 1400 | 2.1698 |
+ | 2.1148 | 6.1303 | 1600 | 2.1553 |
+ | 2.0856 | 6.8966 | 1800 | 2.1395 |
+ | 2.0729 | 7.6628 | 2000 | 2.1275 |
+ | 2.0532 | 8.4291 | 2200 | 2.1258 |
+ | 2.039 | 9.1954 | 2400 | 2.1135 |
+ | 2.025 | 9.9617 | 2600 | 2.1131 |
+ | 2.0123 | 10.7280 | 2800 | 2.1075 |
+ | 1.9932 | 11.4943 | 3000 | 2.1025 |
+ | 2.0002 | 12.2605 | 3200 | 2.0961 |
+ | 1.9817 | 13.0268 | 3400 | 2.0899 |
+ | 1.9688 | 13.7931 | 3600 | 2.0858 |
+ | 1.9606 | 14.5594 | 3800 | 2.0827 |
+ | 1.9531 | 15.3257 | 4000 | 2.0806 |
+ | 1.9392 | 16.0920 | 4200 | 2.0748 |
+ | 1.9354 | 16.8582 | 4400 | 2.0723 |
+ | 1.9283 | 17.6245 | 4600 | 2.0679 |
+ | 1.9166 | 18.3908 | 4800 | 2.0671 |
+ | 1.9065 | 19.1571 | 5000 | 2.0594 |
+ | 1.9092 | 19.9234 | 5200 | 2.0576 |
+ | 1.9041 | 20.6897 | 5400 | 2.0548 |
+ | 1.8753 | 21.4559 | 5600 | 2.0578 |
+ | 1.8914 | 22.2222 | 5800 | 2.0515 |
+ | 1.8742 | 22.9885 | 6000 | 2.0486 |
+ | 1.8716 | 23.7548 | 6200 | 2.0496 |
+ | 1.8775 | 24.5211 | 6400 | 2.0453 |
+ | 1.86 | 25.2874 | 6600 | 2.0424 |
+ | 1.8531 | 26.0536 | 6800 | 2.0420 |
+ | 1.8522 | 26.8199 | 7000 | 2.0397 |
+ | 1.8536 | 27.5862 | 7200 | 2.0388 |
+ | 1.8497 | 28.3525 | 7400 | 2.0364 |
+ | 1.8442 | 29.1188 | 7600 | 2.0353 |
+ | 1.8387 | 29.8851 | 7800 | 2.0337 |
+ | 1.8413 | 30.6513 | 8000 | 2.0330 |
+ | 1.8217 | 31.4176 | 8200 | 2.0358 |
+ | 1.8356 | 32.1839 | 8400 | 2.0306 |
+ | 1.8243 | 32.9502 | 8600 | 2.0289 |
+ | 1.8242 | 33.7165 | 8800 | 2.0294 |
+ | 1.8197 | 34.4828 | 9000 | 2.0276 |
+ | 1.8116 | 35.2490 | 9200 | 2.0281 |
+ | 1.8229 | 36.0153 | 9400 | 2.0274 |
+ | 1.8135 | 36.7816 | 9600 | 2.0271 |
+ | 1.8135 | 37.5479 | 9800 | 2.0270 |
+ | 1.8113 | 38.3142 | 10000 | 2.0264 |
+ | 1.8165 | 39.0805 | 10200 | 2.0253 |
+ | 1.8133 | 39.8467 | 10400 | 2.0244 |
+ | 1.8082 | 40.6130 | 10600 | 2.0236 |
+ | 1.8048 | 41.3793 | 10800 | 2.0230 |
+ | 1.8077 | 42.1456 | 11000 | 2.0257 |
+ | 1.8022 | 42.9119 | 11200 | 2.0237 |
+ | 1.8005 | 43.6782 | 11400 | 2.0244 |
+ | 1.806 | 44.4444 | 11600 | 2.0236 |
+ | 1.8028 | 45.2107 | 11800 | 2.0243 |
+ | 1.8053 | 45.9770 | 12000 | 2.0237 |
+ | 1.8074 | 46.7433 | 12200 | 2.0235 |
+ | 1.8009 | 47.5096 | 12400 | 2.0240 |
+ | 1.7992 | 48.2759 | 12600 | 2.0240 |
+ | 1.8069 | 49.0421 | 12800 | 2.0240 |
 
 
  ### Framework versions
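
The updated README reports an evaluation loss of 2.0240, which matches the last row of the updated table (step 12800) rather than its lowest validation loss. The sketch below illustrates that comparison using only the tail of the updated table. The `ROWS` list and both helper functions are illustrative names, not part of the repository, and the values are copied by hand from the diff.

```python
# (step, validation_loss) pairs copied from the last rows of the updated table.
ROWS = [
    (10600, 2.0236),
    (10800, 2.0230),
    (11000, 2.0257),
    (11200, 2.0237),
    (11400, 2.0244),
    (11600, 2.0236),
    (11800, 2.0243),
    (12000, 2.0237),
    (12200, 2.0235),
    (12400, 2.0240),
    (12600, 2.0240),
    (12800, 2.0240),
]


def best_checkpoint(rows):
    """Return the (step, loss) pair with the lowest validation loss."""
    return min(rows, key=lambda row: row[1])


def final_checkpoint(rows):
    """Return the (step, loss) pair with the highest step number."""
    return max(rows, key=lambda row: row[0])


best_step, best_loss = best_checkpoint(ROWS)
last_step, last_loss = final_checkpoint(ROWS)
print(f"lowest validation loss: {best_loss:.4f} at step {best_step}")
print(f"final validation loss:  {last_loss:.4f} at step {last_step}")
```

Among these rows the lowest validation loss is at step 10800, while the headline figure corresponds to the final step. Whether an earlier checkpoint was saved or would generalize better cannot be determined from the diff alone.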
runs/Oct02_20-44-56_04354b73b469/events.out.tfevents.1759437898.04354b73b469.19.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:b4cd6f808c9e9c627dcbee007dbfb9cc341909dd2db5c2a9512d075011e6fcef
- size 35526
+ oid sha256:69444963531cec4e3504b6215d2f8b91ec63474200c5055ab9a677301c1781c9
+ size 36362
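
The TensorBoard event file is stored through Git LFS, so the diff only shows its pointer: a spec version line, a SHA-256 object ID, and a byte size. As a minimal sketch, a downloaded copy of the file could be checked against the new pointer as follows. The function names are hypothetical, and how the file bytes are obtained is left to the reader.

```python
import hashlib

# Pointer text for the updated event file, copied from the diff.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:69444963531cec4e3504b6215d2f8b91ec63474200c5055ab9a677301c1781c9
size 36362
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer into its version, hash algorithm, digest and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "oid": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(pointer: dict, data: bytes) -> bool:
    """Check that the bytes have the size and SHA-256 digest the pointer records."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    same_size = len(data) == pointer["size"]
    same_hash = hashlib.sha256(data).hexdigest() == pointer["oid"]
    return same_size and same_hash


pointer = parse_lfs_pointer(NEW_POINTER)
print(pointer["algo"], pointer["size"])
```

Reading the downloaded file in binary mode and passing its contents to `matches_pointer` would then confirm whether the local copy is the 36362-byte object this commit refers to.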