Yanmife/nllb-menyo
README.md
CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.
+- Loss: 2.0240
 
 ## Model description
 
@@ -53,70 +53,70 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-------:|:-----:|:---------------:|
-| 4.
-| 3.
-| 2.
-| 2.
-| 2.
-| 2.
-| 2.
-| 2.
-| 2.
-| 2.
-| 2.
-| 2.
-| 2.
-| 2.
-| 1.
-| 2.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
+| 4.7656 | 0.7663 | 200 | 4.0671 |
+| 3.4231 | 1.5326 | 400 | 2.8752 |
+| 2.4958 | 2.2989 | 600 | 2.3280 |
+| 2.2866 | 3.0651 | 800 | 2.2600 |
+| 2.2179 | 3.8314 | 1000 | 2.2216 |
+| 2.1699 | 4.5977 | 1200 | 2.1922 |
+| 2.1431 | 5.3640 | 1400 | 2.1698 |
+| 2.1148 | 6.1303 | 1600 | 2.1553 |
+| 2.0856 | 6.8966 | 1800 | 2.1395 |
+| 2.0729 | 7.6628 | 2000 | 2.1275 |
+| 2.0532 | 8.4291 | 2200 | 2.1258 |
+| 2.039 | 9.1954 | 2400 | 2.1135 |
+| 2.025 | 9.9617 | 2600 | 2.1131 |
+| 2.0123 | 10.7280 | 2800 | 2.1075 |
+| 1.9932 | 11.4943 | 3000 | 2.1025 |
+| 2.0002 | 12.2605 | 3200 | 2.0961 |
+| 1.9817 | 13.0268 | 3400 | 2.0899 |
+| 1.9688 | 13.7931 | 3600 | 2.0858 |
+| 1.9606 | 14.5594 | 3800 | 2.0827 |
+| 1.9531 | 15.3257 | 4000 | 2.0806 |
+| 1.9392 | 16.0920 | 4200 | 2.0748 |
+| 1.9354 | 16.8582 | 4400 | 2.0723 |
+| 1.9283 | 17.6245 | 4600 | 2.0679 |
+| 1.9166 | 18.3908 | 4800 | 2.0671 |
+| 1.9065 | 19.1571 | 5000 | 2.0594 |
+| 1.9092 | 19.9234 | 5200 | 2.0576 |
+| 1.9041 | 20.6897 | 5400 | 2.0548 |
+| 1.8753 | 21.4559 | 5600 | 2.0578 |
+| 1.8914 | 22.2222 | 5800 | 2.0515 |
+| 1.8742 | 22.9885 | 6000 | 2.0486 |
+| 1.8716 | 23.7548 | 6200 | 2.0496 |
+| 1.8775 | 24.5211 | 6400 | 2.0453 |
+| 1.86 | 25.2874 | 6600 | 2.0424 |
+| 1.8531 | 26.0536 | 6800 | 2.0420 |
+| 1.8522 | 26.8199 | 7000 | 2.0397 |
+| 1.8536 | 27.5862 | 7200 | 2.0388 |
+| 1.8497 | 28.3525 | 7400 | 2.0364 |
+| 1.8442 | 29.1188 | 7600 | 2.0353 |
+| 1.8387 | 29.8851 | 7800 | 2.0337 |
+| 1.8413 | 30.6513 | 8000 | 2.0330 |
+| 1.8217 | 31.4176 | 8200 | 2.0358 |
+| 1.8356 | 32.1839 | 8400 | 2.0306 |
+| 1.8243 | 32.9502 | 8600 | 2.0289 |
+| 1.8242 | 33.7165 | 8800 | 2.0294 |
+| 1.8197 | 34.4828 | 9000 | 2.0276 |
+| 1.8116 | 35.2490 | 9200 | 2.0281 |
+| 1.8229 | 36.0153 | 9400 | 2.0274 |
+| 1.8135 | 36.7816 | 9600 | 2.0271 |
+| 1.8135 | 37.5479 | 9800 | 2.0270 |
+| 1.8113 | 38.3142 | 10000 | 2.0264 |
+| 1.8165 | 39.0805 | 10200 | 2.0253 |
+| 1.8133 | 39.8467 | 10400 | 2.0244 |
+| 1.8082 | 40.6130 | 10600 | 2.0236 |
+| 1.8048 | 41.3793 | 10800 | 2.0230 |
+| 1.8077 | 42.1456 | 11000 | 2.0257 |
+| 1.8022 | 42.9119 | 11200 | 2.0237 |
+| 1.8005 | 43.6782 | 11400 | 2.0244 |
+| 1.806 | 44.4444 | 11600 | 2.0236 |
+| 1.8028 | 45.2107 | 11800 | 2.0243 |
+| 1.8053 | 45.9770 | 12000 | 2.0237 |
+| 1.8074 | 46.7433 | 12200 | 2.0235 |
+| 1.8009 | 47.5096 | 12400 | 2.0240 |
+| 1.7992 | 48.2759 | 12600 | 2.0240 |
+| 1.8069 | 49.0421 | 12800 | 2.0240 |
 
 
 ### Framework versions
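The training log added to README.md in this commit can be checked programmatically. The Python sketch below copies a handful of rows from the added table and reports the row with the lowest validation loss next to the final row. It also estimates optimizer steps per epoch from the last logged step/epoch pair. The `rows` list and the helper names `best_checkpoint` and `steps_per_epoch` are illustrative and not part of the model card, and the estimate assumes a constant number of steps per epoch.

```python
# Rows copied from the added README table:
# (training_loss, epoch, step, validation_loss). Subset only, for brevity.
rows = [
    (4.7656, 0.7663, 200, 4.0671),
    (2.0729, 7.6628, 2000, 2.1275),
    (1.8413, 30.6513, 8000, 2.0330),
    (1.8048, 41.3793, 10800, 2.0230),
    (1.8077, 42.1456, 11000, 2.0257),
    (1.8069, 49.0421, 12800, 2.0240),
]


def best_checkpoint(rows):
    """Return the row with the lowest validation loss (hypothetical helper)."""
    return min(rows, key=lambda r: r[3])


def steps_per_epoch(rows):
    """Estimate steps per epoch from the last logged row.

    Assumes the step count grows linearly with the epoch counter.
    """
    _, epoch, step, _ = rows[-1]
    return round(step / epoch)


best = best_checkpoint(rows)
final = rows[-1]
print(f"lowest validation loss: {best[3]:.4f} at step {best[2]}")
print(f"final validation loss:  {final[3]:.4f} at step {final[2]}")
print(f"estimated steps per epoch: {steps_per_epoch(rows)}")
```

In the full table the lowest validation loss, 2.0230, is logged at step 10800, while the reported evaluation loss of 2.0240 matches the final row at step 12800.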
runs/Oct02_20-44-56_04354b73b469/events.out.tfevents.1759437898.04354b73b469.19.0
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:69444963531cec4e3504b6215d2f8b91ec63474200c5055ab9a677301c1781c9
+size 36362
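The events file is stored as a Git LFS pointer: a short text file of `key value` lines giving the spec version, the object hash, and the object size in bytes. As a minimal sketch, the Python below parses the new pointer content from this commit into a dictionary. The function name `parse_lfs_pointer` and the returned field names are illustrative, and the parser assumes each line holds exactly one key followed by a single space and its value.

```python
def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>"; split on the first space only.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid value has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# New pointer content, copied from the added lines of this commit.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:69444963531cec4e3504b6215d2f8b91ec63474200c5055ab9a677301c1781c9
size 36362
"""

info = parse_lfs_pointer(pointer)
print(info["hash_algo"], info["size"])
```

Comparing `info["digest"]` and `info["size"]` against a downloaded copy of the events file is one way to confirm that the download matches what the pointer describes.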