End of training
README.md
CHANGED
@@ -16,12 +16,12 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [facebook/mbart-large-50-many-to-many-mmt](https://huggingface.co/facebook/mbart-large-50-many-to-many-mmt) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss:
+- Loss: 1.0832
 - Rouge1: 0.0
 - Rouge2: 0.0
 - Rougel: 0.0
 - Rougelsum: 0.0
-- Gen Len: 6.
+- Gen Len: 6.5
 
 ## Model description
 
@@ -52,10 +52,10 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
-| No log | 1.0 | 40 |
-| No log | 2.0 | 80 |
-| No log | 3.0 | 120 |
-| No log | 4.0 | 160 |
+| No log | 1.0 | 40 | 1.1185 | 0.0 | 0.0 | 0.0 | 0.0 | 5.925 |
+| No log | 2.0 | 80 | 1.0907 | 0.0 | 0.0 | 0.0 | 0.0 | 6.5 |
+| No log | 3.0 | 120 | 1.0562 | 0.0 | 0.0 | 0.0 | 0.0 | 6.3875 |
+| No log | 4.0 | 160 | 1.0832 | 0.0 | 0.0 | 0.0 | 0.0 | 6.5 |
 
 
 ### Framework versions
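Reviewer note: the evaluation loss reported in the README (1.0832) equals the epoch 4 row, while the epoch 3 row has a lower validation loss (1.0562), and every ROUGE column is 0.0. The sketch below is one way to check this mechanically from the table text. The helper name `parse_markdown_table` is hypothetical and not part of this repository, and the rows are copied from the added lines of the README diff.

```python
# Rows copied verbatim from the updated training-results table in README.md.
TABLE = """\
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
| No log | 1.0 | 40 | 1.1185 | 0.0 | 0.0 | 0.0 | 0.0 | 5.925 |
| No log | 2.0 | 80 | 1.0907 | 0.0 | 0.0 | 0.0 | 0.0 | 6.5 |
| No log | 3.0 | 120 | 1.0562 | 0.0 | 0.0 | 0.0 | 0.0 | 6.3875 |
| No log | 4.0 | 160 | 1.0832 | 0.0 | 0.0 | 0.0 | 0.0 | 6.5 |
"""


def parse_markdown_table(text):
    """Hypothetical helper: turn a simple pipe table into a list of dicts."""
    lines = [ln.strip() for ln in text.strip().splitlines()]
    header = [cell.strip() for cell in lines[0].strip("|").split("|")]
    rows = []
    for ln in lines[2:]:  # skip the header row and the alignment row
        cells = [cell.strip() for cell in ln.strip("|").split("|")]
        rows.append(dict(zip(header, cells)))
    return rows


rows = parse_markdown_table(TABLE)

# Epoch with the lowest validation loss versus the last logged epoch.
best = min(rows, key=lambda r: float(r["Validation Loss"]))
final = rows[-1]

# True when every ROUGE cell in every row is exactly 0.0.
rouge_columns = ("Rouge1", "Rouge2", "Rougel", "Rougelsum")
rouge_all_zero = all(float(r[c]) == 0.0 for r in rows for c in rouge_columns)

print("best epoch:", best["Epoch"], "loss:", best["Validation Loss"])
print("final epoch:", final["Epoch"], "loss:", final["Validation Loss"])
print("all ROUGE scores zero:", rouge_all_zero)
```

If the uploaded weights are meant to be the best checkpoint rather than the last one, the epoch 3 row would be the one to report. A ROUGE of exactly 0.0 at every epoch may point to a problem in the metric computation or decoding step rather than in the model itself, though the diff alone cannot confirm that.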
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:fc470e120a0a0fa8b379a00cc8ba5ef901d950ac7660c5b37ae4b201a67042f8
 size 2444578688
runs/Mar20_01-11-10_daca2a613645/events.out.tfevents.1710897072.daca2a613645.256.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9fe9243f31d1b0815620959cac308e657d6a9fc7fc0e80d55a2607bda4ce0d21
+size 7898
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:01fe2e45e138ea0412d3a328b91d6c46f6e8274f06f9f6e6f776b84d9212044f
 size 5048
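Reviewer note: the three binary files in this commit are stored as Git LFS pointers, each with a `version`, an `oid` (hash algorithm and digest), and a `size` in bytes. A minimal sketch of how a downloaded file could be checked against such a pointer follows. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the pointer text is the new `training_args.bin` pointer from this diff.

```python
import hashlib


def parse_lfs_pointer(text):
    """Hypothetical helper: parse 'key value' lines of a Git LFS pointer."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and digest."""
    hasher = hashlib.new(pointer["algo"])
    total = 0
    with open(path, "rb") as handle:
        # Read in chunks so multi-gigabyte weight files are not loaded at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# Pointer contents copied from the training_args.bin hunk of this commit.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:01fe2e45e138ea0412d3a328b91d6c46f6e8274f06f9f6e6f776b84d9212044f
size 5048
"""

pointer = parse_lfs_pointer(POINTER)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("training_args.bin", pointer)` on a local copy would then report whether the download matches the committed pointer. The local path is an assumption about where the file was saved.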