End of training
Browse files- README.md +14 -12
- events.out.tfevents.1717772145.25dcf196d7e6.1682.5 +2 -2
- model.safetensors +1 -1
README.md
CHANGED
|
@@ -22,7 +22,7 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 22 |
|
| 23 |
This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased) on the stsb_multi_mt dataset.
|
| 24 |
It achieves the following results on the evaluation set:
|
| 25 |
-
- Loss: 2.
|
| 26 |
- Accuracy: 0.1762
|
| 27 |
|
| 28 |
## Model description
|
|
@@ -51,21 +51,23 @@ The following hyperparameters were used during training:
|
|
| 51 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
| 52 |
- lr_scheduler_type: linear
|
| 53 |
- lr_scheduler_warmup_steps: 1000
|
| 54 |
-
- training_steps:
|
| 55 |
- mixed_precision_training: Native AMP
|
| 56 |
|
| 57 |
### Training results
|
| 58 |
|
| 59 |
-
| Training Loss | Epoch | Step
|
| 60 |
-
|:-------------:|:-------:|:----:|:---------------:|:--------:|
|
| 61 |
-
| 0.
|
| 62 |
-
| 0.
|
| 63 |
-
| 0.
|
| 64 |
-
| 0.
|
| 65 |
-
| 0.
|
| 66 |
-
| 0.
|
| 67 |
-
| 0.
|
| 68 |
-
| 0.
|
|
|
|
|
|
|
| 69 |
|
| 70 |
|
| 71 |
### Framework versions
|
|
|
|
| 22 |
|
| 23 |
This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased) on the stsb_multi_mt dataset.
|
| 24 |
It achieves the following results on the evaluation set:
|
| 25 |
+
- Loss: 2.7711
|
| 26 |
- Accuracy: 0.1762
|
| 27 |
|
| 28 |
## Model description
|
|
|
|
| 51 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
| 52 |
- lr_scheduler_type: linear
|
| 53 |
- lr_scheduler_warmup_steps: 1000
|
| 54 |
+
- training_steps: 10000
|
| 55 |
- mixed_precision_training: Native AMP
|
| 56 |
|
| 57 |
### Training results
|
| 58 |
|
| 59 |
+
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|
| 60 |
+
|:-------------:|:-------:|:-----:|:---------------:|:--------:|
|
| 61 |
+
| 0.4658 | 5.5556 | 1000 | 2.8590 | 0.1762 |
|
| 62 |
+
| 0.4956 | 11.1111 | 2000 | 2.9260 | 0.1762 |
|
| 63 |
+
| 0.349 | 16.6667 | 3000 | 2.9019 | 0.1762 |
|
| 64 |
+
| 0.2575 | 22.2222 | 4000 | 2.8843 | 0.1762 |
|
| 65 |
+
| 0.2261 | 27.7778 | 5000 | 2.8242 | 0.1762 |
|
| 66 |
+
| 0.2592 | 33.3333 | 6000 | 2.8276 | 0.1762 |
|
| 67 |
+
| 0.1979 | 38.8889 | 7000 | 2.7752 | 0.1762 |
|
| 68 |
+
| 0.2125 | 44.4444 | 8000 | 2.7567 | 0.1762 |
|
| 69 |
+
| 0.157 | 50.0 | 9000 | 2.7864 | 0.1762 |
|
| 70 |
+
| 0.1751 | 55.5556 | 10000 | 2.7711 | 0.1762 |
|
| 71 |
|
| 72 |
|
| 73 |
### Framework versions
|
events.out.tfevents.1717772145.25dcf196d7e6.1682.5
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1d070af62e4b075198d398ce1f76a9143a19221189b404a9e258306750acb1ad
|
| 3 |
+
size 92971
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 437955572
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4a8a3efc3b0f970670ba9e3dfc40d50fcdfc8ea302d71f81440800c8bed6cbf5
|
| 3 |
size 437955572
|