End of training
Browse files- README.md +14 -30
- model.safetensors +1 -1
- runs/Jul11_00-51-23_0acce8ee3931/events.out.tfevents.1720659086.0acce8ee3931.378.8 +3 -0
- runs/Jul11_00-51-23_0acce8ee3931/events.out.tfevents.1720659802.0acce8ee3931.378.9 +3 -0
- runs/Jul11_01-04-03_0acce8ee3931/events.out.tfevents.1720659845.0acce8ee3931.378.10 +3 -0
- runs/Jul11_01-04-03_0acce8ee3931/events.out.tfevents.1720660100.0acce8ee3931.378.11 +3 -0
- training_args.bin +1 -1
README.md
CHANGED
|
@@ -17,8 +17,8 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 17 |
|
| 18 |
This model is a fine-tuned version of [dadashzadeh/tiny-bert-Sentiment-persian](https://huggingface.co/dadashzadeh/tiny-bert-Sentiment-persian) on an unknown dataset.
|
| 19 |
It achieves the following results on the evaluation set:
|
| 20 |
-
- Loss: 0.
|
| 21 |
-
- Accuracy: 0.
|
| 22 |
|
| 23 |
## Model description
|
| 24 |
|
|
@@ -38,39 +38,23 @@ More information needed
|
|
| 38 |
|
| 39 |
The following hyperparameters were used during training:
|
| 40 |
- learning_rate: 2e-05
|
| 41 |
-
- train_batch_size:
|
| 42 |
-
- eval_batch_size:
|
| 43 |
-
- seed:
|
| 44 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
| 45 |
- lr_scheduler_type: linear
|
| 46 |
-
- num_epochs:
|
| 47 |
|
| 48 |
### Training results
|
| 49 |
|
| 50 |
-
| Training Loss | Epoch | Step
|
| 51 |
-
|
| 52 |
-
| 0.
|
| 53 |
-
| 0.
|
| 54 |
-
| 0.
|
| 55 |
-
| 0.
|
| 56 |
-
| 0.
|
| 57 |
-
| 0.
|
| 58 |
-
| 0.7764 | 7.0 | 16688 | 0.7438 | 0.6756 |
|
| 59 |
-
| 0.7763 | 8.0 | 19072 | 0.7328 | 0.6829 |
|
| 60 |
-
| 0.7717 | 9.0 | 21456 | 0.7300 | 0.6825 |
|
| 61 |
-
| 0.7733 | 10.0 | 23840 | 0.6943 | 0.7106 |
|
| 62 |
-
| 0.7517 | 11.0 | 26224 | 0.6859 | 0.7210 |
|
| 63 |
-
| 0.7515 | 12.0 | 28608 | 0.6538 | 0.7287 |
|
| 64 |
-
| 0.7497 | 13.0 | 30992 | 0.6930 | 0.7084 |
|
| 65 |
-
| 0.7177 | 14.0 | 33376 | 0.7055 | 0.6972 |
|
| 66 |
-
| 0.734 | 15.0 | 35760 | 0.6893 | 0.7123 |
|
| 67 |
-
| 0.7247 | 16.0 | 38144 | 0.7026 | 0.7045 |
|
| 68 |
-
| 0.7317 | 17.0 | 40528 | 0.6711 | 0.7210 |
|
| 69 |
-
| 0.7145 | 18.0 | 42912 | 0.7192 | 0.6911 |
|
| 70 |
-
| 0.7136 | 19.0 | 45296 | 0.6951 | 0.7102 |
|
| 71 |
-
| 0.7159 | 20.0 | 47680 | 0.6776 | 0.7197 |
|
| 72 |
-
| 0.7278 | 21.0 | 50064 | 0.6814 | 0.7162 |
|
| 73 |
-
| 0.6952 | 22.0 | 52448 | 0.6829 | 0.7158 |
|
| 74 |
|
| 75 |
|
| 76 |
### Framework versions
|
|
|
|
| 17 |
|
| 18 |
This model is a fine-tuned version of [dadashzadeh/tiny-bert-Sentiment-persian](https://huggingface.co/dadashzadeh/tiny-bert-Sentiment-persian) on an unknown dataset.
|
| 19 |
It achieves the following results on the evaluation set:
|
| 20 |
+
- Loss: 0.6672
|
| 21 |
+
- Accuracy: 0.7279
|
| 22 |
|
| 23 |
## Model description
|
| 24 |
|
|
|
|
| 38 |
|
| 39 |
The following hyperparameters were used during training:
|
| 40 |
- learning_rate: 2e-05
|
| 41 |
+
- train_batch_size: 20
|
| 42 |
+
- eval_batch_size: 20
|
| 43 |
+
- seed: 10
|
| 44 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
| 45 |
- lr_scheduler_type: linear
|
| 46 |
+
- num_epochs: 6
|
| 47 |
|
| 48 |
### Training results
|
| 49 |
|
| 50 |
+
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|
| 51 |
+
|:-------------:|:-----:|:----:|:---------------:|:--------:|
|
| 52 |
+
| 0.696 | 1.0 | 1431 | 0.6928 | 0.7136 |
|
| 53 |
+
| 0.6826 | 2.0 | 2862 | 0.6893 | 0.7153 |
|
| 54 |
+
| 0.6737 | 3.0 | 4293 | 0.6620 | 0.7257 |
|
| 55 |
+
| 0.6801 | 4.0 | 5724 | 0.6579 | 0.7317 |
|
| 56 |
+
| 0.6682 | 5.0 | 7155 | 0.6662 | 0.7270 |
|
| 57 |
+
| 0.6756 | 6.0 | 8586 | 0.6672 | 0.7279 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 58 |
|
| 59 |
|
| 60 |
### Framework versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 17549828
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6b719c5f5b86ed89c4905a111b9a94acde408ac626af3879f1429404cf74c54f
|
| 3 |
size 17549828
|
runs/Jul11_00-51-23_0acce8ee3931/events.out.tfevents.1720659086.0acce8ee3931.378.8
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:49ee5546c9762868bbf75ce7778c0392bea4289847c78c14bc88976a185b1259
|
| 3 |
+
size 23569
|
runs/Jul11_00-51-23_0acce8ee3931/events.out.tfevents.1720659802.0acce8ee3931.378.9
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:317ef75101cf6b8d37b423e69bf6a5a2e4a9928a8d87e86c8eae83747c82a107
|
| 3 |
+
size 417
|
runs/Jul11_01-04-03_0acce8ee3931/events.out.tfevents.1720659845.0acce8ee3931.378.10
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:72b5a3c704ca991e5edd559310e96dc554b03be3a2ea7d9a88e86332cae3e2a8
|
| 3 |
+
size 10947
|
runs/Jul11_01-04-03_0acce8ee3931/events.out.tfevents.1720660100.0acce8ee3931.378.11
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:235897759b2054705cb508abdf45f004251fd372bd3233ff01a401f9d6af10f8
|
| 3 |
+
size 411
|
training_args.bin
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 5112
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a90c212a44e2ab507b605b3ed5d0c455488aaf0d884d66018be144eeabc926c8
|
| 3 |
size 5112
|