End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.0228
 ## Model description
@@ -39,17 +39,22 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 4    | 3.1650          |
-| No log        | 2.0   | 8    | 3.0043          |
-| 2.3478        | 3.0   | 12   | 2.7466          |
-| 2.3478        | 4.0   | 16   | 2.4101          |
-| 1.7944        | 5.0   | 20   | 2.0228          |
 ### Framework versions

 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0142
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 4    | 2.0420          |
+| No log        | 2.0   | 8    | 1.9309          |
+| 1.3898        | 3.0   | 12   | 1.7451          |
+| 1.3898        | 4.0   | 16   | 1.4869          |
+| 1.0743        | 5.0   | 20   | 1.1878          |
+| 1.0743        | 6.0   | 24   | 0.8755          |
+| 1.0743        | 7.0   | 28   | 0.5689          |
+| 0.3932        | 8.0   | 32   | 0.2711          |
+| 0.3932        | 9.0   | 36   | 0.0638          |
+| 0.0673        | 10.0  | 40   | 0.0142          |
 ### Framework versions

logs/events.out.tfevents.1709816825.mintj.10499.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e879157b44fd013c39132d73c77e3d690fb815e40ba58914f6055c7b1f276eed
-size 5026

 version https://git-lfs.github.com/spec/v1
+oid sha256:7fa7698204b1e55bb2435ae6f37230508c8dc54f490381984455f2df8d41266c
+size 8596

logs/events.out.tfevents.1709817256.mintj.10499.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:7e718b709edfdfcc3b185c8f0bd1c01551d6b3d75ad7b9170814705b705f6121
+size 306

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:410591b2f247f85fdb38adf7286fc8e5a2995e8d305b39416a424b2694fe5f24
 size 498615900

 version https://git-lfs.github.com/spec/v1
+oid sha256:663f1139e280431154b0ae8ed5d49d185f850c701e2ecf01974d0208aea6d9a4
 size 498615900