End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.1000
 ## Model description
@@ -32,7 +32,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0008
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
@@ -45,13 +45,15 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 15.2493       | 0.71  | 10   | 10.7980         |
-| 7.1712        | 1.43  | 20   | 5.1215          |
-| 3.4797        | 2.14  | 30   | 2.4679          |
-| 2.0141        | 2.86  | 40   | 1.5039          |
-| 1.3795        | 3.57  | 50   | 1.2629          |
-| 1.193         | 4.29  | 60   | 1.1451          |
-| 0.957         | 5.0   | 70   | 1.1000          |
 ### Framework versions

 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.1729
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.001
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 16.9944       | 0.56  | 10   | 14.4855         |
+| 8.9359        | 1.11  | 20   | 6.6117          |
+| 4.7661        | 1.67  | 30   | 2.9825          |
+| 2.0337        | 2.22  | 40   | 1.8939          |
+| 1.4419        | 2.78  | 50   | 1.5266          |
+| 0.9889        | 3.33  | 60   | 1.3596          |
+| 0.8627        | 3.89  | 70   | 1.2588          |
+| 0.7604        | 4.44  | 80   | 1.1964          |
+| 0.8141        | 5.0   | 90   | 1.1729          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:530d6683b73ebf4645dbfa6077dfc5d76a577b8a22438d3466cd4753ba833bcc
 size 3132464008

 version https://git-lfs.github.com/spec/v1
+oid sha256:e685cbe9b8ccef4fb7e1763c38e40920dd6f5b3e2fbca6f62b32fb78f65498ef
 size 3132464008

runs/Nov27_13-50-08_christopher-System-Product-Name/events.out.tfevents.1701053409.christopher-System-Product-Name.3714847.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:4b34ed5121b8fdbf7907916778200c8d1c20ec6b96fdda0685f619291e34a7a6
+size 10103

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:079f7efc72c3779560b457c0fc0512d9e70459bb7c3f5bf28e5a4a7e34a9886a
 size 4347

 version https://git-lfs.github.com/spec/v1
+oid sha256:b8bbdbe726d66dc031ac2c3f52d15237238072773cd3c9f72d69e3329b903090
 size 4347