End of training
README.md
CHANGED
@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.
+- Loss: 1.1729
 
 ## Model description
 
@@ -32,7 +32,7 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 0.
+- learning_rate: 0.001
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
@@ -45,13 +45,15 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-|
-|
-|
-| 2.
-| 1.
-|
-| 0.
+| 16.9944 | 0.56 | 10 | 14.4855 |
+| 8.9359 | 1.11 | 20 | 6.6117 |
+| 4.7661 | 1.67 | 30 | 2.9825 |
+| 2.0337 | 2.22 | 40 | 1.8939 |
+| 1.4419 | 2.78 | 50 | 1.5266 |
+| 0.9889 | 3.33 | 60 | 1.3596 |
+| 0.8627 | 3.89 | 70 | 1.2588 |
+| 0.7604 | 4.44 | 80 | 1.1964 |
+| 0.8141 | 5.0 | 90 | 1.1729 |
 
 
 ### Framework versions
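The added training log can be sanity-checked with a little arithmetic. The sketch below is not part of the commit; it copies the nine added rows, derives the number of optimizer steps per epoch from the last row, and confirms that the reported evaluation loss (1.1729) is the lowest validation loss in the table. The names `LOG_TABLE`, `parse_rows`, and the dictionary keys are hypothetical, chosen only for this illustration.

```python
# Rows copied from the added side of the README.md diff.
LOG_TABLE = """\
| Training Loss | Epoch | Step | Validation Loss |
|:-------------:|:-----:|:----:|:---------------:|
| 16.9944 | 0.56 | 10 | 14.4855 |
| 8.9359 | 1.11 | 20 | 6.6117 |
| 4.7661 | 1.67 | 30 | 2.9825 |
| 2.0337 | 2.22 | 40 | 1.8939 |
| 1.4419 | 2.78 | 50 | 1.5266 |
| 0.9889 | 3.33 | 60 | 1.3596 |
| 0.8627 | 3.89 | 70 | 1.2588 |
| 0.7604 | 4.44 | 80 | 1.1964 |
| 0.8141 | 5.0 | 90 | 1.1729 |
"""


def parse_rows(table):
    """Turn the markdown log table into a list of dicts (hypothetical helper)."""
    rows = []
    # Skip the header row and the alignment row.
    for line in table.strip().splitlines()[2:]:
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        train, epoch, step, val = cells
        rows.append({
            "train_loss": float(train),
            "epoch": float(epoch),
            "step": int(step),
            "val_loss": float(val),
        })
    return rows


rows = parse_rows(LOG_TABLE)
last = rows[-1]

# Step 90 is logged at epoch 5.0, so one epoch spans 90 / 5.0 steps.
steps_per_epoch = last["step"] / last["epoch"]

# Every logged epoch should equal step / steps_per_epoch, up to 2-decimal rounding.
epochs_consistent = all(
    abs(r["step"] / steps_per_epoch - r["epoch"]) <= 0.005 for r in rows
)

# The row with the lowest validation loss.
best = min(rows, key=lambda r: r["val_loss"])

print(steps_per_epoch, epochs_consistent, best["step"], best["val_loss"])
```

In this table the validation loss falls at every evaluation, so the best checkpoint is also the last one, although the training loss ticks up slightly between steps 80 and 90.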
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:e685cbe9b8ccef4fb7e1763c38e40920dd6f5b3e2fbca6f62b32fb78f65498ef
 size 3132464008
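The three lines in this diff are a Git LFS pointer, not the weights themselves: the repository stores the spec version, a SHA-256 object ID, and the byte size, while the actual file lives in LFS storage. Because the size is unchanged and only the `oid` differs, the commit replaced the weights with a file of identical length. A minimal sketch of reading such a pointer follows; `parse_lfs_pointer` and its return keys are hypothetical names for this illustration, not part of any Git LFS tooling.

```python
# Pointer text copied from the added side of the model.safetensors diff.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:e685cbe9b8ccef4fb7e1763c38e40920dd6f5b3e2fbca6f62b32fb78f65498ef
size 3132464008
"""


def parse_lfs_pointer(text):
    """Split a Git LFS pointer into its fields (hypothetical helper)."""
    # Each line is "<key> <value>"; split on the first space only.
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    hash_algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


pointer = parse_lfs_pointer(POINTER_TEXT)
size_gib = pointer["size_bytes"] / 2**30  # bytes to GiB

print(pointer["hash_algo"], len(pointer["digest"]), f"{size_gib:.2f} GiB")
```

To verify a downloaded copy, one could hash the file with `hashlib.sha256` and compare the hex digest against `pointer["digest"]`.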
runs/Nov27_13-50-08_christopher-System-Product-Name/events.out.tfevents.1701053409.christopher-System-Product-Name.3714847.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:4b34ed5121b8fdbf7907916778200c8d1c20ec6b96fdda0685f619291e34a7a6
+size 10103
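The event file path itself carries some run metadata. The sketch below splits it apart and converts the embedded Unix timestamp to UTC. It assumes the layout `events.out.tfevents.<timestamp>.<hostname>.<pid>.<suffix>`; reading the last two fields as a process ID and a writer counter is an assumption, and `parse_event_path` is a hypothetical helper.

```python
from datetime import datetime, timezone

# Path copied from the diff header of the added event file.
EVENT_PATH = (
    "runs/Nov27_13-50-08_christopher-System-Product-Name/"
    "events.out.tfevents.1701053409.christopher-System-Product-Name.3714847.0"
)


def parse_event_path(path):
    """Extract run metadata from a TensorBoard event path (hypothetical helper)."""
    run_dir, file_name = path.split("/")[1:3]
    prefix = "events.out.tfevents."
    # The first field after the prefix is the Unix timestamp.
    timestamp, rest = file_name[len(prefix):].split(".", 1)
    # Split the remainder from the right so a dotted hostname would still parse.
    hostname, pid, suffix = rest.rsplit(".", 2)
    return {
        "run_dir": run_dir,
        "created_utc": datetime.fromtimestamp(int(timestamp), tz=timezone.utc),
        "hostname": hostname,
        "pid": int(pid),        # assumed to be the writer's process ID
        "suffix": int(suffix),  # assumed to be a per-process writer counter
    }


info = parse_event_path(EVENT_PATH)
print(info["created_utc"].isoformat(), info["hostname"], info["pid"])
```

The timestamp resolves to 02:50:09 UTC on 27 November 2023, while the run directory is named `Nov27_13-50-08`, which is consistent with the directory name using the machine's local time rather than UTC.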
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:b8bbdbe726d66dc031ac2c3f52d15237238072773cd3c9f72d69e3329b903090
 size 4347