JudeChaer commited on
Commit
05b2c37
·
verified ·
1 Parent(s): 93da9e6

End of training

Browse files
README.md CHANGED
@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
13
 
14
  This model was trained from scratch on an unknown dataset.
15
  It achieves the following results on the evaluation set:
16
- - Loss: 2.0228
17
 
18
  ## Model description
19
 
@@ -39,17 +39,22 @@ The following hyperparameters were used during training:
39
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
40
  - lr_scheduler_type: linear
41
  - lr_scheduler_warmup_steps: 500
42
- - num_epochs: 5
43
 
44
  ### Training results
45
 
46
  | Training Loss | Epoch | Step | Validation Loss |
47
  |:-------------:|:-----:|:----:|:---------------:|
48
- | No log | 1.0 | 4 | 3.1650 |
49
- | No log | 2.0 | 8 | 3.0043 |
50
- | 2.3478 | 3.0 | 12 | 2.7466 |
51
- | 2.3478 | 4.0 | 16 | 2.4101 |
52
- | 1.7944 | 5.0 | 20 | 2.0228 |
 
 
 
 
 
53
 
54
 
55
  ### Framework versions
 
13
 
14
  This model was trained from scratch on an unknown dataset.
15
  It achieves the following results on the evaluation set:
16
+ - Loss: 0.0142
17
 
18
  ## Model description
19
 
 
39
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
40
  - lr_scheduler_type: linear
41
  - lr_scheduler_warmup_steps: 500
42
+ - num_epochs: 10
43
 
44
  ### Training results
45
 
46
  | Training Loss | Epoch | Step | Validation Loss |
47
  |:-------------:|:-----:|:----:|:---------------:|
48
+ | No log | 1.0 | 4 | 2.0420 |
49
+ | No log | 2.0 | 8 | 1.9309 |
50
+ | 1.3898 | 3.0 | 12 | 1.7451 |
51
+ | 1.3898 | 4.0 | 16 | 1.4869 |
52
+ | 1.0743 | 5.0 | 20 | 1.1878 |
53
+ | 1.0743 | 6.0 | 24 | 0.8755 |
54
+ | 1.0743 | 7.0 | 28 | 0.5689 |
55
+ | 0.3932 | 8.0 | 32 | 0.2711 |
56
+ | 0.3932 | 9.0 | 36 | 0.0638 |
57
+ | 0.0673 | 10.0 | 40 | 0.0142 |
58
 
59
 
60
  ### Framework versions
logs/events.out.tfevents.1709816825.mintj.10499.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e879157b44fd013c39132d73c77e3d690fb815e40ba58914f6055c7b1f276eed
3
- size 5026
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7fa7698204b1e55bb2435ae6f37230508c8dc54f490381984455f2df8d41266c
3
+ size 8596
logs/events.out.tfevents.1709817256.mintj.10499.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7e718b709edfdfcc3b185c8f0bd1c01551d6b3d75ad7b9170814705b705f6121
3
+ size 306
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:410591b2f247f85fdb38adf7286fc8e5a2995e8d305b39416a424b2694fe5f24
3
  size 498615900
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:663f1139e280431154b0ae8ed5d49d185f850c701e2ecf01974d0208aea6d9a4
3
  size 498615900