CodeIsAbstract commited on
Commit
8136f45
·
verified ·
1 Parent(s): 4d62c3d

Model save

Browse files
Files changed (2) hide show
  1. README.md +17 -10
  2. model.safetensors +1 -1
README.md CHANGED
@@ -15,12 +15,12 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model was trained from scratch on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 0.7176
19
- - Rouge1: 72.4979
20
- - Rouge2: 55.09
21
- - Rougel: 72.5668
22
- - Rougelsum: 72.5151
23
- - Gen Len: 16.5209
24
 
25
  ## Model description
26
 
@@ -45,15 +45,22 @@ The following hyperparameters were used during training:
45
  - seed: 42
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
- - num_epochs: 3
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
53
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
54
- | No log | 1.0 | 76 | 0.7237 | 72.7109 | 55.346 | 72.7556 | 72.7163 | 16.4921 |
55
- | No log | 2.0 | 152 | 0.7188 | 72.4823 | 55.1403 | 72.5559 | 72.5034 | 16.5288 |
56
- | No log | 3.0 | 228 | 0.7176 | 72.4979 | 55.09 | 72.5668 | 72.5151 | 16.5209 |
 
 
 
 
 
 
 
57
 
58
 
59
  ### Framework versions
 
15
 
16
  This model was trained from scratch on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 0.6757
19
+ - Rouge1: 73.1568
20
+ - Rouge2: 56.3431
21
+ - Rougel: 73.2739
22
+ - Rougelsum: 73.2387
23
+ - Gen Len: 16.5471
24
 
25
  ## Model description
26
 
 
45
  - seed: 42
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
+ - num_epochs: 10
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
53
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
54
+ | No log | 1.0 | 76 | 0.7087 | 72.4084 | 55.2193 | 72.4899 | 72.4272 | 16.5916 |
55
+ | No log | 2.0 | 152 | 0.6998 | 72.7389 | 55.4449 | 72.7601 | 72.7258 | 16.5497 |
56
+ | No log | 3.0 | 228 | 0.6946 | 72.674 | 55.5275 | 72.7467 | 72.712 | 16.5288 |
57
+ | No log | 4.0 | 304 | 0.6888 | 72.7071 | 55.7658 | 72.7673 | 72.7402 | 16.5524 |
58
+ | No log | 5.0 | 380 | 0.6829 | 72.8829 | 55.8072 | 72.9415 | 72.9187 | 16.5602 |
59
+ | No log | 6.0 | 456 | 0.6801 | 73.067 | 55.9923 | 73.137 | 73.1117 | 16.5681 |
60
+ | 0.8082 | 7.0 | 532 | 0.6791 | 73.1192 | 56.0297 | 73.2107 | 73.1619 | 16.5707 |
61
+ | 0.8082 | 8.0 | 608 | 0.6768 | 73.0697 | 56.0297 | 73.1433 | 73.1279 | 16.5785 |
62
+ | 0.8082 | 9.0 | 684 | 0.6763 | 72.9717 | 55.9654 | 73.0873 | 73.0365 | 16.5576 |
63
+ | 0.8082 | 10.0 | 760 | 0.6757 | 73.1568 | 56.3431 | 73.2739 | 73.2387 | 16.5471 |
64
 
65
 
66
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c68110779c8a588b19162983e32c1f1b62aa87e9a3c4f28cd0957d52732c4713
3
  size 242041896
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dcf6bb7f5be98a0b03f9f42ba56e808363fc3041fdd81940521b972d879d0afc
3
  size 242041896