1miqi1 commited on
Commit
bcfdf21
·
verified ·
1 Parent(s): ab74cc3

End of training

Browse files
Files changed (1) hide show
  1. README.md +101 -101
README.md CHANGED
@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
- - Loss: 0.0735
18
 
19
  ## Model description
20
 
@@ -46,106 +46,106 @@ The following hyperparameters were used during training:
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
- | 0.8250 | 1.0 | 9 | 0.5389 |
50
- | 0.5193 | 2.0 | 18 | 0.4522 |
51
- | 0.4503 | 3.0 | 27 | 0.3941 |
52
- | 0.3845 | 4.0 | 36 | 0.3543 |
53
- | 0.3437 | 5.0 | 45 | 0.3019 |
54
- | 0.3053 | 6.0 | 54 | 0.2775 |
55
- | 0.2728 | 7.0 | 63 | 0.2565 |
56
- | 0.2477 | 8.0 | 72 | 0.2398 |
57
- | 0.2210 | 9.0 | 81 | 0.2069 |
58
- | 0.2017 | 10.0 | 90 | 0.1838 |
59
- | 0.1840 | 11.0 | 99 | 0.1749 |
60
- | 0.1710 | 12.0 | 108 | 0.1689 |
61
- | 0.1587 | 13.0 | 117 | 0.1615 |
62
- | 0.1501 | 14.0 | 126 | 0.1510 |
63
- | 0.1379 | 15.0 | 135 | 0.1463 |
64
- | 0.1315 | 16.0 | 144 | 0.1350 |
65
- | 0.1208 | 17.0 | 153 | 0.1382 |
66
- | 0.1249 | 18.0 | 162 | 0.1331 |
67
- | 0.1135 | 19.0 | 171 | 0.1234 |
68
- | 0.1007 | 20.0 | 180 | 0.1183 |
69
- | 0.0951 | 21.0 | 189 | 0.1091 |
70
- | 0.0874 | 22.0 | 198 | 0.1087 |
71
- | 0.0798 | 23.0 | 207 | 0.1005 |
72
- | 0.0749 | 24.0 | 216 | 0.1027 |
73
- | 0.0727 | 25.0 | 225 | 0.1002 |
74
- | 0.0669 | 26.0 | 234 | 0.0971 |
75
- | 0.0622 | 27.0 | 243 | 0.0930 |
76
- | 0.0580 | 28.0 | 252 | 0.0927 |
77
- | 0.0534 | 29.0 | 261 | 0.0938 |
78
- | 0.0519 | 30.0 | 270 | 0.0914 |
79
- | 0.0467 | 31.0 | 279 | 0.0874 |
80
- | 0.0447 | 32.0 | 288 | 0.0859 |
81
- | 0.0422 | 33.0 | 297 | 0.0896 |
82
- | 0.0385 | 34.0 | 306 | 0.0885 |
83
- | 0.0382 | 35.0 | 315 | 0.0846 |
84
- | 0.0377 | 36.0 | 324 | 0.0886 |
85
- | 0.0364 | 37.0 | 333 | 0.0865 |
86
- | 0.0337 | 38.0 | 342 | 0.0850 |
87
- | 0.0299 | 39.0 | 351 | 0.0846 |
88
- | 0.0295 | 40.0 | 360 | 0.0799 |
89
- | 0.0290 | 41.0 | 369 | 0.0799 |
90
- | 0.0265 | 42.0 | 378 | 0.0821 |
91
- | 0.0245 | 43.0 | 387 | 0.0807 |
92
- | 0.0251 | 44.0 | 396 | 0.0791 |
93
- | 0.0219 | 45.0 | 405 | 0.0776 |
94
- | 0.0204 | 46.0 | 414 | 0.0776 |
95
- | 0.0192 | 47.0 | 423 | 0.0769 |
96
- | 0.0181 | 48.0 | 432 | 0.0784 |
97
- | 0.0191 | 49.0 | 441 | 0.0800 |
98
- | 0.0171 | 50.0 | 450 | 0.0787 |
99
- | 0.0172 | 51.0 | 459 | 0.0762 |
100
- | 0.0163 | 52.0 | 468 | 0.0752 |
101
- | 0.0139 | 53.0 | 477 | 0.0764 |
102
- | 0.0134 | 54.0 | 486 | 0.0759 |
103
- | 0.0128 | 55.0 | 495 | 0.0746 |
104
- | 0.0127 | 56.0 | 504 | 0.0750 |
105
- | 0.0117 | 57.0 | 513 | 0.0748 |
106
- | 0.0102 | 58.0 | 522 | 0.0746 |
107
- | 0.0093 | 59.0 | 531 | 0.0736 |
108
- | 0.0091 | 60.0 | 540 | 0.0749 |
109
- | 0.0081 | 61.0 | 549 | 0.0736 |
110
- | 0.0080 | 62.0 | 558 | 0.0722 |
111
- | 0.0080 | 63.0 | 567 | 0.0731 |
112
- | 0.0083 | 64.0 | 576 | 0.0764 |
113
- | 0.0076 | 65.0 | 585 | 0.0744 |
114
- | 0.0084 | 66.0 | 594 | 0.0731 |
115
- | 0.0071 | 67.0 | 603 | 0.0721 |
116
- | 0.0065 | 68.0 | 612 | 0.0741 |
117
- | 0.0073 | 69.0 | 621 | 0.0733 |
118
- | 0.0062 | 70.0 | 630 | 0.0728 |
119
- | 0.0062 | 71.0 | 639 | 0.0727 |
120
- | 0.0061 | 72.0 | 648 | 0.0741 |
121
- | 0.0058 | 73.0 | 657 | 0.0740 |
122
- | 0.0055 | 74.0 | 666 | 0.0746 |
123
- | 0.0056 | 75.0 | 675 | 0.0725 |
124
- | 0.0053 | 76.0 | 684 | 0.0736 |
125
- | 0.0053 | 77.0 | 693 | 0.0748 |
126
- | 0.0048 | 78.0 | 702 | 0.0728 |
127
- | 0.0045 | 79.0 | 711 | 0.0736 |
128
- | 0.0049 | 80.0 | 720 | 0.0746 |
129
- | 0.0044 | 81.0 | 729 | 0.0746 |
130
- | 0.0046 | 82.0 | 738 | 0.0746 |
131
- | 0.0043 | 83.0 | 747 | 0.0746 |
132
- | 0.0039 | 84.0 | 756 | 0.0746 |
133
- | 0.0037 | 85.0 | 765 | 0.0746 |
134
- | 0.0039 | 86.0 | 774 | 0.0743 |
135
- | 0.0039 | 87.0 | 783 | 0.0740 |
136
- | 0.0037 | 88.0 | 792 | 0.0744 |
137
- | 0.0034 | 89.0 | 801 | 0.0747 |
138
- | 0.0037 | 90.0 | 810 | 0.0744 |
139
- | 0.0035 | 91.0 | 819 | 0.0742 |
140
- | 0.0036 | 92.0 | 828 | 0.0739 |
141
- | 0.0032 | 93.0 | 837 | 0.0737 |
142
- | 0.0031 | 94.0 | 846 | 0.0735 |
143
- | 0.0034 | 95.0 | 855 | 0.0735 |
144
- | 0.0034 | 96.0 | 864 | 0.0736 |
145
- | 0.0035 | 97.0 | 873 | 0.0735 |
146
- | 0.0034 | 98.0 | 882 | 0.0734 |
147
- | 0.0033 | 99.0 | 891 | 0.0735 |
148
- | 0.0034 | 100.0 | 900 | 0.0735 |
149
 
150
 
151
  ### Framework versions
 
14
 
15
  This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Loss: 0.0826
18
 
19
  ## Model description
20
 
 
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
+ | 3.2338 | 1.0 | 9 | 2.3613 |
50
+ | 1.9485 | 2.0 | 18 | 1.5220 |
51
+ | 1.3333 | 3.0 | 27 | 1.1768 |
52
+ | 1.0688 | 4.0 | 36 | 0.9485 |
53
+ | 0.9111 | 5.0 | 45 | 0.8320 |
54
+ | 0.8187 | 6.0 | 54 | 0.7387 |
55
+ | 0.7141 | 7.0 | 63 | 0.6621 |
56
+ | 0.6359 | 8.0 | 72 | 0.5770 |
57
+ | 0.5749 | 9.0 | 81 | 0.5312 |
58
+ | 0.5475 | 10.0 | 90 | 0.4928 |
59
+ | 0.5034 | 11.0 | 99 | 0.4602 |
60
+ | 0.4597 | 12.0 | 108 | 0.4101 |
61
+ | 0.4166 | 13.0 | 117 | 0.3805 |
62
+ | 0.3871 | 14.0 | 126 | 0.3521 |
63
+ | 0.3576 | 15.0 | 135 | 0.3196 |
64
+ | 0.3265 | 16.0 | 144 | 0.2917 |
65
+ | 0.2991 | 17.0 | 153 | 0.2617 |
66
+ | 0.2761 | 18.0 | 162 | 0.2420 |
67
+ | 0.2511 | 19.0 | 171 | 0.2203 |
68
+ | 0.2274 | 20.0 | 180 | 0.2161 |
69
+ | 0.2190 | 21.0 | 189 | 0.2181 |
70
+ | 0.2122 | 22.0 | 198 | 0.2023 |
71
+ | 0.1999 | 23.0 | 207 | 0.1845 |
72
+ | 0.1812 | 24.0 | 216 | 0.1789 |
73
+ | 0.1778 | 25.0 | 225 | 0.1648 |
74
+ | 0.1650 | 26.0 | 234 | 0.1537 |
75
+ | 0.1475 | 27.0 | 243 | 0.1457 |
76
+ | 0.1415 | 28.0 | 252 | 0.1407 |
77
+ | 0.1303 | 29.0 | 261 | 0.1361 |
78
+ | 0.1225 | 30.0 | 270 | 0.1319 |
79
+ | 0.1191 | 31.0 | 279 | 0.1264 |
80
+ | 0.1154 | 32.0 | 288 | 0.1231 |
81
+ | 0.1117 | 33.0 | 297 | 0.1197 |
82
+ | 0.1063 | 34.0 | 306 | 0.1172 |
83
+ | 0.0966 | 35.0 | 315 | 0.1190 |
84
+ | 0.0949 | 36.0 | 324 | 0.1121 |
85
+ | 0.0889 | 37.0 | 333 | 0.1081 |
86
+ | 0.0829 | 38.0 | 342 | 0.1096 |
87
+ | 0.0833 | 39.0 | 351 | 0.1102 |
88
+ | 0.0778 | 40.0 | 360 | 0.1014 |
89
+ | 0.0710 | 41.0 | 369 | 0.1024 |
90
+ | 0.0690 | 42.0 | 378 | 0.1019 |
91
+ | 0.0676 | 43.0 | 387 | 0.1013 |
92
+ | 0.0633 | 44.0 | 396 | 0.0980 |
93
+ | 0.0615 | 45.0 | 405 | 0.1016 |
94
+ | 0.0583 | 46.0 | 414 | 0.0944 |
95
+ | 0.0532 | 47.0 | 423 | 0.0941 |
96
+ | 0.0539 | 48.0 | 432 | 0.0946 |
97
+ | 0.0513 | 49.0 | 441 | 0.0911 |
98
+ | 0.0474 | 50.0 | 450 | 0.0912 |
99
+ | 0.0459 | 51.0 | 459 | 0.0907 |
100
+ | 0.0442 | 52.0 | 468 | 0.0899 |
101
+ | 0.0410 | 53.0 | 477 | 0.0935 |
102
+ | 0.0368 | 54.0 | 486 | 0.0898 |
103
+ | 0.0356 | 55.0 | 495 | 0.0887 |
104
+ | 0.0344 | 56.0 | 504 | 0.0896 |
105
+ | 0.0318 | 57.0 | 513 | 0.0894 |
106
+ | 0.0307 | 58.0 | 522 | 0.0884 |
107
+ | 0.0272 | 59.0 | 531 | 0.0889 |
108
+ | 0.0261 | 60.0 | 540 | 0.0857 |
109
+ | 0.0246 | 61.0 | 549 | 0.0834 |
110
+ | 0.0237 | 62.0 | 558 | 0.0875 |
111
+ | 0.0223 | 63.0 | 567 | 0.0865 |
112
+ | 0.0229 | 64.0 | 576 | 0.0864 |
113
+ | 0.0213 | 65.0 | 585 | 0.0884 |
114
+ | 0.0213 | 66.0 | 594 | 0.0848 |
115
+ | 0.0208 | 67.0 | 603 | 0.0848 |
116
+ | 0.0192 | 68.0 | 612 | 0.0845 |
117
+ | 0.0185 | 69.0 | 621 | 0.0868 |
118
+ | 0.0180 | 70.0 | 630 | 0.0844 |
119
+ | 0.0165 | 71.0 | 639 | 0.0843 |
120
+ | 0.0160 | 72.0 | 648 | 0.0843 |
121
+ | 0.0151 | 73.0 | 657 | 0.0862 |
122
+ | 0.0134 | 74.0 | 666 | 0.0832 |
123
+ | 0.0141 | 75.0 | 675 | 0.0840 |
124
+ | 0.0138 | 76.0 | 684 | 0.0857 |
125
+ | 0.0134 | 77.0 | 693 | 0.0840 |
126
+ | 0.0131 | 78.0 | 702 | 0.0853 |
127
+ | 0.0133 | 79.0 | 711 | 0.0858 |
128
+ | 0.0123 | 80.0 | 720 | 0.0844 |
129
+ | 0.0118 | 81.0 | 729 | 0.0842 |
130
+ | 0.0117 | 82.0 | 738 | 0.0845 |
131
+ | 0.0103 | 83.0 | 747 | 0.0845 |
132
+ | 0.0112 | 84.0 | 756 | 0.0834 |
133
+ | 0.0104 | 85.0 | 765 | 0.0831 |
134
+ | 0.0101 | 86.0 | 774 | 0.0833 |
135
+ | 0.0100 | 87.0 | 783 | 0.0823 |
136
+ | 0.0095 | 88.0 | 792 | 0.0837 |
137
+ | 0.0098 | 89.0 | 801 | 0.0823 |
138
+ | 0.0091 | 90.0 | 810 | 0.0843 |
139
+ | 0.0088 | 91.0 | 819 | 0.0834 |
140
+ | 0.0090 | 92.0 | 828 | 0.0840 |
141
+ | 0.0089 | 93.0 | 837 | 0.0837 |
142
+ | 0.0082 | 94.0 | 846 | 0.0839 |
143
+ | 0.0085 | 95.0 | 855 | 0.0836 |
144
+ | 0.0079 | 96.0 | 864 | 0.0835 |
145
+ | 0.0079 | 97.0 | 873 | 0.0832 |
146
+ | 0.0080 | 98.0 | 882 | 0.0829 |
147
+ | 0.0083 | 99.0 | 891 | 0.0826 |
148
+ | 0.0076 | 100.0 | 900 | 0.0826 |
149
 
150
 
151
  ### Framework versions