End of training

Files changed (4)

README.md CHANGED

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model was trained from scratch on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.0087
 ## Model description
@@ -38,15 +38,17 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 1.9656        | 1.0   | 2609 | 2.0155          |
-| 1.9671        | 2.0   | 5218 | 2.0055          |
-| 1.9943        | 3.0   | 7827 | 2.0087          |
 ### Framework versions

 This model was trained from scratch on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.2297
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss |
+|:-------------:|:-----:|:-----:|:---------------:|
+| 0.9876        | 1.0   | 2609  | nan             |
+| 1.0599        | 2.0   | 5218  | 1.2590          |
+| 1.175         | 3.0   | 7827  | 1.2065          |
+| 0.9931        | 4.0   | 10436 | 1.2508          |
+| 1.0682        | 5.0   | 13045 | 1.2297          |
 ### Framework versions
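
A note on reading the updated results table: the headline `Loss: 1.2297` matches the final (epoch 5) row, while the lowest validation loss in the table is at an earlier epoch, and the epoch 1 entry is `nan`. The sketch below picks the lowest finite validation loss and converts each loss to a perplexity. It assumes the reported loss is a mean natural-log cross-entropy (PyTorch's default reduction), which this diff does not state; the names `val_loss_by_epoch` and `perplexity` are illustrative only.

```python
import math

# Validation losses copied from the updated results table; epoch 1 logged nan.
val_loss_by_epoch = {
    1: float("nan"),
    2: 1.2590,
    3: 1.2065,
    4: 1.2508,
    5: 1.2297,
}


def perplexity(loss: float) -> float:
    """Return exp(loss), assuming a mean natural-log cross-entropy (an assumption)."""
    return math.exp(loss)


# Drop nan entries first: comparisons against nan are always False,
# so leaving them in would make min() unreliable.
finite = {
    epoch: loss
    for epoch, loss in val_loss_by_epoch.items()
    if not math.isnan(loss)
}
best_epoch = min(finite, key=finite.get)

for epoch, loss in sorted(finite.items()):
    print(f"epoch {epoch}: loss={loss:.4f} perplexity={perplexity(loss):.3f}")
print(f"lowest validation loss at epoch {best_epoch}")
```

Because the loss is computed only over masked positions for a masked language model, the resulting figure is better read as a masked-token perplexity than as a conventional language-model perplexity.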

config.json CHANGED

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "hangman-bert-base-2",
   "architectures": [
     "BertForMaskedLM"
   ],

 {
+  "_name_or_path": "hangman-bert-large",
   "architectures": [
     "BertForMaskedLM"
   ],

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8ccf0d2bd44c4a094a3c2ebd91ce1cc882dac623f6866c0095707bf1b396aeda
 size 438126133

 version https://git-lfs.github.com/spec/v1
+oid sha256:bcf73dec3331e998bb32b3200715a8f78c2429366730cf0d7f9394c408d88e34
 size 438126133
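
The two `pytorch_model.bin` entries above are Git LFS pointer files rather than the weights themselves: each records a spec version, a SHA-256 object ID, and a byte size. As a minimal sketch of how such a pair can be compared, the helper below (`parse_lfs_pointer`, a hypothetical name, not part of any Git LFS tooling) splits each `key value` line and checks whether the object ID or the size changed. It handles only the three keys that appear in this diff.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the `key value` lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the removed and added sides of the diff.
old = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:8ccf0d2bd44c4a094a3c2ebd91ce1cc882dac623f6866c0095707bf1b396aeda
size 438126133
""")
new = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:bcf73dec3331e998bb32b3200715a8f78c2429366730cf0d7f9394c408d88e34
size 438126133
""")

content_changed = old["digest"] != new["digest"]
same_size = old["size"] == new["size"]
print(f"content changed: {content_changed}, same size: {same_size}")
```

A changed digest with an identical size is what one would expect when weights of the same architecture and dtype are retrained and re-uploaded, though the pointer alone cannot confirm that.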

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8f79a45a8a4a4d69f0aa4197461ddd43579e432e052a42c1b42fafcfa143328c
 size 4027

 version https://git-lfs.github.com/spec/v1
+oid sha256:7ad0ee0ce7f70e420fac68bbb23f0db94615d8af139371b7e45aedada99a65fc
 size 4027