hung200504
/

bert-31

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [hung200504/bert-21](https://huggingface.co/hung200504/bert-21) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 11.6244
 ## Model description
@@ -35,60 +35,60 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 3e-07
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 2
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 10.4595       | 0.05  | 5    | 11.9220         |
-| 10.4917       | 0.09  | 10   | 11.9078         |
-| 10.7603       | 0.14  | 15   | 11.8942         |
-| 11.3788       | 0.18  | 20   | 11.8816         |
-| 11.2043       | 0.23  | 25   | 11.8684         |
-| 10.4152       | 0.28  | 30   | 11.8558         |
-| 10.6386       | 0.32  | 35   | 11.8436         |
-| 10.7594       | 0.37  | 40   | 11.8319         |
-| 10.9593       | 0.41  | 45   | 11.8201         |
-| 10.3177       | 0.46  | 50   | 11.8090         |
-| 11.2145       | 0.5   | 55   | 11.7982         |
-| 10.4266       | 0.55  | 60   | 11.7877         |
-| 9.9668        | 0.6   | 65   | 11.7774         |
-| 9.8843        | 0.64  | 70   | 11.7681         |
-| 10.6546       | 0.69  | 75   | 11.7585         |
-| 10.9206       | 0.73  | 80   | 11.7497         |
-| 10.5989       | 0.78  | 85   | 11.7411         |
-| 10.1503       | 0.83  | 90   | 11.7322         |
-| 10.241        | 0.87  | 95   | 11.7242         |
-| 10.3243       | 0.92  | 100  | 11.7163         |
-| 10.0939       | 0.96  | 105  | 11.7089         |
-| 10.2936       | 1.01  | 110  | 11.7015         |
-| 9.8946        | 1.06  | 115  | 11.6947         |
-| 10.806        | 1.1   | 120  | 11.6880         |
-| 10.3278       | 1.15  | 125  | 11.6818         |
-| 9.9841        | 1.19  | 130  | 11.6760         |
-| 10.375        | 1.24  | 135  | 11.6701         |
-| 10.7491       | 1.28  | 140  | 11.6654         |
-| 10.0501       | 1.33  | 145  | 11.6600         |
-| 10.4491       | 1.38  | 150  | 11.6553         |
-| 9.9726        | 1.42  | 155  | 11.6511         |
-| 10.2234       | 1.47  | 160  | 11.6471         |
-| 10.2598       | 1.51  | 165  | 11.6434         |
-| 9.9445        | 1.56  | 170  | 11.6407         |
-| 9.9564        | 1.61  | 175  | 11.6372         |
-| 10.7074       | 1.65  | 180  | 11.6345         |
-| 10.7075       | 1.7   | 185  | 11.6321         |
-| 10.3899       | 1.74  | 190  | 11.6301         |
-| 10.5313       | 1.79  | 195  | 11.6285         |
-| 10.8444       | 1.83  | 200  | 11.6271         |
-| 10.8303       | 1.88  | 205  | 11.6257         |
-| 10.7634       | 1.93  | 210  | 11.6249         |
-| 10.237        | 1.97  | 215  | 11.6244         |
 ### Framework versions

 This model is a fine-tuned version of [hung200504/bert-21](https://huggingface.co/hung200504/bert-21) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 11.6562
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 3e-07
+- train_batch_size: 4
+- eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 1
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 9.9408        | 0.02  | 5    | 11.9244         |
+| 10.9745       | 0.05  | 10   | 11.9123         |
+| 11.7105       | 0.07  | 15   | 11.8995         |
+| 10.574        | 0.09  | 20   | 11.8876         |
+| 11.2567       | 0.11  | 25   | 11.8761         |
+| 10.1985       | 0.14  | 30   | 11.8651         |
+| 11.1306       | 0.16  | 35   | 11.8543         |
+| 11.0848       | 0.18  | 40   | 11.8435         |
+| 10.9051       | 0.21  | 45   | 11.8331         |
+| 11.2139       | 0.23  | 50   | 11.8228         |
+| 9.4434        | 0.25  | 55   | 11.8132         |
+| 10.6242       | 0.28  | 60   | 11.8038         |
+| 10.2756       | 0.3   | 65   | 11.7948         |
+| 11.1823       | 0.32  | 70   | 11.7861         |
+| 11.3154       | 0.34  | 75   | 11.7776         |
+| 10.4026       | 0.37  | 80   | 11.7694         |
+| 11.4274       | 0.39  | 85   | 11.7615         |
+| 10.1923       | 0.41  | 90   | 11.7535         |
+| 10.8907       | 0.44  | 95   | 11.7463         |
+| 10.5215       | 0.46  | 100  | 11.7395         |
+| 11.2088       | 0.48  | 105  | 11.7323         |
+| 10.3167       | 0.5   | 110  | 11.7258         |
+| 10.6535       | 0.53  | 115  | 11.7197         |
+| 10.8819       | 0.55  | 120  | 11.7137         |
+| 10.0389       | 0.57  | 125  | 11.7080         |
+| 10.0161       | 0.6   | 130  | 11.7025         |
+| 10.4476       | 0.62  | 135  | 11.6975         |
+| 10.3089       | 0.64  | 140  | 11.6930         |
+| 10.6388       | 0.67  | 145  | 11.6885         |
+| 11.1704       | 0.69  | 150  | 11.6842         |
+| 10.6095       | 0.71  | 155  | 11.6804         |
+| 10.8112       | 0.73  | 160  | 11.6766         |
+| 10.0912       | 0.76  | 165  | 11.6734         |
+| 10.8292       | 0.78  | 170  | 11.6704         |
+| 10.3543       | 0.8   | 175  | 11.6676         |
+| 9.7421        | 0.83  | 180  | 11.6649         |
+| 11.3465       | 0.85  | 185  | 11.6626         |
+| 9.4446        | 0.87  | 190  | 11.6609         |
+| 10.4486       | 0.89  | 195  | 11.6593         |
+| 10.2593       | 0.92  | 200  | 11.6579         |
+| 10.379        | 0.94  | 205  | 11.6571         |
+| 9.7728        | 0.96  | 210  | 11.6567         |
+| 9.7654        | 0.99  | 215  | 11.6562         |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bc4b819c8118003d79c5a4cb1b003ab4e88942c03748c0d70d3fdf525fe1a210
 size 430953062

 version https://git-lfs.github.com/spec/v1
+oid sha256:ebbb68b88b8ab428f75c82ef3ec0e86a2f1773f1c6c91c79be4efe5b1083273f
 size 430953062

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4cd01788aa90ffc91925a66d54b57acec31dd28521190aeaaf5f69e33c6d9a33
 size 4472

 version https://git-lfs.github.com/spec/v1
+oid sha256:00da6c47f2c88e65093029c1d97ed5c3d3676aa2f00f2c2e6e6cfa916f4058e7
 size 4472