Upload TFBertForPreTraining
Browse files- README.md +43 -23
- tf_model.h5 +1 -1
README.md
CHANGED
|
@@ -13,8 +13,8 @@ probably proofread and complete it, then remove this comment. -->
|
|
| 13 |
|
| 14 |
This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
|
| 15 |
It achieves the following results on the evaluation set:
|
| 16 |
-
- Train Loss:
|
| 17 |
-
- Epoch:
|
| 18 |
|
| 19 |
## Model description
|
| 20 |
|
|
@@ -33,33 +33,53 @@ More information needed
|
|
| 33 |
### Training hyperparameters
|
| 34 |
|
| 35 |
The following hyperparameters were used during training:
|
| 36 |
-
- optimizer: {'name': 'Adam', 'learning_rate':
|
| 37 |
- training_precision: float32
|
| 38 |
|
| 39 |
### Training results
|
| 40 |
|
| 41 |
| Train Loss | Epoch |
|
| 42 |
|:----------:|:-----:|
|
| 43 |
-
|
|
| 44 |
-
|
|
| 45 |
-
|
|
| 46 |
-
|
|
| 47 |
-
|
|
| 48 |
-
|
|
| 49 |
-
|
|
| 50 |
-
|
|
| 51 |
-
|
|
| 52 |
-
|
|
| 53 |
-
|
|
| 54 |
-
|
|
| 55 |
-
|
|
| 56 |
-
|
|
| 57 |
-
|
|
| 58 |
-
|
|
| 59 |
-
|
|
| 60 |
-
|
|
| 61 |
-
|
|
| 62 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 63 |
|
| 64 |
|
| 65 |
### Framework versions
|
|
|
|
| 13 |
|
| 14 |
This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
|
| 15 |
It achieves the following results on the evaluation set:
|
| 16 |
+
- Train Loss: 3.7745
|
| 17 |
+
- Epoch: 39
|
| 18 |
|
| 19 |
## Model description
|
| 20 |
|
|
|
|
| 33 |
### Training hyperparameters
|
| 34 |
|
| 35 |
The following hyperparameters were used during training:
|
| 36 |
+
- optimizer: {'name': 'Adam', 'learning_rate': 2e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
|
| 37 |
- training_precision: float32
|
| 38 |
|
| 39 |
### Training results
|
| 40 |
|
| 41 |
| Train Loss | Epoch |
|
| 42 |
|:----------:|:-----:|
|
| 43 |
+
| 9.8117 | 0 |
|
| 44 |
+
| 8.2576 | 1 |
|
| 45 |
+
| 7.4407 | 2 |
|
| 46 |
+
| 6.6293 | 3 |
|
| 47 |
+
| 6.5469 | 4 |
|
| 48 |
+
| 6.2164 | 5 |
|
| 49 |
+
| 6.0521 | 6 |
|
| 50 |
+
| 5.9713 | 7 |
|
| 51 |
+
| 5.9086 | 8 |
|
| 52 |
+
| 5.8189 | 9 |
|
| 53 |
+
| 5.6795 | 10 |
|
| 54 |
+
| 5.5906 | 11 |
|
| 55 |
+
| 5.5204 | 12 |
|
| 56 |
+
| 5.5486 | 13 |
|
| 57 |
+
| 5.4477 | 14 |
|
| 58 |
+
| 5.2403 | 15 |
|
| 59 |
+
| 5.0455 | 16 |
|
| 60 |
+
| 5.3176 | 17 |
|
| 61 |
+
| 5.0164 | 18 |
|
| 62 |
+
| 4.9527 | 19 |
|
| 63 |
+
| 4.8094 | 20 |
|
| 64 |
+
| 4.5558 | 21 |
|
| 65 |
+
| 4.5773 | 22 |
|
| 66 |
+
| 4.4212 | 23 |
|
| 67 |
+
| 4.6842 | 24 |
|
| 68 |
+
| 4.3020 | 25 |
|
| 69 |
+
| 4.3645 | 26 |
|
| 70 |
+
| 4.3142 | 27 |
|
| 71 |
+
| 4.1144 | 28 |
|
| 72 |
+
| 4.2619 | 29 |
|
| 73 |
+
| 4.1658 | 30 |
|
| 74 |
+
| 3.9685 | 31 |
|
| 75 |
+
| 4.0776 | 32 |
|
| 76 |
+
| 4.0119 | 33 |
|
| 77 |
+
| 4.0048 | 34 |
|
| 78 |
+
| 3.9660 | 35 |
|
| 79 |
+
| 3.8173 | 36 |
|
| 80 |
+
| 3.8051 | 37 |
|
| 81 |
+
| 3.6915 | 38 |
|
| 82 |
+
| 3.7745 | 39 |
|
| 83 |
|
| 84 |
|
| 85 |
### Framework versions
|
tf_model.h5
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 526681688
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:61aa1799dc56bdd9d3031d86bd230933519f21396ce76ec51f46e637d6fcd677
|
| 3 |
size 526681688
|