Tural/language-modeling-from-scratch-ml

Generated from Trainer

Tural committed on Oct 15, 2023

Commit adad8f3 · 1 Parent(s): bf6b7a5

End of training

Files changed (2)

README.md +5 -18
pytorch_model.bin +1 -1

README.md CHANGED

@@ -1,21 +1,17 @@
 ---
-license: apache-2.0
-base_model: bert-base-uncased
 tags:
 - generated_from_trainer
 model-index:
-- name: bert-base-uncased-ml
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# bert-base-uncased-ml
-This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased) on the None dataset.
-It achieves the following results on the evaluation set:
-- Loss: 2.1621
 ## Model description
@@ -36,20 +32,11 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
 - train_batch_size: 150
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3.0
-### Training results
-| Training Loss | Epoch | Step  | Validation Loss |
-|:-------------:|:-----:|:-----:|:---------------:|
-| 3.7729        | 1.0   | 14050 | 3.8005          |
-| 2.408         | 2.0   | 28100 | 2.3630          |
-| 2.1739        | 3.0   | 42150 | 2.1621          |
 ### Framework versions

 ---
 tags:
 - generated_from_trainer
 model-index:
+- name: language-modeling-from-scratch-ml
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# language-modeling-from-scratch-ml
+This model was trained from scratch on the None dataset.
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
 - train_batch_size: 150
+- eval_batch_size: 250
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 512
 ### Framework versions
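
For readers unfamiliar with `lr_scheduler_type: linear`, the sketch below shows one way such a schedule can be computed in plain Python, starting from the `learning_rate: 2e-05` listed in the card. The function name `linear_lr`, the `warmup_steps=0` default, and the `TOTAL_STEPS` value are assumptions made for illustration; the commit does not state the warmup or the total number of optimizer steps for this run.

```python
def linear_lr(step: int, total_steps: int,
              base_lr: float = 2e-05, warmup_steps: int = 0) -> float:
    """Learning rate at `step` for a linear warmup followed by linear decay to 0.

    base_lr comes from the model card; warmup_steps=0 is an assumption.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    if step < warmup_steps:
        # Ramp up linearly from 0 to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly from base_lr to 0 over the remaining steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    TOTAL_STEPS = 1000  # hypothetical value, not taken from the commit
    for step in (0, 250, 500, 750, 1000):
        print(step, linear_lr(step, TOTAL_STEPS))
```

With no warmup, the rate starts at the configured value and reaches zero at the final step, so a longer run stretches the same decay over more steps.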

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6171c49f1c7cfd357d61b02ecbfce218bdd9a9f1f7262803a29e82df583722ef
 size 438126133

 version https://git-lfs.github.com/spec/v1
+oid sha256:ec35adfe3a47ead0543eac57dfd99edce63cb78b17a9ea8201e4114bfcdd42e8
 size 438126133
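
The pointer diff above shows a new `oid` with an unchanged `size`, which suggests the weights changed while the file layout stayed the same. A minimal sketch of how a downloaded `pytorch_model.bin` could be checked against such a pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the code assumes the three-line pointer layout shown in this diff.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer (version / oid / size lines) into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and SHA-256 in `pointer`."""
    hasher = hashlib.sha256()
    size = 0
    with open(path, "rb") as f:
        # Read in chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# Pointer contents copied from the added side of the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:ec35adfe3a47ead0543eac57dfd99edce63cb78b17a9ea8201e4114bfcdd42e8
size 438126133
"""
```

Calling `matches_pointer("pytorch_model.bin", parse_lfs_pointer(NEW_POINTER))` on a local copy would then report whether that copy corresponds to this commit, assuming the file was downloaded to that path.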