Model save

Browse files

Files changed (2) hide show

README.md +9 -12
runs/Aug11_15-55-50_bf11bf8ef52d/events.out.tfevents.1723391750.bf11bf8ef52d.34.3 +2 -2

README.md CHANGED Viewed

@@ -9,14 +9,14 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/boghdady95/huggingface/runs/uwzjiu3j)
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/boghdady95/huggingface/runs/uwzjiu3j)
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/boghdady95/huggingface/runs/uwzjiu3j)
 # gpt2
 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.3085
 ## Model description
@@ -36,22 +36,19 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 4
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
-| 3.2909        | 1.0   | 5000  | 3.3493          |
-| 3.2351        | 2.0   | 10000 | 3.3100          |
-| 3.2898        | 3.0   | 15000 | 3.3052          |
-| 3.1497        | 4.0   | 20000 | 3.3064          |
-| 3.1327        | 5.0   | 25000 | 3.3085          |
 ### Framework versions

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/boghdady95/huggingface/runs/lbif3rjw)
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/boghdady95/huggingface/runs/lbif3rjw)
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/boghdady95/huggingface/runs/lbif3rjw)
 # gpt2
 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.4161
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 2
+- eval_batch_size: 2
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 1
+- mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
+| 3.3059        | 1.0   | 10000 | 3.4161          |
 ### Framework versions

runs/Aug11_15-55-50_bf11bf8ef52d/events.out.tfevents.1723391750.bf11bf8ef52d.34.3 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:42cef9896e32b27e82012e06bbb49d8df4b1522a5ca507bee2657fa1ece2feda
-size 9264

 version https://git-lfs.github.com/spec/v1
+oid sha256:9ee56e4faf8a621047182570235c780e05fb6f3cd2fd72b4b445e519bd1f6a62
+size 9889