End of training

Browse files

Files changed (3) hide show

README.md +2 -23
pytorch_model.bin +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -3,9 +3,6 @@ license: mit
 base_model: microsoft/git-base
 tags:
 - generated_from_trainer
-metrics:
-- wer
-- rouge
 model-index:
 - name: git-base-env
   results: []
@@ -17,13 +14,6 @@ should probably proofread and complete it, then remove this comment. -->
 # git-base-env
 This model is a fine-tuned version of [microsoft/git-base](https://huggingface.co/microsoft/git-base) on an unknown dataset.
-It achieves the following results on the evaluation set:
-- Loss: 8.4059
-- Wer: 59.0853
-- Rouge1: 1.86
-- Rouge2: 0.57
-- Rougel: 1.63
-- Rougelsum: 1.63
 ## Model description
@@ -42,24 +32,13 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 5e-07
 - train_batch_size: 1
 - eval_batch_size: 1
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
-### Training results
-| Training Loss | Epoch | Step | Validation Loss | Wer     | Rouge1 | Rouge2 | Rougel | Rougelsum |
-|:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:------:|:------:|:---------:|
-| 8.6175        | 0.6   | 30   | 8.4884          | 59.0612 | 1.84   | 0.5    | 1.56   | 1.56      |
-| 8.5757        | 1.2   | 60   | 8.4512          | 58.9443 | 1.81   | 0.53   | 1.57   | 1.57      |
-| 8.541         | 1.8   | 90   | 8.4258          | 59.2653 | 1.86   | 0.56   | 1.62   | 1.62      |
-| 8.5196        | 2.4   | 120  | 8.4114          | 58.9926 | 1.84   | 0.57   | 1.62   | 1.62      |
-| 8.5091        | 3.0   | 150  | 8.4059          | 59.0853 | 1.86   | 0.57   | 1.63   | 1.63      |
 ### Framework versions

 base_model: microsoft/git-base
 tags:
 - generated_from_trainer
 model-index:
 - name: git-base-env
   results: []
 # git-base-env
 This model is a fine-tuned version of [microsoft/git-base](https://huggingface.co/microsoft/git-base) on an unknown dataset.
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-05
 - train_batch_size: 1
 - eval_batch_size: 1
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d800d6d8cd07558d0d467f48d17ae16cde75faf67dabc59c77bcbf2ad206d512
 size 706584273

 version https://git-lfs.github.com/spec/v1
+oid sha256:2bb19154d60ae63b3c0f48b8bba14ade2d67ace854ee6b7101f0d428c35d9946
 size 706584273

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:911607e0512ff0a1471e3a02ab86e58868a12ada2fdd3f3a320d088d33aea6b8
 size 4027

 version https://git-lfs.github.com/spec/v1
+oid sha256:aa9fbc2bc75f566f204e7b3a142a4d8b40f24d65a9cb39b7cfe674c3aef4e23b
 size 4027