gremlin97
/

remote_sensing_gpt

Generated from Trainer

Model card Files Files and versions

Metrics Training metrics Community

gremlin97 commited on Mar 10, 2024

Commit

9bdc901

·

verified ·

1 Parent(s): d755b36

End of training

Files changed (2) hide show

README.md +6 -13
adapter_model.safetensors +1 -1

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [bigscience/bloom-1b1](https://huggingface.co/bigscience/bloom-1b1) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.3832
 ## Model description
@@ -35,28 +35,21 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0002
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 5.0298        | 1.0   | 829  | 4.7124          |
-| 4.681         | 2.0   | 1658 | 4.5977          |
-| 4.6154        | 3.0   | 2487 | 4.5345          |
-| 4.5244        | 4.0   | 3316 | 4.4902          |
-| 4.4751        | 5.0   | 4145 | 4.4582          |
-| 4.4559        | 6.0   | 4974 | 4.4347          |
-| 4.4124        | 7.0   | 5803 | 4.4158          |
-| 4.3982        | 8.0   | 6632 | 4.4015          |
-| 4.3716        | 9.0   | 7461 | 4.3889          |
-| 4.341         | 10.0  | 8290 | 4.3832          |
 ### Framework versions

 This model is a fine-tuned version of [bigscience/bloom-1b1](https://huggingface.co/bigscience/bloom-1b1) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 4.8886
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0003
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 3
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 415  | 5.0368          |
+| 5.1787        | 2.0   | 830  | 4.9311          |
+| 4.9154        | 3.0   | 1245 | 4.8886          |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:30d9a576b1600f1067e69976c1392cf8d294095d7aa81e05278f5a36a3c01638
 size 9444296

 version https://git-lfs.github.com/spec/v1
+oid sha256:2b1f92cf0cc7bdaeffcd0aac8912d774561c43ae896e84f43f6ecf517c0d0181
 size 9444296