mrarish320 committed on
Commit e096aaa · verified · 1 Parent(s): 29d4706

End of training
README.md CHANGED
@@ -1,7 +1,7 @@
 ---
 library_name: transformers
-license: apache-2.0
-base_model: bert-base-uncased
+license: mit
+base_model: microsoft/phi-1_5
 tags:
 - generated_from_trainer
 model-index:
@@ -14,9 +14,9 @@ should probably proofread and complete it, then remove this comment. -->

 # results

-This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased) on the None dataset.
+This model is a fine-tuned version of [microsoft/phi-1_5](https://huggingface.co/microsoft/phi-1_5) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.8851
+- Loss: 0.8877

 ## Model description

@@ -35,25 +35,27 @@ More information needed
 ### Training hyperparameters

 The following hyperparameters were used during training:
-- learning_rate: 5e-05
-- train_batch_size: 8
-- eval_batch_size: 8
+- learning_rate: 1e-05
+- train_batch_size: 2
+- eval_batch_size: 2
 - seed: 42
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 8
-- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 500
-- num_epochs: 1
+- num_epochs: 2
+- mixed_precision_training: Native AMP

 ### Training results

 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.9405        | 1.0   | 50   | 0.8851          |
+| 6.0811        | 1.0   | 7    | 1.0800          |
+| 6.0811        | 1.8   | 12   | 0.8877          |


 ### Framework versions

-- Transformers 4.46.2
+- Transformers 4.47.1
 - Pytorch 2.5.1+cu121
-- Datasets 3.1.0
-- Tokenizers 0.20.3
+- Tokenizers 0.21.0

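The updated card lists both a per-device batch size and a `total_train_batch_size`; the latter is derived from the former rather than set directly. A minimal sketch of that arithmetic, assuming single-device training (the card does not state a device count):

```python
# Values taken from the updated model card diff above.
train_batch_size = 2             # per-device train batch size
gradient_accumulation_steps = 4

# Assumption: one device; nothing in the card indicates multi-GPU training.
num_devices = 1

# Effective batch size seen by the optimizer per update step,
# reported in the card as "total_train_batch_size: 8".
total_train_batch_size = train_batch_size * gradient_accumulation_steps * num_devices
print(total_train_batch_size)  # 8
```

With only 8 effective samples per step, the 12 optimizer steps over 2 epochs shown in the training-results table imply a very small dataset, which is consistent with the "unknown dataset" wording.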
generation_config.json ADDED
@@ -0,0 +1,4 @@
+{
+  "_from_model_config": true,
+  "transformers_version": "4.47.1"
+}
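The added file is the minimal generation config exported alongside the model; `_from_model_config: true` indicates the generation settings were derived from the model config rather than set explicitly. Parsed with the standard library, it contains only two keys:

```python
import json

# Contents of the generation_config.json added in this commit (verbatim).
config_text = """{
  "_from_model_config": true,
  "transformers_version": "4.47.1"
}"""

config = json.loads(config_text)
print(sorted(config))                # ['_from_model_config', 'transformers_version']
print(config["_from_model_config"])  # True
```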
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:77766be96c509ea44c8da1c15402ef6a1f002679599e2e2c2975877fffd539b1
+oid sha256:a12095a1bf17aa4b24781a844d4473967a914ef1227b8bf0426809f47ad3900f
 size 4984916152
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0c91ef643478afe943b3df3720627d7795f285779f445567d2225463728d5c9d
+oid sha256:4c0b2480555bad2a5939b7f9f33d8905bfbff11a215682bf836f63e0ee479465
 size 688204064
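The two `.safetensors` diffs change Git LFS pointer files, not the weight blobs themselves: each pointer records a SHA-256 of the real file and its size in bytes, and only the hashes changed here (the sizes are identical, as expected for retrained weights of the same architecture). A small sketch that parses the updated pointer for the second shard — the parsing code is ours, not part of git-lfs:

```python
import re

# Updated LFS pointer for model-00002-of-00002.safetensors (verbatim from the diff).
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:4c0b2480555bad2a5939b7f9f33d8905bfbff11a215682bf836f63e0ee479465
size 688204064
"""

# A pointer file stores a 64-hex-digit SHA-256 of the actual blob plus its byte size.
match = re.search(r"^oid sha256:([0-9a-f]{64})$\n^size (\d+)$", pointer, re.MULTILINE)
oid, size_bytes = match.group(1), int(match.group(2))
print(len(oid))          # 64
print(size_bytes / 1e6)  # ~688.2 (MB)
```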