End of training

Browse files

Files changed (3) hide show

README.md +18 -17
generation_config.json +1 -2
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -1,6 +1,7 @@
 ---
-base_model: facebook/bart-base
 license: apache-2.0
 tags:
 - generated_from_trainer
 model-index:
@@ -15,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5106
 ## Model description
@@ -38,7 +39,7 @@ The following hyperparameters were used during training:
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
-- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 1486
 - num_epochs: 10
@@ -47,21 +48,21 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
-| 0.7243        | 1.0   | 1487  | 0.6370          |
-| 0.6341        | 2.0   | 2974  | 0.5809          |
-| 0.6233        | 3.0   | 4461  | 0.5453          |
-| 0.5423        | 4.0   | 5948  | 0.5398          |
-| 0.5203        | 5.0   | 7435  | 0.5270          |
-| 0.4592        | 6.0   | 8922  | 0.5196          |
-| 0.4737        | 7.0   | 10409 | 0.5184          |
-| 0.4591        | 8.0   | 11896 | 0.5141          |
-| 0.4384        | 9.0   | 13383 | 0.5106          |
-| 0.4199        | 10.0  | 14870 | 0.5128          |
 ### Framework versions
-- Transformers 4.44.0
-- Pytorch 2.4.0+cu124
-- Datasets 2.21.0
-- Tokenizers 0.19.1

 ---
+library_name: transformers
 license: apache-2.0
+base_model: facebook/bart-base
 tags:
 - generated_from_trainer
 model-index:
 This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5104
 ## Model description
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
+- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 1486
 - num_epochs: 10
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
+| 0.7257        | 1.0   | 1487  | 0.6373          |
+| 0.6347        | 2.0   | 2974  | 0.5829          |
+| 0.6218        | 3.0   | 4461  | 0.5461          |
+| 0.5425        | 4.0   | 5948  | 0.5402          |
+| 0.5202        | 5.0   | 7435  | 0.5261          |
+| 0.46          | 6.0   | 8922  | 0.5193          |
+| 0.4749        | 7.0   | 10409 | 0.5174          |
+| 0.4587        | 8.0   | 11896 | 0.5141          |
+| 0.439         | 9.0   | 13383 | 0.5104          |
+| 0.42          | 10.0  | 14870 | 0.5126          |
 ### Framework versions
+- Transformers 4.48.3
+- Pytorch 2.6.0+cu126
+- Datasets 3.0.1
+- Tokenizers 0.21.0

generation_config.json CHANGED Viewed

@@ -9,6 +9,5 @@
   "no_repeat_ngram_size": 3,
   "num_beams": 4,
   "pad_token_id": 1,
-  "transformers_version": "4.44.0",
-  "use_cache": false
 }

   "no_repeat_ngram_size": 3,
   "num_beams": 4,
   "pad_token_id": 1,
+  "transformers_version": "4.48.3"
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:77932635c7d253ec82b49abcdcca5e3f26377b5296c241122f8dcefb8442dd2c
 size 557921848

 version https://git-lfs.github.com/spec/v1
+oid sha256:8f25fd538cc5d0d90b14bdeff8a72fd21c29a33e028ae4366cb2e0c4ca5ab2c5
 size 557921848