JulianS
/

bart-base-finetuned-summscreen

text2text-generation

Generated from Trainer

Model card Files Files and versions

Metrics Training metrics Community

JulianS commited on Feb 16, 2023

Commit

2ab6c60

·

1 Parent(s): 2349436

update model card README.md

Files changed (1) hide show

README.md +73 -0

README.md ADDED Viewed

	@@ -0,0 +1,73 @@

+---
+license: apache-2.0
+tags:
+- generated_from_trainer
+metrics:
+- rouge
+model-index:
+- name: bart-base-finetuned-summscreen-bestval-100-genlen-10-epochs
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# bart-base-finetuned-summscreen-bestval-100-genlen-10-epochs
+This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 3.0979
+- Rouge1: 31.5373
+- Rouge2: 6.6821
+- Rougel: 18.6754
+- Rougelsum: 27.4448
+- Gen Len: 80.1927
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 1e-05
+- train_batch_size: 1
+- eval_batch_size: 1
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 10
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Rouge1  | Rouge2 | Rougel  | Rougelsum | Gen Len |
+|:-------------:|:-----:|:-----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
+| 3.4849        | 0.99  | 3500  | 3.2071          | 28.6828 | 5.2634 | 17.218  | 25.487    | 94.059  |
+| 3.2933        | 1.99  | 7000  | 3.1329          | 29.9774 | 5.7038 | 17.7705 | 26.2492   | 88.2358 |
+| 3.1088        | 2.98  | 10500 | 3.1010          | 29.6903 | 5.6976 | 17.7468 | 25.9472   | 81.3129 |
+| 2.9605        | 3.98  | 14000 | 3.0811          | 30.2088 | 6.1092 | 18.157  | 26.3051   | 77.8844 |
+| 2.8778        | 4.97  | 17500 | 3.0747          | 30.6996 | 6.3038 | 18.4725 | 26.8669   | 81.6168 |
+| 2.788         | 5.97  | 21000 | 3.0896          | 30.7478 | 6.4468 | 18.3755 | 26.8789   | 85.6395 |
+| 2.7218        | 6.96  | 24500 | 3.0961          | 30.994  | 6.4407 | 18.4929 | 26.9802   | 79.1315 |
+| 2.6753        | 7.96  | 28000 | 3.0892          | 31.336  | 6.6768 | 18.8122 | 27.389    | 83.2313 |
+| 2.5753        | 8.95  | 31500 | 3.0960          | 31.3248 | 6.4093 | 18.6552 | 27.2087   | 80.1474 |
+| 2.5918        | 9.95  | 35000 | 3.0979          | 31.5373 | 6.6821 | 18.6754 | 27.4448   | 80.1927 |
+### Framework versions
+- Transformers 4.26.0
+- Pytorch 1.13.1
+- Datasets 2.9.0
+- Tokenizers 0.13.2