---
license: apache-2.0
tags:
- generated_from_trainer
metrics:
- rouge
model-index:
- name: bart-base-finetuned-xsum
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# bart-base-finetuned-xsum

This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on an unknown dataset.
It achieves the following results on the evaluation set:
- Loss: 0.5585
- Rouge1: 0.8859
- Rouge2: 0.8467
- Rougel: 0.8883
- Rougelsum: 0.8879

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 30

### Training results

| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
| No log        | 1.0   | 41   | 0.4596          | 0.8845 | 0.849  | 0.8874 | 0.8865    |
| No log        | 2.0   | 82   | 0.4047          | 0.8839 | 0.8466 | 0.8852 | 0.8836    |
| No log        | 3.0   | 123  | 0.4587          | 0.8765 | 0.836  | 0.8783 | 0.8755    |
| No log        | 4.0   | 164  | 0.4488          | 0.8785 | 0.8389 | 0.8811 | 0.8784    |
| No log        | 5.0   | 205  | 0.4443          | 0.8564 | 0.8084 | 0.855  | 0.8543    |
| No log        | 6.0   | 246  | 0.4643          | 0.8965 | 0.8614 | 0.8981 | 0.8965    |
| No log        | 7.0   | 287  | 0.4782          | 0.8831 | 0.8468 | 0.885  | 0.8836    |
| No log        | 8.0   | 328  | 0.4870          | 0.853  | 0.8051 | 0.8554 | 0.8541    |
| No log        | 9.0   | 369  | 0.4766          | 0.9029 | 0.8659 | 0.9052 | 0.9027    |
| No log        | 10.0  | 410  | 0.5023          | 0.8924 | 0.8528 | 0.895  | 0.8926    |
| No log        | 11.0  | 451  | 0.5254          | 0.8689 | 0.8234 | 0.8699 | 0.8692    |
| No log        | 12.0  | 492  | 0.4996          | 0.8833 | 0.8424 | 0.8851 | 0.8843    |
| 0.1489        | 13.0  | 533  | 0.5095          | 0.8747 | 0.8345 | 0.8762 | 0.8749    |
| 0.1489        | 14.0  | 574  | 0.5034          | 0.868  | 0.8226 | 0.8699 | 0.8689    |
| 0.1489        | 15.0  | 615  | 0.4976          | 0.8609 | 0.8112 | 0.8632 | 0.8617    |
| 0.1489        | 16.0  | 656  | 0.5122          | 0.9055 | 0.8722 | 0.9068 | 0.9069    |
| 0.1489        | 17.0  | 697  | 0.5204          | 0.845  | 0.7954 | 0.8482 | 0.8461    |
| 0.1489        | 18.0  | 738  | 0.5363          | 0.8911 | 0.8528 | 0.8934 | 0.8919    |
| 0.1489        | 19.0  | 779  | 0.5572          | 0.8943 | 0.8594 | 0.8963 | 0.8956    |
| 0.1489        | 20.0  | 820  | 0.5469          | 0.9031 | 0.8688 | 0.9047 | 0.9047    |
| 0.1489        | 21.0  | 861  | 0.5508          | 0.8848 | 0.8472 | 0.887  | 0.8869    |
| 0.1489        | 22.0  | 902  | 0.5579          | 0.8724 | 0.8306 | 0.8747 | 0.8737    |
| 0.1489        | 23.0  | 943  | 0.5508          | 0.8772 | 0.8397 | 0.8808 | 0.8803    |
| 0.1489        | 24.0  | 984  | 0.5658          | 0.8627 | 0.8153 | 0.8645 | 0.8637    |
| 0.0336        | 25.0  | 1025 | 0.5539          | 0.904  | 0.8702 | 0.9052 | 0.9058    |
| 0.0336        | 26.0  | 1066 | 0.5605          | 0.9004 | 0.8659 | 0.9026 | 0.9017    |
| 0.0336        | 27.0  | 1107 | 0.5589          | 0.899  | 0.8644 | 0.9012 | 0.9005    |
| 0.0336        | 28.0  | 1148 | 0.5558          | 0.8872 | 0.8488 | 0.8894 | 0.889     |
| 0.0336        | 29.0  | 1189 | 0.5570          | 0.8859 | 0.8467 | 0.8883 | 0.8879    |
| 0.0336        | 30.0  | 1230 | 0.5585          | 0.8859 | 0.8467 | 0.8883 | 0.8879    |


### Framework versions

- Transformers 4.26.0.dev0
- Pytorch 1.13.0
- Datasets 2.8.0
- Tokenizers 0.13.2