metadata
license: apache-2.0
tags:
  - generated_from_trainer
metrics:
  - rouge
model-index:
  - name: bart-model2-0611-e4
    results: []

bart-model2-0611-e4

This model is a fine-tuned version of theojolliffe/bart-paraphrase-v4-e1-feedback. The training dataset was not recorded (the auto-generated card lists it as None). It achieves the following results on the evaluation set:

  • Loss: 0.1594
  • Rouge1: 60.6252
  • Rouge2: 57.2694
  • RougeL: 60.2575
  • RougeLsum: 60.3721
  • Gen Len: 20.0
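
A minimal sketch of how a checkpoint like this might be loaded and run with the Transformers library. The Hub id `theojolliffe/bart-model2-0611-e4` is inferred from the author and model name and is an assumption, as are the helper names `generation_kwargs` and `generate_text` and the beam-search setting. The default `max_length=20` mirrors the Gen Len of 20.0 reported above, which suggests evaluation outputs were capped at 20 tokens. That reading is also an assumption.

```python
# Assumed Hub id, inferred from the author name and model name in this card.
MODEL_ID = "theojolliffe/bart-model2-0611-e4"


def generation_kwargs(max_length=20, num_beams=4):
    """Illustrative generation settings; neither value is documented in this card."""
    return {"max_length": max_length, "num_beams": num_beams}


def generate_text(text, model_id=MODEL_ID, **overrides):
    """Load the checkpoint and generate one output string for `text`."""
    # Imported lazily so the module can be inspected without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    settings = {**generation_kwargs(), **overrides}
    output_ids = model.generate(**inputs, **settings)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_text("some input sentence", max_length=64)` would raise the length cap if 20 tokens turns out to truncate outputs.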

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 2
  • eval_batch_size: 2
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 4
  • mixed_precision_training: Native AMP
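
To make the `lr_scheduler_type: linear` setting concrete, here is a small pure-Python sketch of a linear learning-rate schedule. It assumes no warmup (none is listed above) and a total of 2496 optimizer steps, taken from the final row of the training results. The function name `linear_lr` is hypothetical and this is not the library's own implementation.

```python
def linear_lr(step, base_lr=2e-5, total_steps=2496, warmup_steps=0):
    """Learning rate at `step` under linear warmup followed by linear decay to zero."""
    if step < warmup_steps:
        # Ramp up from 0 to base_lr over the warmup period (unused when warmup_steps=0).
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Learning rate at the start of training and at each epoch boundary.
    for step in (0, 624, 1248, 1872, 2496):
        print(step, linear_lr(step))
```

Under these assumptions the rate starts at 2e-05, is halved by the end of epoch 2, and reaches zero at the last step.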

Training results

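| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | RougeL  | RougeLsum | Gen Len |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
| 0.5809        | 1.0   | 624  | 0.3684          | 58.0755 | 49.7811 | 56.3216 | 56.4602   | 19.8485 |
| 0.2627        | 2.0   | 1248 | 0.2291          | 58.4775 | 54.4415 | 58.3879 | 58.1972   | 20.0    |
| 0.1537        | 3.0   | 1872 | 0.1848          | 60.3497 | 56.6674 | 60.1125 | 59.9956   | 20.0    |
| 0.1152        | 4.0   | 2496 | 0.1594          | 60.6252 | 57.2694 | 60.2575 | 60.3721   | 20.0    |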
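
The rows above also allow some quick arithmetic. The sketch below copies the table values and derives the steps per epoch, a rough upper bound on the training set size, and the per-epoch change in a metric. The size estimate assumes a single device and no gradient accumulation, neither of which is stated in this card. All names in the code are hypothetical.

```python
# Rows copied from the training results table:
# (epoch, step, train_loss, val_loss, rouge1, rouge2, rougeL, rougeLsum)
ROWS = [
    (1, 624, 0.5809, 0.3684, 58.0755, 49.7811, 56.3216, 56.4602),
    (2, 1248, 0.2627, 0.2291, 58.4775, 54.4415, 58.3879, 58.1972),
    (3, 1872, 0.1537, 0.1848, 60.3497, 56.6674, 60.1125, 59.9956),
    (4, 2496, 0.1152, 0.1594, 60.6252, 57.2694, 60.2575, 60.3721),
]
TRAIN_BATCH_SIZE = 2  # from the hyperparameter list
ROUGE2 = 5  # column index of Rouge2 in each row


def steps_per_epoch(rows):
    """Optimizer steps in one epoch, from the first logged row."""
    epoch, step = rows[0][0], rows[0][1]
    return step // epoch


def max_train_examples(rows, batch_size):
    """Upper bound on training examples, assuming one device and no accumulation."""
    return steps_per_epoch(rows) * batch_size


def metric_deltas(rows, column):
    """Change in one metric column between consecutive epochs."""
    return [round(b[column] - a[column], 4) for a, b in zip(rows, rows[1:])]


if __name__ == "__main__":
    print("steps per epoch:", steps_per_epoch(ROWS))
    print("max training examples:", max_train_examples(ROWS, TRAIN_BATCH_SIZE))
    print("Rouge2 change per epoch:", metric_deltas(ROWS, ROUGE2))
```

The Rouge2 gains shrink with each epoch, and validation loss is still falling at epoch 4, so the table alone does not show whether training had converged.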

Framework versions

  • Transformers 4.24.0
  • PyTorch 1.12.1+cu113
  • Datasets 2.6.1
  • Tokenizers 0.13.1