yunu919
/

pegasus-large-dialogue-summarization

text2text-generation

dialogue-summarization

Model card Files Files and versions

pegasus-large-dialogue-summarization

This model is a fine-tuned version of google/pegasus-large for English dialogue summarization.

It takes an English multi-turn dialogue as input and generates a short English summary.

Model Details

Base model: google/pegasus-large
Task: Dialogue Summarization
Language: English
Framework: Hugging Face Transformers
Training environment: Google Colab with NVIDIA T4 GPU

Training Data

The model was trained on a dialogue summarization dataset with the following fields:

input: dialogue
target: summary

After cleaning, the dataset size was:

train: 14,730
validation: 818
test: 819

For a quicker experiment, training and evaluation were run on subsets:

train: 3,000
validation: 500
test: 500

Preprocessing

The following preprocessing steps were applied:

removed placeholder tokens such as <file_gif>
normalized whitespace
removed empty rows
removed duplicate examples

Training Setup

Max source length: 512
Max target length: 48
Epochs: 2
Learning rate: 2e-5
Beam size: 2
Per-device train batch size: 1
Per-device eval batch size: 1
Gradient accumulation steps: 8
Optimizer: Adafactor

Results

Validation

ROUGE-1: 44.4848
ROUGE-2: 21.2919
ROUGE-L: 37.0605
ROUGE-Lsum: 40.5285

Test

ROUGE-1: 44.5355
ROUGE-2: 20.5233
ROUGE-L: 37.2629
ROUGE-Lsum: 40.2454

Example

Input

Hannah: Hey, do you have Betty's number?
Amanda: Lemme check
Amanda: Sorry, can't find it.
Amanda: Ask Larry
Amanda: He called her last time we were at the park together
Hannah: I don't know him well
Amanda: Don't be shy, he's very nice
Hannah: I'd rather you texted him
Amanda: Just text him 🙂
Hannah: Urgh.. Alright
Hannah: Bye
Amanda: Bye bye

Generated Summary

Betty's number is on Amanda's phone. Amanda will text Larry.

Downloads last month: 1

Safetensors

Model size

0.6B params

Tensor type

F32

·