alexfabbri/multi_news
Updated • 5.82k • 79
How to use cs608/multi-news-diff-weight with Transformers:
# Use a pipeline as a high-level helper
# Warning: Pipeline type "summarization" is no longer supported in transformers v5.
# You must load the model directly (see below) or downgrade to v4.x with:
# 'pip install "transformers<5.0.0'
from transformers import pipeline
pipe = pipeline("summarization", model="cs608/multi-news-diff-weight") # Load model directly
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
tokenizer = AutoTokenizer.from_pretrained("cs608/multi-news-diff-weight")
model = AutoModelForSeq2SeqLM.from_pretrained("cs608/multi-news-diff-weight")This model is a fine-tuned version of facebook/bart-base on the multi_news dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
|---|---|---|---|---|---|---|---|
| 2.75 | 1.0 | 19225 | 2.4494 | 9.5021 | 3.5429 | 7.3531 | 8.6912 |
| 2.456 | 2.0 | 38450 | 2.3665 | 9.8103 | 3.8494 | 7.6256 | 8.9991 |
| 2.285 | 3.0 | 57675 | 2.3427 | 9.815 | 3.8774 | 7.6169 | 8.9863 |