NortheasternUniversity/big_patent
Viewer • Updated • 2.68M • 66.3k • 71
How to use ShubhamSP/nd_pegasus_bigpatent_cnn_xsum_model with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
tokenizer = AutoTokenizer.from_pretrained("ShubhamSP/nd_pegasus_bigpatent_cnn_xsum_model")
model = AutoModelForSeq2SeqLM.from_pretrained("ShubhamSP/nd_pegasus_bigpatent_cnn_xsum_model")This model is a fine-tuned version of google/pegasus-cnn_dailymail on the big_patent dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|---|---|---|---|---|---|---|---|---|
| 3.5734 | 1.0 | 80 | 3.1804 | 0.3468 | 0.1231 | 0.2262 | 0.2268 | 89.95 |
| 3.3146 | 2.0 | 160 | 3.1037 | 0.3465 | 0.1181 | 0.2258 | 0.227 | 85.75 |