---
language: en
license: apache-2.0
tags:
- summarization
datasets:
- xsum
model-index:
- name: google/roberta2roberta_L-24_bbc
  results:
  - task:
      type: summarization
      name: Summarization
    dataset:
      name: xglue
      type: xglue
      config: mlqa
      split: test.ar
    metrics:
    - type: rouge
      value: 0.0213
      name: ROUGE-1
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYjRhNmFhMDhmNTllNWM3ZjFlMTNjZDcxNDU1ZmU5YmRhZjczNTQwMjQ4MDQzMDY5NGNhZDhkM2EzM2NiODg1MyIsInZlcnNpb24iOjF9.-BNXeOa4KVK5T7kX1oDGpFWE1fiPV0HSafnwbm5cnaxM8c1BnZmfLu8uvRPCgiurWjgSQihk4og4hkcfdvffBQ
    - type: rouge
      value: 0.0019
      name: ROUGE-2
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZTQ0OTBmNDc1NjkxMGM3ODJlMDY1YjJlYTQyNzFhZGYyNzA1NTIyYjkwYWUxNzk0MDc5NjU0NTc1ZDU3YTY0MyIsInZlcnNpb24iOjF9.x0cqOoYSUIMjJnWTF2p9rntIRYJQGbGhm_K_UbkDKiIX9eYmRr9jO7pXHx_35TZVDbkYjG39PZMTwxXxu_I-CQ
    - type: rouge
      value: 0.0201
      name: ROUGE-L
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYjExOTEwZjA5ODQwNWRjMWQ0MGFmYzJiYjdlYmU4YjA0NGZlYTBmY2VkOTQ3YTFkOTUyZjdiOTg0N2E0YmQ3ZCIsInZlcnNpb24iOjF9.C1j9T_WQrGeHmJLDBDXMlLHxPjzb0kaAp7MdNuDno4tehA-WV8QMOEwQTaJc7QtRveeH1wrrsYgKxZ4P-6NEAA
    - type: rouge
      value: 0.0196
      name: ROUGE-LSUM
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZGFiOTVjNTEzNDBlMGIxNTc2MTZhN2I2MjUzN2U1ZGM3ZDVkYjhmNTdjZjA2OGUxZjlkYjY3YjhjODAwMzYwZCIsInZlcnNpb24iOjF9.mUtFHxPvgbB9CPonENnm_lynnps3Z2q5ncKBf0ZkzVc1E28cNN37OE_aUhLTqvdcZ2hwUqS4zHJlndZv1PONAg
    - type: loss
      value: 6.101626396179199
      name: loss
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMmI4M2UxZTE1ZmM4NzhkZjI4NDk1MzUwOGVjNjZlZThkMmVkOGM4YzU3MjRiOTZkMjFjZTU5YTJhNWJhNzJmNSIsInZlcnNpb24iOjF9.M8qQtenTtqRVTBHt23uY1XqRL3hGNJv4JgXeJlHV0xWnc8xp8b2b_ycvOh7FwZzcW2gPUVJmU8hpqozHTUkXDQ
    - type: gen_len
      value: 42.5961
      name: gen_len
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiODExYmMxZDQ3ZWM0Yzc5ZjgxZmI0ZjQ3MGNlMDEwNmNhZDVkMjZmYTU1MzBjZWQ4YWM0MjRhYjM3NDZlMTI0MCIsInZlcnNpb24iOjF9.U2blxgJeU7XDoP8sU4AInQmewyVhgnJ0qPQXGCx5d-cqSsx0WveMpgx93Yxedb_r7mbrVigOwIpZm03M_Y6oBg
---

# Roberta2Roberta_L-24_bbc EncoderDecoder model

The model was introduced in [this paper](https://arxiv.org/abs/1907.12461) by Sascha Rothe, Shashi Narayan, and Aliaksei Severyn, and was first released in [this repository](https://tfhub.dev/google/bertseq2seq/roberta24_bbc/1).

The model is an encoder-decoder model whose encoder and decoder were both initialized from the `roberta-large` checkpoint. It was then fine-tuned for extreme summarization on the BBC XSum dataset, which is linked above.

Disclaimer: This model card was written by the Hugging Face team.

## How to use

You can use this model for extreme summarization, *e.g.*:

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Load the tokenizer and the fine-tuned encoder-decoder checkpoint from the Hub.
tokenizer = AutoTokenizer.from_pretrained("google/roberta2roberta_L-24_bbc")
model = AutoModelForSeq2SeqLM.from_pretrained("google/roberta2roberta_L-24_bbc")

article = """The problem is affecting people using the older
versions of the PlayStation 3, called the "Fat"
model.The problem isn't affecting the newer PS3
Slim systems that have been on sale since
September last year.Sony have also said they are
aiming to have the problem fixed shortly but is
advising some users to avoid using their console
for the time being."We hope to resolve this
problem within the next 24 hours," a statement
reads. "In the meantime, if you have a model other
than the new slim PS3, we advise that you do not
use your PS3 system, as doing so may result in
errors in some functionality, such as recording
obtained trophies, and not being able to restore
certain data."We believe we have identified that
this problem is being caused by a bug in the clock
functionality incorporated in the system."The
PlayStation Network is used by millions of people
around the world.It allows users to play their
friends at games like Fifa over the internet and
also do things like download software or visit
online stores."""
# Collapse the display line breaks so the article is passed as a single paragraph.
article = " ".join(article.split())

# Tokenize the article, generate a summary, and decode the first returned sequence.
input_ids = tokenizer(article, return_tensors="pt").input_ids
output_ids = model.generate(input_ids)[0]
print(tokenizer.decode(output_ids, skip_special_tokens=True))
# should output
# Some Sony PlayStation gamers are being advised to stay away from the network because of a problem with the PlayStation 3 network.
```
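
The example above passes the whole article to the model. Real news articles can be longer than the encoder accepts, so it may help to wrap the same calls in a small helper that truncates the input. The sketch below is one possible way to do that, not part of the original release: the `summarize` and `demo` names are hypothetical, and the default limit of 512 input tokens is an assumption based on the usual RoBERTa position limit, so please check it against the checkpoint's configuration.

```python
def summarize(article, tokenizer, model, max_input_tokens=512, **generate_kwargs):
    """Summarize one article, truncating inputs longer than max_input_tokens.

    max_input_tokens=512 is an assumed limit, not a value taken from the model card.
    """
    # Truncate over-long inputs instead of letting them exceed the encoder's limit.
    inputs = tokenizer(
        article,
        truncation=True,
        max_length=max_input_tokens,
        return_tensors="pt",
    )
    # Extra keyword arguments (e.g. num_beams) are forwarded to generate().
    output_ids = model.generate(
        inputs.input_ids,
        attention_mask=inputs.attention_mask,
        **generate_kwargs,
    )[0]
    return tokenizer.decode(output_ids, skip_special_tokens=True)


def demo():
    # Imported here so that defining summarize() does not load or download anything.
    from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

    name = "google/roberta2roberta_L-24_bbc"
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModelForSeq2SeqLM.from_pretrained(name)
    # Placeholder input; replace with a real article.
    print(summarize("Text of a news article goes here.", tokenizer, model, num_beams=4))
```

Calling `demo()` downloads the checkpoint and prints one summary. Passing the tokenizer and model in as arguments keeps the helper reusable across many articles without reloading the weights.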