ArtifactAI
commited on
Commit
·
b958628
1
Parent(s):
e0babc4
Update README.md
Browse files
README.md
CHANGED
|
@@ -5,11 +5,13 @@ datasets:
|
|
| 5 |
- billsum
|
| 6 |
---
|
| 7 |
|
| 8 |
-
|
|
|
|
| 9 |
|
| 10 |
-
|
| 11 |
|
| 12 |
|
|
|
|
| 13 |
```
|
| 14 |
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
|
| 15 |
|
|
|
|
| 5 |
- billsum
|
| 6 |
---
|
| 7 |
|
| 8 |
+
# Longformer Encoder-Decoder (LED) fine-tuned on Billsum
|
| 9 |
+
This model is a fine-tuned version of led-large-16384 on the billsum dataset.
|
| 10 |
|
| 11 |
+
As described in Longformer: The Long-Document Transformer by Iz Beltagy, Matthew E. Peters, Arman Cohan, led-base-16384 was initialized from bart-base since both models share the exact same architecture. To be able to process 16K tokens, bart-base's position embedding matrix was simply copied 16 times.
|
| 12 |
|
| 13 |
|
| 14 |
+
# Use In Transformers
|
| 15 |
```
|
| 16 |
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
|
| 17 |
|