Update README.md
Browse files
README.md
CHANGED
|
@@ -12,6 +12,7 @@ pipeline_tag: text2text-generation
|
|
| 12 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/62a7d1e152aa8695f9209345/P-TlY6ia0gLJeJxBA_04g.gif" />
|
| 13 |
</p>
|
| 14 |
<hr>
|
|
|
|
| 15 |
This is a custom huggingface model port of the [PyTorch implementation of the original transformer](https://github.com/ubaada/scratch-transformer) model from 2017 introduced in the paper "[Attention Is All You Need](https://proceedings.neurips.cc/paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf)". This is the 65M parameter base model version trained to do English-to-German translations.
|
| 16 |
|
| 17 |
## Usage:
|
|
|
|
| 12 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/62a7d1e152aa8695f9209345/P-TlY6ia0gLJeJxBA_04g.gif" />
|
| 13 |
</p>
|
| 14 |
<hr>
|
| 15 |
+
|
| 16 |
This is a custom huggingface model port of the [PyTorch implementation of the original transformer](https://github.com/ubaada/scratch-transformer) model from 2017 introduced in the paper "[Attention Is All You Need](https://proceedings.neurips.cc/paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf)". This is the 65M parameter base model version trained to do English-to-German translations.
|
| 17 |
|
| 18 |
## Usage:
|