Update README.md
Browse files
README.md
CHANGED
|
@@ -8,6 +8,10 @@ language:
|
|
| 8 |
pipeline_tag: text2text-generation
|
| 9 |
---
|
| 10 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 11 |
This is a custom huggingface model port of the [PyTorch implementation of the original transformer](https://github.com/ubaada/scratch-transformer) model from 2017 introduced in the paper "[Attention Is All You Need](https://proceedings.neurips.cc/paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf)". This is the 65M parameter base model version trained to do English-to-German translations.
|
| 12 |
|
| 13 |
## Usage:
|
|
|
|
| 8 |
pipeline_tag: text2text-generation
|
| 9 |
---
|
| 10 |
|
| 11 |
+
<p align="center">
|
| 12 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/62a7d1e152aa8695f9209345/P-TlY6ia0gLJeJxBA_04g.gif" />
|
| 13 |
+
</p>
|
| 14 |
+
<hr>
|
| 15 |
This is a custom huggingface model port of the [PyTorch implementation of the original transformer](https://github.com/ubaada/scratch-transformer) model from 2017 introduced in the paper "[Attention Is All You Need](https://proceedings.neurips.cc/paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf)". This is the 65M parameter base model version trained to do English-to-German translations.
|
| 16 |
|
| 17 |
## Usage:
|