Update README.md
Browse files
README.md
CHANGED
|
@@ -54,7 +54,7 @@ python train_tranformer.py -c runs/trian/transformer/MHA/config.yaml
|
|
| 54 |
|
| 55 |
| Model Variant | Decoding Strategy | BLEU Score |
|
| 56 |
| --------------------- | ----------------- | ---------- |
|
| 57 |
-
|
|
| 58 |
| | Beam Search | **14.56** |
|
| 59 |
| Transformer (MQA) | Greedy Search | 11.00 |
|
| 60 |
| | Beam Search | 12.10 |
|
|
@@ -66,11 +66,11 @@ python train_tranformer.py -c runs/trian/transformer/MHA/config.yaml
|
|
| 66 |
|
| 67 |
| Alignment Function | Decoding Strategy | BLEU Score |
|
| 68 |
| ------------------------ | ----------------- | ---------- |
|
| 69 |
-
|
|
| 70 |
| | Beam Search | 9.44 |
|
| 71 |
| Multiplicative (general) | Greedy Search | 9.20 |
|
| 72 |
| | Beam Search | 9.88 |
|
| 73 |
-
| Additive (concat) | Greedy Search | 10.44
|
| 74 |
| | Beam Search | 10.09 |
|
| 75 |
|
| 76 |
|
|
|
|
| 54 |
|
| 55 |
| Model Variant | Decoding Strategy | BLEU Score |
|
| 56 |
| --------------------- | ----------------- | ---------- |
|
| 57 |
+
| Transformer (MHA) | Greedy Search | 13.61 |
|
| 58 |
| | Beam Search | **14.56** |
|
| 59 |
| Transformer (MQA) | Greedy Search | 11.00 |
|
| 60 |
| | Beam Search | 12.10 |
|
|
|
|
| 66 |
|
| 67 |
| Alignment Function | Decoding Strategy | BLEU Score |
|
| 68 |
| ------------------------ | ----------------- | ---------- |
|
| 69 |
+
| Dot Product (dot) | Greedy Search | 8.95 |
|
| 70 |
| | Beam Search | 9.44 |
|
| 71 |
| Multiplicative (general) | Greedy Search | 9.20 |
|
| 72 |
| | Beam Search | 9.88 |
|
| 73 |
+
| Additive (concat) | Greedy Search | **10.44** |
|
| 74 |
| | Beam Search | 10.09 |
|
| 75 |
|
| 76 |
|