Commit ·
329e204
1
Parent(s): 4dea387
update ckpt, bleu
README.md
CHANGED
|
@@ -71,21 +71,22 @@ optimizer used is Adam with a learning rate of 3e-4, β1 = 0.9, β2 = 0.98 and
|
|
| 71 |
ε = 1e-8, a weight decay of 0.01, learning rate warmup for 1,000 steps and linear decay of the learning
|
| 72 |
rate after.
|
| 73 |
|
| 74 |
-
We tracked experiments using Tensorboard which can be found in `Training Metrics` tab.
|
| 75 |
|
| 76 |
#### **Pretraining Results 📊**
|
| 77 |
|
| 78 |
-
Our model reached **eval loss of ~2.6** around ~
|
| 79 |
|
| 80 |
|Language |BLEU-1|BLEU-2|BLEU-3|BLEU-4|
|
| 81 |
|--------------------------|------|------|------|------|
|
| 82 |
-
|English | 0.
|
| 83 |
-
|Spanish | 0.
|
| 84 |
-
|German | 0.
|
| 85 |
-
|French | 0.
|
| 86 |
|
| 87 |
-
|
| 88 |
|
|
|
|
| 89 |
|
| 90 |
## **App Demo**
|
| 91 |
|
|
|
|
| 71 |
ε = 1e-8, a weight decay of 0.01, learning rate warmup for 1,000 steps and linear decay of the learning
|
| 72 |
rate after.
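
The schedule described above (linear warmup for 1,000 steps, then linear decay) can be sketched in plain Python as follows. This is a minimal illustration, not the project's training code: the function name `learning_rate` and the `total_steps` value are assumptions, since the README does not state the total number of training steps, and decaying all the way to zero is also assumed.

```python
def learning_rate(step, peak_lr=3e-4, warmup_steps=1_000, total_steps=60_000):
    """Linear warmup to peak_lr, then linear decay to zero at total_steps.

    peak_lr and warmup_steps come from the README text; total_steps is a
    hypothetical value chosen only for illustration.
    """
    if step < warmup_steps:
        # Warmup phase: scale linearly from 0 up to peak_lr.
        return peak_lr * step / warmup_steps
    # Decay phase: scale linearly from peak_lr down to 0, clamped at 0.
    remaining = max(total_steps - step, 0)
    return peak_lr * remaining / (total_steps - warmup_steps)


if __name__ == "__main__":
    for s in (0, 500, 1_000, 30_000, 60_000):
        print(s, learning_rate(s))
```

In a Flax/Optax setup the same shape is usually built from the library's schedule helpers rather than written by hand; the function above only makes the arithmetic explicit.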
|
| 73 |
|
| 74 |
+
We tracked experiments using TensorBoard; the logs can be found in the `Training Metrics` tab. The BLEU scores tracked there for languages other than English may be incorrect, but the evaluation scores show that the model also performs well in those languages.
|
| 75 |
|
| 76 |
#### **Pretraining Results 📊**
|
| 77 |
|
| 78 |
+
Our model reached an **eval loss of ~2.6** at around 60k steps. The BLEU scores (on a scale of 0 to 1) for each language are:
|
| 79 |
|
| 80 |
|Language |BLEU-1|BLEU-2|BLEU-3|BLEU-4|
|
| 81 |
|--------------------------|------|------|------|------|
|
| 82 |
+
|English | 0.13083| 0.08887| 0.06681| 0.04899|
|
| 83 |
+
|Spanish | 0.15981| 0.09858| 0.06918| 0.04776|
|
| 84 |
+
|German | 0.14234| 0.09817| 0.07405| 0.0515|
|
| 85 |
+
|French | 0.13021| 0.08862| 0.06598| 0.04647|
|
| 86 |
|
| 87 |
+
Checkpoint used for evaluation: `ckpt-51999/`
|
| 88 |
|
| 89 |
+
To reproduce these results, use the [evaluation script](https://github.com/gchhablani/multilingual-image-captioning/blob/main/evaluation.py) available in this project's repository.
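
For readers unfamiliar with the BLEU-1 to BLEU-4 columns, the sketch below shows how a cumulative BLEU-n score on a 0 to 1 scale is commonly defined: the geometric mean of clipped n-gram precisions up to order n, multiplied by a brevity penalty. It is not the project's evaluation script, and it may differ from it in tokenization, smoothing, and corpus-level aggregation. The helper names `ngrams` and `bleu` are hypothetical, and a single reference per candidate is assumed.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams of order n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(candidate, reference, max_n):
    """Cumulative BLEU-max_n for one candidate against one reference.

    Both arguments are lists of tokens. No smoothing is applied, so the
    score is 0.0 as soon as any n-gram order has no match.
    """
    if not candidate:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        cand = ngrams(candidate, n)
        ref = ngrams(reference, n)
        # Clipped counts: a candidate n-gram is credited at most as many
        # times as it appears in the reference.
        overlap = sum(min(count, ref[gram]) for gram, count in cand.items())
        total = sum(cand.values())
        if overlap == 0 or total == 0:
            return 0.0
        log_precisions.append(math.log(overlap / total))
    # Brevity penalty: penalize candidates shorter than the reference.
    if len(candidate) >= len(reference):
        bp = 1.0
    else:
        bp = math.exp(1 - len(reference) / len(candidate))
    return bp * math.exp(sum(log_precisions) / max_n)


if __name__ == "__main__":
    ref = "a dog runs on the grass".split()
    cand = "a dog runs on grass".split()
    for n in range(1, 5):
        print(f"BLEU-{n}: {bleu(cand, ref, n):.5f}")
```

Because each higher order multiplies in another precision term that is at most 1, the scores can only stay equal or fall from BLEU-1 to BLEU-4, which matches the pattern in the table above.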
|
| 90 |
|
| 91 |
## **App Demo**
|
| 92 |
|
flax_model.msgpack
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 2188672582
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e19869c5129cb6338a9213928b4f0f4bf9dd6dfb418db64eaf1a1cd74af8e438
|
| 3 |
size 2188672582
|
logs/{events.out.tfevents.1626777663.t1v-n-e32efd45-w-0.1846718.3.v2 → events.out.tfevents.1626901021.t1v-n-e32efd45-w-0.2243015.3.v2}
RENAMED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:08b67ae69db766025a6e695b192868ab8e00f02bc3ebe79b139f9066d8bdc677
|
| 3 |
+
size 9032126
|