Update README.md
README.md CHANGED
@@ -5,4 +5,15 @@ license: cc-by-nc-nd-4.0
 ## Introduction
 
 This model is defined as proposed in the book "Mastering PyTorch".
-It is based on CNN-encoder and a LSTM-decoder.
+It is based on a CNN encoder and an LSTM decoder.
+
+The CNN encoder is based on a pretrained ResNet-152. The last layer of the ResNet is replaced by an embedding layer that outputs a 256-element vector.
+The LSTM decoder takes inputs of size 256, has a hidden layer of size 512, and its output dimension matches the vocabulary size.
+
+The model was trained purely as a learning exercise, so its performance remains modest.
+
+## Training procedure
+
+For the sake of the exercise, the model was trained for only 5 epochs.
+
+It was trained on the COCO dataset.
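
A 5-epoch training run of this kind might look roughly like the sketch below. The README states only the epoch count and the dataset, so everything else is an assumption made for illustration: the Adam optimizer, the learning rate, the cross-entropy loss with teacher forcing, the hypothetical `TinyCaptioner` stand-in (a linear "encoder" instead of ResNet-152, to keep the example fast), the placeholder vocabulary size, and the random tensors used in place of COCO batches.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

VOCAB_SIZE = 50  # placeholder; the real value depends on the COCO vocabulary
EMBED_SIZE, HIDDEN_SIZE = 256, 512  # sizes stated in the README
NUM_EPOCHS = 5  # epoch count stated in the README


class TinyCaptioner(nn.Module):
    """Hypothetical stand-in model: linear 'encoder' plus LSTM decoder."""

    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, EMBED_SIZE))
        self.embed = nn.Embedding(VOCAB_SIZE, EMBED_SIZE)
        self.lstm = nn.LSTM(EMBED_SIZE, HIDDEN_SIZE, batch_first=True)
        self.fc = nn.Linear(HIDDEN_SIZE, VOCAB_SIZE)

    def forward(self, images, captions):
        feats = self.encoder(images).unsqueeze(1)  # (B, 1, E)
        # Teacher forcing: image embedding, then every token except the last.
        inputs = torch.cat([feats, self.embed(captions[:, :-1])], dim=1)
        hidden, _ = self.lstm(inputs)
        return self.fc(hidden)  # (B, T, V), aligned with captions


def train(model, batches, num_epochs=NUM_EPOCHS, lr=1e-3):
    """Run a plain training loop and return the mean loss of each epoch."""
    criterion = nn.CrossEntropyLoss()
    optimizer = torch.optim.Adam(model.parameters(), lr=lr)
    history = []
    for _ in range(num_epochs):
        total = 0.0
        for images, captions in batches:
            logits = model(images, captions)
            loss = criterion(logits.reshape(-1, VOCAB_SIZE), captions.reshape(-1))
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
            total += loss.item()
        history.append(total / len(batches))
    return history


# Random tensors standing in for (image, caption) batches from COCO.
batches = [
    (torch.randn(4, 3, 32, 32), torch.randint(0, VOCAB_SIZE, (4, 6)))
    for _ in range(3)
]
losses = train(TinyCaptioner(), batches)
```

With real data, the random batches would presumably be replaced by a `DataLoader` over COCO image/caption pairs (torchvision ships a `CocoCaptions` dataset class), with captions tokenized and padded. This sketch does not handle padding tokens in the loss.
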