| | --- |
| | tags: |
| | - generated_from_trainer |
| | model-index: |
| | name: Persian-Image-Captioning |
| | --- |
| | |
| | <!-- This model card has been generated automatically according to the information the Trainer had access to. You |
| | should probably proofread and complete it, then remove this comment. --> |
| |
|
| | # Persian-Image-Captioning |
| |
|
| | This model is a fine-tuned version of [Vision Encoder Decoder](https://huggingface.co/docs/transformers/model_doc/vision-encoder-decoder) on coco-flickr-farsi. |
| |
|
| |
|
| | ### Framework versions |
| |
|
| | - Transformers 4.12.5 |
| | - Pytorch 1.9.1 |
| | - Datasets 1.16.1 |
| | - Tokenizers 0.10.3 |
| |
|