nlpconnect
/

vit-gpt2-image-captioning

vision-encoder-decoder

image-text-to-text

image-captioning

Model card Files Files and versions

Any good?

#1

by Nabil - opened Jun 7, 2022

Hi,

how accurate and how fast does it generate the captions.

Thank you

NLP Connect org Jun 20, 2022

This image captioning model was trained by @ydshieh in flax, this is the PyTorch version of https://huggingface.co/ydshieh/vit-gpt2-coco-en-ckpts model.

Please see this: https://huggingface.co/ydshieh/vit-gpt2-coco-en-ckpts/tensorboard

NLP Connect org Nov 27, 2022

https://ankur3107.github.io/blogs/the-illustrated-image-captioning-using-transformers

ankur310794 changed discussion status to closed Nov 27, 2022

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment