Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Xenova
/
vit-gpt2-image-captioning
like
28
Image-to-Text
Transformers.js
ONNX
vision-encoder-decoder
image-text-to-text
image-captioning
Model card
Files
Files and versions
xet
Community
3
Use this model
main
vit-gpt2-image-captioning
/
vocab.json
Xenova
HF Staff
Upload 16 files
54b5f17
almost 3 years ago
raw
Copy download link
history
contribute
delete
Safe
798 kB
File too large to display, you can
check the raw version
instead.