Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
gurumurthy3
/
vision-gpt-flickr8k_v2
like
0
Image-to-Text
jxie/flickr8k
English
vision
image-captioning
gpt2
vision-transformer
flickr8k
multimodal
cross-attention
License:
mit
Model card
Files
Files and versions
xet
Community
main
vision-gpt-flickr8k_v2
1.51 GB
1 contributor
History:
2 commits
gurumurthy3
Upload Vision-GPT: FP32 and FP16 versions with fixed quantization
e9d638a
verified
3 months ago
model_fp16
Upload Vision-GPT: FP32 and FP16 versions with fixed quantization
3 months ago
model_fp32
Upload Vision-GPT: FP32 and FP16 versions with fixed quantization
3 months ago
.gitattributes
Safe
253 Bytes
Upload Vision-GPT: FP32 and FP16 versions with fixed quantization
3 months ago
README.md
Safe
4.58 kB
Upload Vision-GPT: FP32 and FP16 versions with fixed quantization
3 months ago