Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
gurumurthy3
/
vision-gpt-flickr8k_v2
like
0
Image-to-Text
jxie/flickr8k
English
vision
image-captioning
gpt2
vision-transformer
flickr8k
multimodal
cross-attention
License:
mit
Model card
Files
Files and versions
xet
Community
main
vision-gpt-flickr8k_v2
/
model_fp16
/
tokenizer
4.81 MB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
gurumurthy3
Upload Vision-GPT: FP32 and FP16 versions with fixed quantization
e9d638a
verified
6 months ago
added_tokens.json
Safe
23 Bytes
Upload Vision-GPT: FP32 and FP16 versions with fixed quantization
6 months ago
merges.txt
Safe
456 kB
Upload Vision-GPT: FP32 and FP16 versions with fixed quantization
6 months ago
special_tokens_map.json
Safe
239 Bytes
Upload Vision-GPT: FP32 and FP16 versions with fixed quantization
6 months ago
tokenizer.json
Safe
3.56 MB
Upload Vision-GPT: FP32 and FP16 versions with fixed quantization
6 months ago
tokenizer_config.json
Safe
674 Bytes
Upload Vision-GPT: FP32 and FP16 versions with fixed quantization
6 months ago
vocab.json
Safe
798 kB
Upload Vision-GPT: FP32 and FP16 versions with fixed quantization
6 months ago