Skyler215/VIT_Captioning
Image-Text-to-Text
Transformers
TensorBoard
Safetensors
vision-encoder-decoder
Generated from Trainer
main
VIT_Captioning/added_tokens.json
Skyler215: End of training (commit 4ac5dbc, verified), about 1 year ago
Safe
83 Bytes
{
  "<|endoftext|>": 50257,
  "[CLS]": 50258,
  "[PAD]": 50259,
  "[SEP]": 50260
}
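
The mapping above assigns each added special token an integer ID. As a minimal sketch of how such a file can be read and sanity-checked, the Python below parses the same four entries with the standard library, builds the inverse ID-to-token lookup, and checks that the IDs form one unbroken range. The helper names `load_added_tokens` and `is_contiguous` are hypothetical and chosen only for illustration. The JSON is embedded as a string so the example runs without downloading the repository. Nothing here reflects how the model's own tokenizer loads the file.

```python
import json

# Contents of added_tokens.json, copied from the listing above.
ADDED_TOKENS_JSON = """
{
  "<|endoftext|>": 50257,
  "[CLS]": 50258,
  "[PAD]": 50259,
  "[SEP]": 50260
}
"""


def load_added_tokens(text):
    """Parse the token-to-ID mapping and return it with its inverse.

    Hypothetical helper for illustration only.
    """
    token_to_id = json.loads(text)
    id_to_token = {idx: tok for tok, idx in token_to_id.items()}
    # If two tokens shared an ID, the inverse mapping would be shorter.
    if len(id_to_token) != len(token_to_id):
        raise ValueError("duplicate token IDs in added_tokens.json")
    return token_to_id, id_to_token


def is_contiguous(ids):
    """Return True when the IDs form one unbroken integer range."""
    ordered = sorted(ids)
    return ordered == list(range(ordered[0], ordered[0] + len(ordered)))


token_to_id, id_to_token = load_added_tokens(ADDED_TOKENS_JSON)

# Look up in both directions.
print(token_to_id["[PAD]"])
print(id_to_token[50257])
print(is_contiguous(token_to_id.values()))
```

A check like this can catch a hand-edited file in which two tokens collide on one ID, or in which a gap has been left between the added IDs.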