Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Caplin43
/
multimodal-vision-language-mini

Image-to-Text
English
vision-encoder-decoder
vision-language
multimodal
image-captioning
transformer
Model card Files Files and versions
xet
Community
multimodal-vision-language-mini
2.94 kB
  • 1 contributor
History: 4 commits
Caplin43's picture
Caplin43
Create tokenizer.json
1206d67 verified 7 days ago
  • .gitattributes
    1.52 kB
    initial commit 7 days ago
  • README.md
    894 Bytes
    Create README.md 7 days ago
  • config.json
    367 Bytes
    Create config.json 7 days ago
  • tokenizer.json
    161 Bytes
    Create tokenizer.json 7 days ago