Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

voxreality
/
rgb_language_cap

Image-to-Text
Transformers
PyTorch
English
vision-encoder-decoder
image-text-to-text
text-generation-inference
Model card Files Files and versions
xet
Community
1

You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Gated model
You can list files but not access them

Preview of files found in this repository
  • .gitattributes
    1.52 kB
    initial commit over 1 year ago
  • README.md
    1.73 kB
    Update README.md over 1 year ago
  • config.json
    4.85 kB
    Upload 10 files over 1 year ago
  • generation_config.json
    149 Bytes
    Upload 10 files over 1 year ago
  • merges.txt
    456 kB
    Upload 10 files over 1 year ago
  • preprocessor_config.json
    374 Bytes
    Upload 10 files over 1 year ago
  • pytorch_model.bin
    957 MB
    xet
    Upload 10 files over 1 year ago
  • special_tokens_map.json
    131 Bytes
    Upload 10 files over 1 year ago
  • tokenizer.json
    2.11 MB
    Upload 10 files over 1 year ago
  • tokenizer_config.json
    476 Bytes
    Upload 10 files over 1 year ago
  • training_args.bin
    6.14 kB
    xet
    Upload 10 files over 1 year ago
  • vocab.json
    798 kB
    Upload 10 files over 1 year ago