redcaps-dev / todo.md
zamborg's picture
breaks
95ad869

A newer version of the Streamlit SDK is available: 1.52.2

Upgrade

TODO:

  • Fix sample images
  • Allow other image types
  • Allow the model to iteratively sample text
  • Add nucleus size and other advanced options

Please note that this model was explicitly not trained on images of people, and as a result is not designed to caption images with humans. This demo accompanies our paper RedCaps Created by [Karan Desai](mailto:kdexd@umich.edu), [Gaurav Kaul](mailto:kaulg@umich.edu), [Zubin Aysola](mailto:aysola@umich.edu), [Justin Johnson](justincj@umich.edu)