Annotate and describe images with text prompts
Generate descriptions and answers about images
a tiny vision language model