Purpose of special tokens

by tdeboissiere - opened Jul 30, 2024

Jul 30, 2024

Hello !

Thanks for the detailed blog post, very helpful.
I was curious about the special tokens (e.g. ['<od>', '</od>', '<ocr>', '</ocr>']) in the Florence2Processor

These tokens don't seem to be used anywhere, so what is their purpose ?
Related: how was Florence-2 initially trained, say, for object detection ? (Were the inputs to the model the image + a text prompt such as "Locate the objects with category name in the image." + the category + the actual location of the objects in the image ?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment