Visual Grounding
#20
by Maverick17 - opened
I haven't tried it yet, but is this model capable of returning bounding boxes for a text query, i.e. visual grounding?
I was not able to get a bounding box. It always tells me to use YOLO and gives me Python code instead ;-)