Chuntao Dan
p051tr0n
AI & ML interests
all kinds
Organizations
Voice
Multimodal
- Salesforce/blip-itm-base-coco
  Updated • 66.7k • 28
- Salesforce/blip-image-captioning-base
  Image-to-Text • Updated • 2.1M • 861
- Salesforce/blip-vqa-base
  Visual Question Answering • 0.4B • Updated • 429k • 194
- openai/clip-vit-large-patch14
  Zero-Shot Image Classification • 0.4B • Updated • 12.2M • 2.04k
Agentic
Voice
Vision
Multimodal
Robot