FastVLM (Collection): Efficient Vision Encoding for Vision Language Models • 8 items • Updated 29 days ago • 109
MobileCLIP2 (Collection): Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B • 27 items • Updated 29 days ago • 58