baidu/ERNIE-4.5-VL-28B-A3B-Thinking Image-Text-to-Text • 30B • Updated about 1 month ago • 14.9k • 528
GIM: Learning Generalizable Image Matcher From Internet Videos 🤗 • Running • 24 • Match images to find keypoints and geometry
SAM2 Video Predictor 🔥 • Running on T4 • Featured • 115 • Segment objects in a video with click-based masks
nomic-ai/nomic-embed-vision-v1.5 Image Feature Extraction • 92.9M • Updated Mar 31, 2025 • 389k • 216