SAM3 Video Segmentation 🐠 • Running on Zero • Featured • 87 • Track and label objects in videos using text prompts or clicks
microsoft/Phi-4-multimodal-instruct • Automatic Speech Recognition • 6B • Updated 1 day ago • 399k • 1.55k
ViTPose Transformers ⚡ • Running on Zero • MCP • Featured • 200 • Detect and estimate human poses in images and videos
Chat with DeepSeek-VL2-small 🌍 • Running on Zero • Featured • 567 • Generate responses using images and text input
VLM Object Understanding 🦀 • Running on Zero • Featured • 109 • Explore object detection, visual grounding, keypoint Detecti