Running on Zero • Featured • SAM3 Video Segmentation 🐠 • 100 • Track and label objects in videos using text prompts or clicks
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 193k • 1.56k
Running on Zero • MCP • Featured • ViTPose Transformers ⚡ • 211 • Detect and estimate human poses in images and videos
Running on Zero • Featured • Chat with DeepSeek-VL2-small 🌍 • 577 • Generate responses using images and text input
Running on Zero • Featured • VLM Object Understanding 🦀 • 112 • Explore object detection, visual grounding, keypoint Detecti