microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 23 days ago • 248k • 1.55k
Running on Zero • MCP • Featured • Qwen Image Edit Camera Control 🎬 • 1.71k • Fast 4-step inference with Qwen Image Edit 2509
Running on Zero • Featured • VLM Object Understanding 🦀 • 111 • Explore object detection, visual grounding, keypoint Detecti