microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 533k • 1.61k
jq/whisper-large-v3-salt-plus-xog-myx-kin-swa Automatic Speech Recognition • 2B • Updated Feb 4, 2025 • 20 • 1
MMAudio: generating synchronized audio from video/text 🔊 Running on Zero • Agents • Featured • 969 • Generate synchronized audio for videos from text prompts