microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 439k • 1.6k
jq/whisper-large-v3-salt-plus-xog-myx-kin-swa Automatic Speech Recognition • 2B • Updated Feb 4, 2025 • 10 • 1
Running on Zero Agents Featured 955 MMAudio — generating synchronized audio from video/text 🔊 955 Generate synchronized audio for videos or from text prompts