microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 305k • 1.58k
jq/whisper-large-v3-salt-plus-xog-myx-kin-swa Automatic Speech Recognition • 2B • Updated Feb 4, 2025 • 2 • 1
MMAudio: generating synchronized audio from video/text • Running on Zero • Featured • 936 • Generate synchronized audio for videos from text prompts