microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 17 days ago • 284k • 1.55k
jq/whisper-large-v3-salt-plus-xog-myx-kin-swa Automatic Speech Recognition • 2B • Updated Feb 4 • 25 • 1
Running on Zero Featured 903 MMAudio — generating synchronized audio from video/text 🔊 903 Generate audio from video or text prompts