microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition β’ 6B β’ Updated Dec 10, 2025 β’ 388k β’ 1.6k
Running on Zero Agents Featured 2.86k F5-TTS π£ 2.86k F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Running on Zero Agents 236 GPT SoVITS V2 Pro Plus π€ 236 Generate speech from text using a reference voice
Runtime error Agents Featured 697 Fish Audio S1 π 697 Convert text to natural-sounding speech audio
Runtime error Agents 150 Multi Voice TTS(English/Chinese/Japanese) π 150 [δΈζ/English/ζ₯ζ¬θͺ]multilingual text-to-speech
pythainlp/thainer-corpus-v2-base-model Token Classification β’ 0.1B β’ Updated Mar 23, 2023 β’ 139k β’ β’ 16
Configuration error Agents Featured 178 NaturalSpeech3 FACodec π 178 Convert and reconstruct speech files