Small Language Models microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 193k • 1.56k
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 193k • 1.56k
TTS SparkAudio/Spark-TTS-0.5B Text-to-Speech • Updated Mar 7, 2025 • 915 • 723 sesame/csm-1b Text-to-Speech • Updated Dec 1, 2025 • 43.5k • 2.32k hexgrad/Kokoro-82M Text-to-Speech • Updated Apr 10, 2025 • 2.14M • • 5.62k speaches-ai/Kokoro-82M-v1.0-ONNX-fp16 Text-to-Speech • Updated Mar 21, 2025 • 2
Small Language Models microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 193k • 1.56k
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 193k • 1.56k
TTS SparkAudio/Spark-TTS-0.5B Text-to-Speech • Updated Mar 7, 2025 • 915 • 723 sesame/csm-1b Text-to-Speech • Updated Dec 1, 2025 • 43.5k • 2.32k hexgrad/Kokoro-82M Text-to-Speech • Updated Apr 10, 2025 • 2.14M • • 5.62k speaches-ai/Kokoro-82M-v1.0-ONNX-fp16 Text-to-Speech • Updated Mar 21, 2025 • 2