Text-to-Speech
Core ML
Supertonic
speech
audio
tts
ane
apple-silicon
flow-matching
diffusion
multilingual
Instructions to use FluidInference/supertonic-3-coreml with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Supertonic
How to use FluidInference/supertonic-3-coreml with Supertonic:
from supertonic import TTS tts = TTS(auto_download=True) style = tts.get_voice_style(voice_name="M1") text = "The train delay was announced at 4:45 PM on Wed, Apr 3, 2024 due to track maintenance." wav, duration = tts.synthesize(text, voice_style=style) tts.save_audio(wav, "output.wav")
- Notebooks
- Google Colab
- Kaggle
Ctrl+K
- DurationPredictor.mlmodelc
- DurationPredictor.mlpackage
- TextEncoder.mlmodelc
- TextEncoder.mlpackage
- VectorEstimator.mlmodelc
- VectorEstimator.mlpackage
- VectorEstimator_int8.mlmodelc
- VectorEstimator_int8.mlpackage
- Vocoder.mlmodelc
- Vocoder.mlpackage
- voice_styles
- 1.52 kB
- 2 Bytes
- 9.24 kB
- 8.56 kB
- 11.1 kB
- 45 Bytes
- 8.25 kB
- 278 kB