Text-to-Speech
Transformers
Safetensors
English
vibevoice_streaming
Realtime TTS
Streaming text input
Long-form speech generation
Instructions to use Sumit23/VibeVoice-Realtime-0.5B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Sumit23/VibeVoice-Realtime-0.5B with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-to-speech", model="Sumit23/VibeVoice-Realtime-0.5B")# Load model directly from transformers import VibeVoiceStreamingForConditionalGenerationInference model = VibeVoiceStreamingForConditionalGenerationInference.from_pretrained("Sumit23/VibeVoice-Realtime-0.5B", dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Xet hash:
- f4ef91562d7f139a6d69649c4415c758ce4ab69f11826b2f6bbaac904e0fc0ef
- Size of remote file:
- 2.04 GB
- SHA256:
- 7758b150b8139deb48ac1ff6f181f745c8fedd5511232fd974b3eb217d83b514
·
Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads. More info.