Instructions to use openbmb/VoxCPM1.5 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- VoxCPM
How to use openbmb/VoxCPM1.5 with VoxCPM:
import soundfile as sf from voxcpm import VoxCPM model = VoxCPM.from_pretrained("openbmb/VoxCPM1.5") wav = model.generate( text="VoxCPM is an innovative end-to-end TTS model from ModelBest, designed to generate highly expressive speech.", prompt_wav_path=None, # optional: path to a prompt speech for voice cloning prompt_text=None, # optional: reference text cfg_value=2.0, # LM guidance on LocDiT, higher for better adherence to the prompt, but maybe worse inference_timesteps=10, # LocDiT inference timesteps, higher for better result, lower for fast speed normalize=True, # enable external TN tool denoise=True, # enable external Denoise tool retry_badcase=True, # enable retrying mode for some bad cases (unstoppable) retry_badcase_max_times=3, # maximum retrying times retry_badcase_ratio_threshold=6.0, # maximum length restriction for bad case detection (simple but effective), it could be adjusted for slow pace speech ) sf.write("output.wav", wav, 16000) print("saved: output.wav") - Notebooks
- Google Colab
- Kaggle
ported languages ?
what supported languages ?
Currently, it supports mainly Chinese and English, but you can try to fine-tuning it with other language corpus (Ref: https://github.com/OpenBMB/VoxCPM/issues/114). We plan to support more languages in the next version! :)
Currently, it supports mainly Chinese and English, but you can try to fine-tuning it with other language corpus (Ref: https://github.com/OpenBMB/VoxCPM/issues/114). We plan to support more languages in the next version! :)
What languages are on the road map? German maybe? Any estimation, when the multilanguage support will be released?
Currently, it supports mainly Chinese and English, but you can try to fine-tuning it with other language corpus (Ref: https://github.com/OpenBMB/VoxCPM/issues/114). We plan to support more languages in the next version! :)
What languages are on the road map? German maybe? Any estimation, when the multilanguage support will be released?
@Bennet85 Yes, German is on our roadmap. We plan to release the multilingual version in 2026 Q1. Please stay tuned!