---
language:
- zh
pipeline_tag: text-to-speech
base_model: Jackellie/ellie-Bert-VITS2
license: cc-by-4.0
---

Taiwanese-accent Mandarin text-to-speech (TTS) model from JackEllie.

## Usage

To use this checkpoint with Hugging Face Transformers:

```python
from transformers import AutoModel, AutoProcessor
from scipy.io.wavfile import write
import torch

model_name = "BricksDisplay/ellie-Bert-VITS2"

# trust_remote_code=True lets Transformers run the custom code shipped in the model repository.
model = AutoModel.from_pretrained(model_name, trust_remote_code=True)
processor = AutoProcessor.from_pretrained(model_name, trust_remote_code=True)

# Inference only, so gradient tracking is disabled.
with torch.no_grad():
    inputs = processor("你好", language="zh", return_tensors="pt")  # "你好" means "hello"
    result = model(**inputs)
    result = result["waveform"]

# Save the first waveform in the batch at the model's sampling rate.
write("output.wav", model.config.sampling_rate, result[0].numpy())
```
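
If the waveform is a floating-point array, `scipy.io.wavfile.write` stores it as a floating-point WAV file, which some players do not accept. The following is a minimal sketch of writing 16-bit PCM instead, using only the standard library. The helper names `float_to_pcm16` and `write_pcm16_wav` are hypothetical and not part of this model or of Transformers, and the sketch assumes a mono waveform with samples roughly in the range [-1.0, 1.0].

```python
import array
import sys
import wave


def float_to_pcm16(samples):
    """Convert float samples to signed 16-bit integers, clipping to [-1.0, 1.0]."""
    pcm = array.array("h")
    for sample in samples:
        clipped = max(-1.0, min(1.0, float(sample)))
        pcm.append(int(round(clipped * 32767)))
    return pcm


def write_pcm16_wav(path, sampling_rate, samples):
    """Write a mono 16-bit PCM WAV file from an iterable of float samples."""
    pcm = float_to_pcm16(samples)
    # WAV data is little-endian; swap bytes on big-endian machines.
    if sys.byteorder == "big":
        pcm.byteswap()
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(1)  # assumption: mono output
        wav_file.setsampwidth(2)  # 2 bytes per sample = 16-bit
        wav_file.setframerate(int(sampling_rate))
        wav_file.writeframes(pcm.tobytes())
```

Assuming `result[0]` is a one-dimensional tensor, it could be passed as `write_pcm16_wav("output.wav", model.config.sampling_rate, result[0].tolist())`.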