Sunbird/tts
Viewer • Updated • 90.4k • 46
Single-speaker lug finetune of Qwen/Qwen3-TTS-12Hz-1.7B-Base on the salt_lug_0001 voice
from Sunbird/tts.
Qwen/Qwen3-TTS-12Hz-1.7B-Baselug), filtered to speaker_id == salt_lug_0001import torch
import soundfile as sf
from qwen_tts import Qwen3TTSModel
tts = Qwen3TTSModel.from_pretrained(
"patrickcmd/qwen3-tts-salt-lug-0001",
device_map="cuda:0",
dtype=torch.bfloat16,
attn_implementation="flash_attention_2",
)
wavs, sr = tts.generate_custom_voice(
text="Oli otya?",
speaker="salt_lug_0001",
)
sf.write("out.wav", wavs[0], sr)
salt_lug_0001); voice cloning to other speakers is not the goal of this finetune.Base model
Qwen/Qwen3-TTS-12Hz-1.7B-Base