How to use from the
Use from the
Chatterbox library
# pip install chatterbox-tts
import torchaudio as ta
from chatterbox.tts import ChatterboxTTS

model = ChatterboxTTS.from_pretrained(device="cuda")

text = "Ezreal and Jinx teamed up with Ahri, Yasuo, and Teemo to take down the enemy's Nexus in an epic late-game pentakill."
wav = model.generate(text)
ta.save("test-1.wav", wav, model.sr)

# If you want to synthesize with a different voice, specify the audio prompt
AUDIO_PROMPT_PATH="YOUR_FILE.wav"
wav = model.generate(text, audio_prompt_path=AUDIO_PROMPT_PATH)
ta.save("test-2.wav", wav, model.sr)

Chatterbox TTS Fine-Tuned Model

This is a fine-tuned version of the Chatterbox TTS model.

Model Details

  • Base Model: Chatterbox TTS Standard
  • Training Epochs: 50
  • Batch Size: 24
  • Learning Rate: 5e-05

Usage

from huggingface_hub import hf_hub_download
from safetensors.torch import load_file

# Download the fine-tuned weights
model_path = hf_hub_download(
    repo_id="Chandan683/chatterbox-finetuned",
    filename="t3_finetuned.safetensors"
)

# Load weights
state_dict = load_file(model_path)

Training Data

This model was fine-tuned on custom speech data.

License

MIT License

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support