theoracleguy/Chatterbox-Multilingual-MLX-v2-fp16
This model was converted to MLX format from ResembleAI/chatterbox using mlx-audio version 0.2.10.
Note: This model requires the S3Tokenizer weights from mlx-community/S3TokenizerV2, which will be downloaded automatically.
Use with mlx-audio
pip install -U mlx-audio
Command line
curl -L -o ko.wav https://huggingface.co/litmudoc/Chatterbox-Multilingual-MLX-v2-fp16/resolve/main/ko.wav
mlx_audio.tts.generate \
  --model litmudoc/Chatterbox-Multilingual-MLX-v2-fp16 \
  --text ", 지난달 우리는 유튜브 채널에서 이십억 조회수라는 새로운 이정표에 도달했습니다." \
  --lang_code ko \
  --ref_audio ko.wav \
  --ref_text "우리는 정말로 허름한 호텔에 묵었지만, 그래도 행복했다." \
  --verbose --play
Python
from mlx_audio.tts.generate import generate_audio

generate_audio(
    text=", 지난달 우리는 유튜브 채널에서 이십억 조회수라는 새로운 이정표에 도달했습니다.",
    model="litmudoc/Chatterbox-Multilingual-MLX-v2-fp16",
    lang_code="ko",
    ref_audio="ko.wav",
    ref_text="우리는 정말로 허름한 호텔에 묵었지만, 그래도 행복했다.",
    file_prefix="output",
)
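The Python example expects ko.wav to already be in the working directory, which the command-line section handles with curl. A minimal sketch of the same download step in Python follows, using only the standard library. The helper names reference_audio_url and ensure_reference_audio are hypothetical and not part of mlx-audio; the repository ID and file name are taken from the curl command in the command-line section.

```python
from pathlib import Path
from typing import Optional
from urllib.request import urlretrieve

# Repository and file name as used in the curl command of this README.
REPO_ID = "litmudoc/Chatterbox-Multilingual-MLX-v2-fp16"
FILENAME = "ko.wav"


def reference_audio_url(repo_id: str = REPO_ID, filename: str = FILENAME) -> str:
    """Build the 'resolve/main' download URL for a file in a Hugging Face repo."""
    return f"https://huggingface.co/{repo_id}/resolve/main/{filename}"


def ensure_reference_audio(path: str = FILENAME, url: Optional[str] = None) -> Path:
    """Return the local path of the reference clip, downloading it only if missing.

    Hypothetical helper: it is not provided by mlx-audio.
    """
    target = Path(path)
    if not target.exists():
        # Network access happens only when the file is not already present.
        urlretrieve(url or reference_audio_url(), str(target))
    return target
```

With this in place, `ref_audio=str(ensure_reference_audio())` could be passed to `generate_audio` instead of relying on a prior curl call. The existence check keeps repeated runs from downloading the clip again.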
Model tree for theoracleguy/Chatterbox-Multilingual-MLX-v2-fp16
Base model
ResembleAI/chatterbox