---
library_name: mlx-audio
base_model:
- ResembleAI/chatterbox
tags:
- mlx
pipeline_tag: text-to-speech
---
# litmudoc/Chatterbox-Multilingual-MLX-v2-Q8
This model was converted to MLX format from [ResembleAI/chatterbox](https://huggingface.co/ResembleAI/chatterbox) using [mlx-audio](https://github.com/Blaizzy/mlx-audio) version **0.2.10**.
**Note:** This model requires the S3Tokenizer weights from [mlx-community/S3TokenizerV2](https://huggingface.co/mlx-community/S3TokenizerV2), which will be downloaded automatically.
## Use with mlx-audio
```bash
pip install -U mlx-audio
```
### Command line
```bash
curl -L -o ko.wav https://huggingface.co/litmudoc/Chatterbox-Multilingual-MLX-v2-Q8/resolve/main/ko.wav
mlx_audio.tts.generate \
  --model litmudoc/Chatterbox-Multilingual-MLX-v2-Q8 \
  --text ", 지난달 우리는 유튜브 채널에서 이십억 조회수라는 새로운 이정표에 도달했습니다." \
  --lang_code ko \
  --ref_audio ko.wav \
  --ref_text "우리는 정말로 허름한 호텔에 묵었지만, 그래도 행복했다." \
  --verbose --play
```
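If you want to drive the same command from a script, one option is to assemble the argument list in Python and hand it to `subprocess`. The sketch below reuses only the flags shown in the command above; the helper names `build_tts_command` and `run_tts` are hypothetical and not part of mlx-audio.

```python
import subprocess

MODEL = "litmudoc/Chatterbox-Multilingual-MLX-v2-Q8"


def build_tts_command(text, ref_audio, ref_text, lang_code="ko", model=MODEL, play=False):
    """Assemble the argument list for the mlx_audio.tts.generate CLI.

    Hypothetical helper: it only mirrors the flags used in the shell example.
    """
    cmd = [
        "mlx_audio.tts.generate",
        "--model", model,
        "--text", text,
        "--lang_code", lang_code,
        "--ref_audio", str(ref_audio),
        "--ref_text", ref_text,
        "--verbose",
    ]
    if play:
        # Play the generated audio once synthesis finishes.
        cmd.append("--play")
    return cmd


def run_tts(text, ref_audio, ref_text, **options):
    """Run the CLI and raise CalledProcessError if it exits with an error."""
    subprocess.run(build_tts_command(text, ref_audio, ref_text, **options), check=True)
```

Passing the arguments as a list avoids shell quoting problems with non-ASCII text, which matters for the Korean strings used here.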
### Python
```python
from mlx_audio.tts.generate import generate_audio

generate_audio(
    # "Last month, we reached a new milestone of two billion views on our YouTube channel."
    text=", 지난달 우리는 유튜브 채널에서 이십억 조회수라는 새로운 이정표에 도달했습니다.",
    model="litmudoc/Chatterbox-Multilingual-MLX-v2-Q8",
    lang_code="ko",
    ref_audio="ko.wav",
    # Transcript of ko.wav: "We stayed in a really shabby hotel, but we were still happy."
    ref_text="우리는 정말로 허름한 호텔에 묵었지만, 그래도 행복했다.",
    file_prefix="output",
)
```
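Both examples assume that `ko.wav` is a readable audio file. A failed download can leave an error page saved under that name, so a quick sanity check before generation may save a confusing failure later. The following is a minimal sketch using only the standard-library `wave` module; `describe_wav` is a hypothetical helper, and it assumes the clip is an uncompressed PCM WAV (other encodings make `wave.open` raise `wave.Error`).

```python
import wave


def describe_wav(path):
    """Return (sample_rate_hz, channels, duration_seconds) for a PCM WAV file.

    Raises wave.Error if the file is not a WAV that the stdlib can parse.
    """
    with wave.open(str(path), "rb") as wav:
        rate = wav.getframerate()
        channels = wav.getnchannels()
        duration = wav.getnframes() / rate
    return rate, channels, duration
```

For example, `describe_wav("ko.wav")` could be called right after the download step and the result printed before passing the file as `ref_audio`.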