File size: 2,486 Bytes
f27fa21 447f7a7 b125b98 447f7a7 f27fa21 447f7a7 f27fa21 447f7a7 f27fa21 447f7a7 b125b98 447f7a7 b125b98 447f7a7 f27fa21 447f7a7 f27fa21 60ce74a 447f7a7 f27fa21 447f7a7 b125b98 447f7a7 b125b98 447f7a7 b125b98 447f7a7 b125b98 f27fa21 447f7a7 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 |
---
license: apache-2.0
language:
- ar
- da
- de
- el
- en
- es
- fi
- fr
- he
- hi
- it
- ja
- ko
- ms
- nl
- no
- pl
- pt
- ru
- sv
- sw
- tr
- zh
base_model:
- ResembleAI/chatterbox
pipeline_tag: text-to-speech
library_name: mlx-audio
tags:
- text-to-speech
- speech
- speech generation
- voice cloning
- mlx
- tts
---
# mlx-community/chatterbox-5bit
This model was converted to MLX format from [`ResembleAI/chatterbox`](https://huggingface.co/ResembleAI/chatterbox) using mlx-audio version **0.2.7**.
Refer to the [original model card](https://huggingface.co/ResembleAI/chatterbox) for more details on the model.
## Use with mlx
```bash
pip install -U mlx-audio
```
### CLI Example:
```bash
python -m mlx_audio.tts.generate --model mlx-community/chatterbox-5bit --text "Hello, this is a test."
```
### Python Example:
```python
from mlx_audio.tts.utils import load_model
from mlx_audio.tts.generate import generate_audio
model = load_model("mlx-community/chatterbox-5bit")
generate_audio(
model=model, text="Hello, this is a test.",
file_prefix="test_audio",
)
```
### Voice Cloning:
```python
from mlx_audio.tts.generate import generate_audio
generate_audio(
text="Hello, this is a test of Chatterbox TTS.",
model="mlx-community/chatterbox-5bit",
ref_audio="path_to_file.wav",
ref_text="Text transcript", # Optional
file_prefix="vc_output",
)
```
## Multilingual Quickstart
Generate speech in 23 languages by specifying the `lang_code` parameter. The model automatically handles pronunciation, intonation, and language-specific phonemes.
```python
from mlx_audio.tts.generate import generate_audio
generate_audio(
text="Olá, tudo bem?",
model="mlx-community/chatterbox-5bit",
lang_code="pt",
file_prefix="multilingual_output",
)
```
### Supported Languages
| Code | Language | Code | Language | Code | Language |
|------|------------|------|------------|------|------------|
| `ar` | Arabic | `he` | Hebrew | `no` | Norwegian |
| `da` | Danish | `hi` | Hindi | `pl` | Polish |
| `de` | German | `it` | Italian | `pt` | Portuguese |
| `el` | Greek | `ja` | Japanese | `ru` | Russian |
| `en` | English | `ko` | Korean | `sv` | Swedish |
| `es` | Spanish | `ms` | Malay | `sw` | Swahili |
| `fi` | Finnish | `nl` | Dutch | `tr` | Turkish |
| `fr` | French | `zh` | Chinese | | |
|