File size: 1,974 Bytes
54faf99
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
e7ae14f
 
 
54faf99
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
---
library_name: mlx-audio-plus
base_model:
- ResembleAI/chatterbox
tags:
- mlx
- multilingual
- tts
- text-to-speech
- japanese
language:
- ar  # Arabic
- da  # Danish
- de  # German
- el  # Greek
- en  # English
- es  # Spanish
- fi  # Finnish
- fr  # French
- he  # Hebrew
- hi  # Hindi
- it  # Italian
- ja  # Japanese
- ko  # Korean
- ms  # Malay
- nl  # Dutch
- "no"  # Norwegian
- pl  # Polish
- pt  # Portuguese
- ru  # Russian
- sv  # Swedish
- sw  # Swahili
- tr  # Turkish
- zh  # Chinese
pipeline_tag: text-to-speech
---

## ๐Ÿšจใƒขใƒ‡ใƒซใฎๅฎŒๅ…จใชๅ‹•ไฝœ็ขบ่ชใŒใพใ ใงใใฆใ„ใพใ›ใ‚“๏ผ (๐ŸšจWe're still working on fully testing the model!)

# YUGOROU/Chatterbox-Multilingual-MLX-4bit

Chatterbox Multilingual TTS converted to MLX format for Apple Silicon devices.

## ๐ŸŒ Supported Languages (23 languages)

Arabic, Danish, German, Greek, English, Spanish, Finnish, French, Hebrew, Hindi,
Italian, **Japanese**, Korean, Malay, Dutch, Norwegian, Polish, Portuguese,
Russian, Swedish, Swahili, Turkish, Chinese

## ๐Ÿ“ฅ Installation
```bash
pip install -U mlx-audio-plus
```

## ๐Ÿš€ Usage

### Command Line
```bash
mlx_audio.tts.generate \\
    --model {model_name} \\
    --text "ใ“ใ‚“ใซใกใฏใ€ๅ…ƒๆฐ—ใงใ™ใ‹๏ผŸ" \\
    --ref_audio reference.wav
```

### Python
```python
from mlx_audio.tts.generate import generate_audio

generate_audio(
    text="ใ“ใ‚“ใซใกใฏใ€ๅ…ƒๆฐ—ใงใ™ใ‹๏ผŸ",
    model="{model_name}",
    ref_audio="reference.wav",
    file_prefix="output",
)
```

## ๐Ÿ“Š Model Details

- **Base Model**: ResembleAI/chatterbox
- **Tokenizer**: 2454 tokens (Multilingual)
- **Quantization**: {'4-bit' if '4bit' in model_name else '8-bit' if '8bit' in model_name else 'fp16'}
- **Framework**: MLX (Apple Silicon optimized)

## ๐Ÿ”— Related

- Original PyTorch model: [ResembleAI/chatterbox](https://huggingface.co/ResembleAI/chatterbox)
- S3Tokenizer: [mlx-community/S3TokenizerV2](https://huggingface.co/mlx-community/S3TokenizerV2)