File size: 1,974 Bytes
54faf99 e7ae14f 54faf99 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 |
---
library_name: mlx-audio-plus
base_model:
- ResembleAI/chatterbox
tags:
- mlx
- multilingual
- tts
- text-to-speech
- japanese
language:
- ar # Arabic
- da # Danish
- de # German
- el # Greek
- en # English
- es # Spanish
- fi # Finnish
- fr # French
- he # Hebrew
- hi # Hindi
- it # Italian
- ja # Japanese
- ko # Korean
- ms # Malay
- nl # Dutch
- "no" # Norwegian
- pl # Polish
- pt # Portuguese
- ru # Russian
- sv # Swedish
- sw # Swahili
- tr # Turkish
- zh # Chinese
pipeline_tag: text-to-speech
---
## ๐จใขใใซใฎๅฎๅ
จใชๅไฝ็ขบ่ชใใพใ ใงใใฆใใพใใ๏ผ (๐จWe're still working on fully testing the model!)
# YUGOROU/Chatterbox-Multilingual-MLX-4bit
Chatterbox Multilingual TTS converted to MLX format for Apple Silicon devices.
## ๐ Supported Languages (23 languages)
Arabic, Danish, German, Greek, English, Spanish, Finnish, French, Hebrew, Hindi,
Italian, **Japanese**, Korean, Malay, Dutch, Norwegian, Polish, Portuguese,
Russian, Swedish, Swahili, Turkish, Chinese
## ๐ฅ Installation
```bash
pip install -U mlx-audio-plus
```
## ๐ Usage
### Command Line
```bash
mlx_audio.tts.generate \\
--model {model_name} \\
--text "ใใใซใกใฏใๅ
ๆฐใงใใ๏ผ" \\
--ref_audio reference.wav
```
### Python
```python
from mlx_audio.tts.generate import generate_audio
generate_audio(
text="ใใใซใกใฏใๅ
ๆฐใงใใ๏ผ",
model="{model_name}",
ref_audio="reference.wav",
file_prefix="output",
)
```
## ๐ Model Details
- **Base Model**: ResembleAI/chatterbox
- **Tokenizer**: 2454 tokens (Multilingual)
- **Quantization**: {'4-bit' if '4bit' in model_name else '8-bit' if '8bit' in model_name else 'fp16'}
- **Framework**: MLX (Apple Silicon optimized)
## ๐ Related
- Original PyTorch model: [ResembleAI/chatterbox](https://huggingface.co/ResembleAI/chatterbox)
- S3Tokenizer: [mlx-community/S3TokenizerV2](https://huggingface.co/mlx-community/S3TokenizerV2) |