YUGOROU's picture
Update README.md
e7ae14f verified
---
library_name: mlx-audio-plus
base_model:
- ResembleAI/chatterbox
tags:
- mlx
- multilingual
- tts
- text-to-speech
- japanese
language:
- ar # Arabic
- da # Danish
- de # German
- el # Greek
- en # English
- es # Spanish
- fi # Finnish
- fr # French
- he # Hebrew
- hi # Hindi
- it # Italian
- ja # Japanese
- ko # Korean
- ms # Malay
- nl # Dutch
- "no" # Norwegian
- pl # Polish
- pt # Portuguese
- ru # Russian
- sv # Swedish
- sw # Swahili
- tr # Turkish
- zh # Chinese
pipeline_tag: text-to-speech
---
## ๐Ÿšจใƒขใƒ‡ใƒซใฎๅฎŒๅ…จใชๅ‹•ไฝœ็ขบ่ชใŒใพใ ใงใใฆใ„ใพใ›ใ‚“๏ผ (๐ŸšจWe're still working on fully testing the model!)
# YUGOROU/Chatterbox-Multilingual-MLX-4bit
Chatterbox Multilingual TTS converted to MLX format for Apple Silicon devices.
## ๐ŸŒ Supported Languages (23 languages)
Arabic, Danish, German, Greek, English, Spanish, Finnish, French, Hebrew, Hindi,
Italian, **Japanese**, Korean, Malay, Dutch, Norwegian, Polish, Portuguese,
Russian, Swedish, Swahili, Turkish, Chinese
## ๐Ÿ“ฅ Installation
```bash
pip install -U mlx-audio-plus
```
## ๐Ÿš€ Usage
### Command Line
```bash
mlx_audio.tts.generate \\
--model {model_name} \\
--text "ใ“ใ‚“ใซใกใฏใ€ๅ…ƒๆฐ—ใงใ™ใ‹๏ผŸ" \\
--ref_audio reference.wav
```
### Python
```python
from mlx_audio.tts.generate import generate_audio
generate_audio(
text="ใ“ใ‚“ใซใกใฏใ€ๅ…ƒๆฐ—ใงใ™ใ‹๏ผŸ",
model="{model_name}",
ref_audio="reference.wav",
file_prefix="output",
)
```
## ๐Ÿ“Š Model Details
- **Base Model**: ResembleAI/chatterbox
- **Tokenizer**: 2454 tokens (Multilingual)
- **Quantization**: {'4-bit' if '4bit' in model_name else '8-bit' if '8bit' in model_name else 'fp16'}
- **Framework**: MLX (Apple Silicon optimized)
## ๐Ÿ”— Related
- Original PyTorch model: [ResembleAI/chatterbox](https://huggingface.co/ResembleAI/chatterbox)
- S3Tokenizer: [mlx-community/S3TokenizerV2](https://huggingface.co/mlx-community/S3TokenizerV2)