YUGOROU commited on
Commit
54faf99
Β·
verified Β·
1 Parent(s): 6dfbcd9

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +85 -0
README.md ADDED
@@ -0,0 +1,85 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: mlx-audio-plus
3
+ base_model:
4
+ - ResembleAI/chatterbox
5
+ tags:
6
+ - mlx
7
+ - multilingual
8
+ - tts
9
+ - text-to-speech
10
+ - japanese
11
+ language:
12
+ - ar # Arabic
13
+ - da # Danish
14
+ - de # German
15
+ - el # Greek
16
+ - en # English
17
+ - es # Spanish
18
+ - fi # Finnish
19
+ - fr # French
20
+ - he # Hebrew
21
+ - hi # Hindi
22
+ - it # Italian
23
+ - ja # Japanese
24
+ - ko # Korean
25
+ - ms # Malay
26
+ - nl # Dutch
27
+ - "no" # Norwegian
28
+ - pl # Polish
29
+ - pt # Portuguese
30
+ - ru # Russian
31
+ - sv # Swedish
32
+ - sw # Swahili
33
+ - tr # Turkish
34
+ - zh # Chinese
35
+ pipeline_tag: text-to-speech
36
+ ---
37
+
38
+ # {model_name}
39
+
40
+ Chatterbox Multilingual TTS converted to MLX format for Apple Silicon devices.
41
+
42
+ ## 🌍 Supported Languages (23 languages)
43
+
44
+ Arabic, Danish, German, Greek, English, Spanish, Finnish, French, Hebrew, Hindi,
45
+ Italian, **Japanese**, Korean, Malay, Dutch, Norwegian, Polish, Portuguese,
46
+ Russian, Swedish, Swahili, Turkish, Chinese
47
+
48
+ ## πŸ“₯ Installation
49
+ ```bash
50
+ pip install -U mlx-audio-plus
51
+ ```
52
+
53
+ ## πŸš€ Usage
54
+
55
+ ### Command Line
56
+ ```bash
57
+ mlx_audio.tts.generate \\
58
+ --model {model_name} \\
59
+ --text "γ“γ‚“γ«γ‘γ―γ€ε…ƒζ°—γ§γ™γ‹οΌŸ" \\
60
+ --ref_audio reference.wav
61
+ ```
62
+
63
+ ### Python
64
+ ```python
65
+ from mlx_audio.tts.generate import generate_audio
66
+
67
+ generate_audio(
68
+ text="γ“γ‚“γ«γ‘γ―γ€ε…ƒζ°—γ§γ™γ‹οΌŸ",
69
+ model="{model_name}",
70
+ ref_audio="reference.wav",
71
+ file_prefix="output",
72
+ )
73
+ ```
74
+
75
+ ## πŸ“Š Model Details
76
+
77
+ - **Base Model**: ResembleAI/chatterbox
78
+ - **Tokenizer**: 2454 tokens (Multilingual)
79
+ - **Quantization**: {'4-bit' if '4bit' in model_name else '8-bit' if '8bit' in model_name else 'fp16'}
80
+ - **Framework**: MLX (Apple Silicon optimized)
81
+
82
+ ## πŸ”— Related
83
+
84
+ - Original PyTorch model: [ResembleAI/chatterbox](https://huggingface.co/ResembleAI/chatterbox)
85
+ - S3Tokenizer: [mlx-community/S3TokenizerV2](https://huggingface.co/mlx-community/S3TokenizerV2)