Aratako commited on
Commit
9860e50
·
verified ·
1 Parent(s): b379f5f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +29 -27
README.md CHANGED
@@ -1,22 +1,22 @@
1
- ---
2
- language:
3
- - ja
4
- - en
5
- library_name: transformers
6
- license: other
7
- license_name: lfm1.0
8
- license_link: LICENSE
9
- datasets:
10
- - nvidia/hifitts-2
11
- - amphion/Emilia-Dataset
12
- base_model:
13
- - LiquidAI/LFM2-350M
14
- pipeline_tag: text-to-speech
15
- tags:
16
- - speech
17
- - tts
18
- - voice
19
- ---
20
 
21
  # MioTTS-0.4B: Lightweight & Fast LLM-based TTS
22
 
@@ -31,14 +31,16 @@ This model supports zero-shot voice cloning and is built on top of the efficient
31
 
32
  We offer a range of model sizes to suit different performance and resource requirements.
33
 
34
- | Model Name | Parameters | Base Model | License |
35
- | :--- | :---: | :--- | :--- |
36
- | [MioTTS-0.1B]((https://huggingface.co/Aratako/MioTTS-0.1B)) | 0.1B | [tiiuae/Falcon-H1-Tiny-Multilingual-100M-Base](https://huggingface.co/tiiuae/Falcon-H1-Tiny-Multilingual-100M-Base) | [Falcon-LLM License](https://falconllm.tii.ae/falcon-terms-and-conditions.html) |
37
- | **MioTTS-0.4B** | **0.4B** | **[LiquidAI/LFM2-350M](https://huggingface.co/LiquidAI/LFM2-350M)** | **[LFM Open License v1.0](https://huggingface.co/LiquidAI/LFM2-350M/blob/main/LICENSE)** |
38
- | [MioTTS-0.6B](https://huggingface.co/Aratako/MioTTS-0.6B) | 0.6B | [Qwen/Qwen3-0.6B-Base](https://huggingface.co/Qwen/Qwen3-0.6B-Base) | [Apache 2.0](https://choosealicense.com/licenses/apache-2.0/) |
39
- | [MioTTS-1.2B](https://huggingface.co/Aratako/MioTTS-1.2B) | 1.2B | [LiquidAI/LFM2.5-1.2B-Base](https://huggingface.co/LiquidAI/LFM2.5-1.2B-Base) | [LFM Open License v1.0](https://huggingface.co/LiquidAI/LFM2.5-1.2B-Base/blob/main/LICENSE) |
40
- | [MioTTS-1.7B](https://huggingface.co/Aratako/MioTTS-1.7B) | 1.7B | [Qwen/Qwen3-1.7B-Base](https://huggingface.co/Qwen/Qwen3-1.7B-Base) | [Apache 2.0](https://choosealicense.com/licenses/apache-2.0/) |
41
- | [MioTTS-2.6B](https://huggingface.co/Aratako/MioTTS-2.6B) | 2.6B | [LiquidAI/LFM2-2.6B](https://huggingface.co/LiquidAI/LFM2-2.6B) | [LFM Open License v1.0](https://huggingface.co/LiquidAI/LFM2-2.6B/blob/main/LICENSE) |
 
 
42
 
43
  ## 🌟 Key Features
44
 
 
1
+ ---
2
+ language:
3
+ - ja
4
+ - en
5
+ library_name: transformers
6
+ license: other
7
+ license_name: lfm1.0
8
+ license_link: LICENSE
9
+ datasets:
10
+ - nvidia/hifitts-2
11
+ - amphion/Emilia-Dataset
12
+ base_model:
13
+ - LiquidAI/LFM2-350M
14
+ pipeline_tag: text-to-speech
15
+ tags:
16
+ - speech
17
+ - tts
18
+ - voice
19
+ ---
20
 
21
  # MioTTS-0.4B: Lightweight & Fast LLM-based TTS
22
 
 
31
 
32
  We offer a range of model sizes to suit different performance and resource requirements.
33
 
34
+ | Model Name | Parameters | Base Model | License | RTF (Real-Time Factor) |
35
+ | :--- | :---: | :--- | :--- | :---: |
36
+ | [MioTTS-0.1B](https://huggingface.co/Aratako/MioTTS-0.1B) | 0.1B | [tiiuae/Falcon-H1-Tiny-Multilingual-100M-Base](https://huggingface.co/tiiuae/Falcon-H1-Tiny-Multilingual-100M-Base) | [Falcon-LLM License](https://falconllm.tii.ae/falcon-terms-and-conditions.html) | 0.04 - 0.05 |
37
+ | **MioTTS-0.4B** | **0.4B** | **[LiquidAI/LFM2-350M](https://huggingface.co/LiquidAI/LFM2-350M)** | **[LFM Open License v1.0](https://huggingface.co/LiquidAI/LFM2-350M/blob/main/LICENSE)** | **0.035 - 0.045** |
38
+ | [MioTTS-0.6B](https://huggingface.co/Aratako/MioTTS-0.6B) | 0.6B | [Qwen/Qwen3-0.6B-Base](https://huggingface.co/Qwen/Qwen3-0.6B-Base) | [Apache 2.0](https://choosealicense.com/licenses/apache-2.0/) | 0.055 - 0.065 |
39
+ | [MioTTS-1.2B](https://huggingface.co/Aratako/MioTTS-1.2B) | 1.2B | [LiquidAI/LFM2.5-1.2B-Base](https://huggingface.co/LiquidAI/LFM2.5-1.2B-Base) | [LFM Open License v1.0](https://huggingface.co/LiquidAI/LFM2.5-1.2B-Base/blob/main/LICENSE) | 0.065 - 0.075 |
40
+ | [MioTTS-1.7B](https://huggingface.co/Aratako/MioTTS-1.7B) | 1.7B | [Qwen/Qwen3-1.7B-Base](https://huggingface.co/Qwen/Qwen3-1.7B-Base) | [Apache 2.0](https://choosealicense.com/licenses/apache-2.0/) | 0.10 - 0.11 |
41
+ | [MioTTS-2.6B](https://huggingface.co/Aratako/MioTTS-2.6B) | 2.6B | [LiquidAI/LFM2-2.6B](https://huggingface.co/LiquidAI/LFM2-2.6B) | [LFM Open License v1.0](https://huggingface.co/LiquidAI/LFM2-2.6B/blob/main/LICENSE) | 0.135 - 0.145 |
42
+
43
+ > RTF values represent the range observed when generating approximately 15 seconds of audio across multiple runs. Measured on an **NVIDIA RTX 5090** using **vLLM 0.15.1**.
44
 
45
  ## 🌟 Key Features
46