MeloTTS-EN: Optimized for Qualcomm Devices
MeloTTS is a high-quality multi-lingual text-to-speech library for English, Chinese and Spanish language.
This is based on the implementation of MeloTTS-EN found here. This repository contains pre-exported model files optimized for Qualcomm® devices. You can use the Qualcomm® AI Hub Models library to export with custom configurations. More details on model performance across various devices, can be found here.
Qualcomm AI Hub Models uses Qualcomm AI Hub Workbench to compile, profile, and evaluate this model. Sign up to run these models on a hosted Qualcomm® device.
Getting Started
There are two ways to deploy this model on your device:
Option 1: Download Pre-Exported Models
Below are pre-exported model assets ready for deployment.
| Runtime | Precision | Chipset | SDK Versions | Download |
|---|---|---|---|---|
| VOICE_AI | mixed_with_float | Snapdragon® 8 Elite Gen 5 Mobile | QAIRT 2.45 | Download |
| VOICE_AI | mixed_with_float | Snapdragon® X2 Elite | QAIRT 2.45 | Download |
| VOICE_AI | mixed_with_float | Snapdragon® X Elite | QAIRT 2.45 | Download |
| VOICE_AI | mixed_with_float | Snapdragon® 8 Gen 3 Mobile | QAIRT 2.45 | Download |
| VOICE_AI | mixed_with_float | Qualcomm® QCS8550 (Proxy) | QAIRT 2.45 | Download |
| VOICE_AI | mixed_with_float | Qualcomm® SA8775P | QAIRT 2.45 | Download |
| VOICE_AI | mixed_with_float | Snapdragon® 8 Elite For Galaxy Mobile | QAIRT 2.45 | Download |
| VOICE_AI | mixed_with_float | Qualcomm® SA7255P | QAIRT 2.45 | Download |
| VOICE_AI | mixed_with_float | Qualcomm® SA8295P | QAIRT 2.45 | Download |
| VOICE_AI | mixed_with_float | Qualcomm® QCS9075 | QAIRT 2.45 | Download |
| VOICE_AI | mixed_with_float | Qualcomm® QCS8450 (Proxy) | QAIRT 2.45 | Download |
For more device-specific assets and performance metrics, visit MeloTTS-EN on Qualcomm® AI Hub.
Option 2: Export with Custom Configurations
Use the Qualcomm® AI Hub Models Python library to compile and export the model with your own:
- Custom weights (e.g., fine-tuned checkpoints)
- Custom input shapes
- Target device and runtime configurations
This option is ideal if you need to customize the model beyond the default configuration provided here.
See our repository for MeloTTS-EN on GitHub for usage instructions.
Model Details
Model Type: Model_use_case.audio_generation
Model Stats:
- Model checkpoint: myshell-ai/MeloTTS-English
- Max decoded sequence length: 512 tokens
- Number of parameters (encoder): 8.30M
- Model size (encoder) (float): 31.8 MB
- Number of parameters (flow): 20.1M
- Model size (flow) (float): 76.9 MB
- Number of parameters (decoder): 14.5M
- Model size (decoder) (float): 55.5 MB
- Number of parameters (bert_wrapper): 94.5M
- Model size (bert_wrapper) (float): 360 MB
- Number of parameters (t5_encoder): 15.1M
- Model size (t5_encoder) (float): 57.5 MB
- Number of parameters (t5_decoder): 5.72M
- Model size (t5_decoder) (float): 21.8 MB
Performance Summary
| Model | Runtime | Precision | Chipset | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit |
|---|---|---|---|---|---|---|
| bert_wrapper | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite Gen 5 Mobile | 2.777 ms | 0 - 9 MB | NPU |
| bert_wrapper | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite Mobile | 3.446 ms | 0 - 13 MB | NPU |
| bert_wrapper | VOICE_AI | mixed_with_float | Snapdragon® X2 Elite | 3.353 ms | 0 - 0 MB | NPU |
| bert_wrapper | VOICE_AI | mixed_with_float | Snapdragon® X Elite | 7.36 ms | 0 - 0 MB | NPU |
| bert_wrapper | VOICE_AI | mixed_with_float | Snapdragon® X Elite | 7.36 ms | 0 - 0 MB | NPU |
| bert_wrapper | VOICE_AI | mixed_with_float | Snapdragon® 8 Gen 3 Mobile | 4.7 ms | 0 - 7 MB | NPU |
| bert_wrapper | VOICE_AI | mixed_with_float | Qualcomm® QCS8275 (Proxy) | 29.927 ms | 0 - 8 MB | NPU |
| bert_wrapper | VOICE_AI | mixed_with_float | Qualcomm® QCS8550 (Proxy) | 6.792 ms | 0 - 1 MB | NPU |
| bert_wrapper | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 9.089 ms | 0 - 9 MB | NPU |
| bert_wrapper | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 9.089 ms | 0 - 9 MB | NPU |
| bert_wrapper | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 9.089 ms | 0 - 9 MB | NPU |
| bert_wrapper | VOICE_AI | mixed_with_float | Qualcomm® QCS9075 | 8.792 ms | 0 - 2 MB | NPU |
| bert_wrapper | VOICE_AI | mixed_with_float | Qualcomm® QCS8450 (Proxy) | 10.219 ms | 0 - 9 MB | NPU |
| bert_wrapper | VOICE_AI | mixed_with_float | Qualcomm® SA7255P | 29.927 ms | 0 - 8 MB | NPU |
| bert_wrapper | VOICE_AI | mixed_with_float | Qualcomm® SA8295P | 11.297 ms | 0 - 6 MB | NPU |
| bert_wrapper | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite For Galaxy Mobile | 3.446 ms | 0 - 13 MB | NPU |
| decoder | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite Gen 5 Mobile | 42.577 ms | 0 - 10 MB | NPU |
| decoder | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite Mobile | 48.477 ms | 0 - 9 MB | NPU |
| decoder | VOICE_AI | mixed_with_float | Snapdragon® X2 Elite | 40.75 ms | 0 - 0 MB | NPU |
| decoder | VOICE_AI | mixed_with_float | Snapdragon® X Elite | 82.715 ms | 0 - 0 MB | NPU |
| decoder | VOICE_AI | mixed_with_float | Snapdragon® X Elite | 82.715 ms | 0 - 0 MB | NPU |
| decoder | VOICE_AI | mixed_with_float | Snapdragon® 8 Gen 3 Mobile | 60.784 ms | 0 - 7 MB | NPU |
| decoder | VOICE_AI | mixed_with_float | Qualcomm® QCS8275 (Proxy) | 135.029 ms | 0 - 9 MB | NPU |
| decoder | VOICE_AI | mixed_with_float | Qualcomm® QCS8550 (Proxy) | 85.3 ms | 1 - 2 MB | NPU |
| decoder | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 83.831 ms | 0 - 9 MB | NPU |
| decoder | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 83.831 ms | 0 - 9 MB | NPU |
| decoder | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 83.831 ms | 0 - 9 MB | NPU |
| decoder | VOICE_AI | mixed_with_float | Qualcomm® QCS9075 | 83.266 ms | 0 - 2 MB | NPU |
| decoder | VOICE_AI | mixed_with_float | Qualcomm® QCS8450 (Proxy) | 114.794 ms | 1 - 10 MB | NPU |
| decoder | VOICE_AI | mixed_with_float | Qualcomm® SA7255P | 135.029 ms | 0 - 9 MB | NPU |
| decoder | VOICE_AI | mixed_with_float | Qualcomm® SA8295P | 100.064 ms | 0 - 6 MB | NPU |
| decoder | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite For Galaxy Mobile | 48.477 ms | 0 - 9 MB | NPU |
| encoder | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite Gen 5 Mobile | 23.749 ms | 4 - 13 MB | NPU |
| encoder | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite Mobile | 24.87 ms | 2 - 15 MB | NPU |
| encoder | VOICE_AI | mixed_with_float | Snapdragon® X2 Elite | 25.311 ms | 4 - 4 MB | NPU |
| encoder | VOICE_AI | mixed_with_float | Snapdragon® X Elite | 39.907 ms | 4 - 4 MB | NPU |
| encoder | VOICE_AI | mixed_with_float | Snapdragon® X Elite | 39.907 ms | 4 - 4 MB | NPU |
| encoder | VOICE_AI | mixed_with_float | Snapdragon® 8 Gen 3 Mobile | 30.363 ms | 4 - 10 MB | NPU |
| encoder | VOICE_AI | mixed_with_float | Qualcomm® QCS8275 (Proxy) | 66.288 ms | 2 - 10 MB | NPU |
| encoder | VOICE_AI | mixed_with_float | Qualcomm® QCS8550 (Proxy) | 41.689 ms | 4 - 5 MB | NPU |
| encoder | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 43.28 ms | 2 - 11 MB | NPU |
| encoder | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 43.28 ms | 2 - 11 MB | NPU |
| encoder | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 43.28 ms | 2 - 11 MB | NPU |
| encoder | VOICE_AI | mixed_with_float | Qualcomm® QCS9075 | 43.494 ms | 6 - 11 MB | NPU |
| encoder | VOICE_AI | mixed_with_float | Qualcomm® QCS8450 (Proxy) | 48.998 ms | 4 - 13 MB | NPU |
| encoder | VOICE_AI | mixed_with_float | Qualcomm® SA7255P | 66.288 ms | 2 - 10 MB | NPU |
| encoder | VOICE_AI | mixed_with_float | Qualcomm® SA8295P | 48.3 ms | 0 - 5 MB | NPU |
| encoder | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite For Galaxy Mobile | 24.87 ms | 2 - 15 MB | NPU |
| flow | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite Gen 5 Mobile | 62.889 ms | 2 - 11 MB | NPU |
| flow | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite Mobile | 78.769 ms | 2 - 15 MB | NPU |
| flow | VOICE_AI | mixed_with_float | Snapdragon® X2 Elite | 61.668 ms | 2 - 2 MB | NPU |
| flow | VOICE_AI | mixed_with_float | Snapdragon® X Elite | 122.219 ms | 2 - 2 MB | NPU |
| flow | VOICE_AI | mixed_with_float | Snapdragon® X Elite | 122.219 ms | 2 - 2 MB | NPU |
| flow | VOICE_AI | mixed_with_float | Snapdragon® 8 Gen 3 Mobile | 91.598 ms | 2 - 9 MB | NPU |
| flow | VOICE_AI | mixed_with_float | Qualcomm® QCS8275 (Proxy) | 235.035 ms | 2 - 11 MB | NPU |
| flow | VOICE_AI | mixed_with_float | Qualcomm® QCS8550 (Proxy) | 121.928 ms | 3 - 5 MB | NPU |
| flow | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 121.067 ms | 2 - 12 MB | NPU |
| flow | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 121.067 ms | 2 - 12 MB | NPU |
| flow | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 121.067 ms | 2 - 12 MB | NPU |
| flow | VOICE_AI | mixed_with_float | Qualcomm® QCS9075 | 120.441 ms | 2 - 6 MB | NPU |
| flow | VOICE_AI | mixed_with_float | Qualcomm® QCS8450 (Proxy) | 211.241 ms | 2 - 12 MB | NPU |
| flow | VOICE_AI | mixed_with_float | Qualcomm® SA7255P | 235.035 ms | 2 - 11 MB | NPU |
| flow | VOICE_AI | mixed_with_float | Qualcomm® SA8295P | 150.534 ms | 0 - 5 MB | NPU |
| flow | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite For Galaxy Mobile | 78.769 ms | 2 - 15 MB | NPU |
| t5_decoder | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite Gen 5 Mobile | 0.272 ms | 0 - 10 MB | NPU |
| t5_decoder | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite Mobile | 0.271 ms | 0 - 9 MB | NPU |
| t5_decoder | VOICE_AI | mixed_with_float | Snapdragon® X2 Elite | 0.36 ms | 1 - 1 MB | NPU |
| t5_decoder | VOICE_AI | mixed_with_float | Snapdragon® X Elite | 0.425 ms | 1 - 1 MB | NPU |
| t5_decoder | VOICE_AI | mixed_with_float | Snapdragon® X Elite | 0.425 ms | 1 - 1 MB | NPU |
| t5_decoder | VOICE_AI | mixed_with_float | Snapdragon® 8 Gen 3 Mobile | 0.327 ms | 0 - 8 MB | NPU |
| t5_decoder | VOICE_AI | mixed_with_float | Qualcomm® QCS8275 (Proxy) | 0.991 ms | 0 - 8 MB | NPU |
| t5_decoder | VOICE_AI | mixed_with_float | Qualcomm® QCS8550 (Proxy) | 0.404 ms | 1 - 3 MB | NPU |
| t5_decoder | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 0.664 ms | 0 - 10 MB | NPU |
| t5_decoder | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 0.664 ms | 0 - 10 MB | NPU |
| t5_decoder | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 0.664 ms | 0 - 10 MB | NPU |
| t5_decoder | VOICE_AI | mixed_with_float | Qualcomm® QCS9075 | 0.518 ms | 1 - 3 MB | NPU |
| t5_decoder | VOICE_AI | mixed_with_float | Qualcomm® QCS8450 (Proxy) | 0.585 ms | 1 - 10 MB | NPU |
| t5_decoder | VOICE_AI | mixed_with_float | Qualcomm® SA7255P | 0.991 ms | 0 - 8 MB | NPU |
| t5_decoder | VOICE_AI | mixed_with_float | Qualcomm® SA8295P | 0.79 ms | 0 - 5 MB | NPU |
| t5_decoder | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite For Galaxy Mobile | 0.271 ms | 0 - 9 MB | NPU |
| t5_encoder | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite Gen 5 Mobile | 0.486 ms | 0 - 9 MB | NPU |
| t5_encoder | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite Mobile | 0.525 ms | 0 - 13 MB | NPU |
| t5_encoder | VOICE_AI | mixed_with_float | Snapdragon® X2 Elite | 0.652 ms | 0 - 0 MB | NPU |
| t5_encoder | VOICE_AI | mixed_with_float | Snapdragon® X Elite | 1.055 ms | 0 - 0 MB | NPU |
| t5_encoder | VOICE_AI | mixed_with_float | Snapdragon® X Elite | 1.055 ms | 0 - 0 MB | NPU |
| t5_encoder | VOICE_AI | mixed_with_float | Snapdragon® 8 Gen 3 Mobile | 0.639 ms | 0 - 8 MB | NPU |
| t5_encoder | VOICE_AI | mixed_with_float | Qualcomm® QCS8275 (Proxy) | 2.765 ms | 0 - 8 MB | NPU |
| t5_encoder | VOICE_AI | mixed_with_float | Qualcomm® QCS8550 (Proxy) | 0.888 ms | 0 - 1 MB | NPU |
| t5_encoder | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 1.249 ms | 0 - 9 MB | NPU |
| t5_encoder | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 1.249 ms | 0 - 9 MB | NPU |
| t5_encoder | VOICE_AI | mixed_with_float | Qualcomm® SA8775P | 1.249 ms | 0 - 9 MB | NPU |
| t5_encoder | VOICE_AI | mixed_with_float | Qualcomm® QCS9075 | 1.117 ms | 0 - 2 MB | NPU |
| t5_encoder | VOICE_AI | mixed_with_float | Qualcomm® QCS8450 (Proxy) | 1.346 ms | 0 - 9 MB | NPU |
| t5_encoder | VOICE_AI | mixed_with_float | Qualcomm® SA7255P | 2.765 ms | 0 - 8 MB | NPU |
| t5_encoder | VOICE_AI | mixed_with_float | Qualcomm® SA8295P | 1.751 ms | 0 - 6 MB | NPU |
| t5_encoder | VOICE_AI | mixed_with_float | Snapdragon® 8 Elite For Galaxy Mobile | 0.525 ms | 0 - 13 MB | NPU |
License
- The license for the original implementation of MeloTTS-EN can be found here.
References
Community
- Join our AI Hub Slack community to collaborate, post questions and learn more about on-device AI.
- For questions or feedback please reach out to us.
