Instructions to use Edmon02/TTS_NB_ONNX with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Edmon02/TTS_NB_ONNX with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-to-speech", model="Edmon02/TTS_NB_ONNX")# Load model directly from transformers import AutoProcessor, AutoModelForTextToSpectrogram processor = AutoProcessor.from_pretrained("Edmon02/TTS_NB_ONNX") model = AutoModelForTextToSpectrogram.from_pretrained("Edmon02/TTS_NB_ONNX") - Notebooks
- Google Colab
- Kaggle
Armenian SpeechT5 β ONNX export (TTS_NB_ONNX)
ONNX export of the SpeechT5 encoder/decoder for faster or edge inference (ONNXRuntime, mobile, C++). Pair with HiFi-GAN vocoder separately.
Files
| File | Role |
|---|---|
encoder_model.onnx |
Text β hidden states |
decoder_model.onnx |
Autoregressive mel decoder |
decoder_with_past_model.onnx |
Decoder with KV cache |
decoder_postnet_and_vocoder.onnx |
Postnet (vocoder may still be separate) |
spm_char.model |
SentencePiece tokenizer |
config.json / preprocessor_config.json |
Model config |
When to use
- ONNXRuntime deployment
- Environments without full PyTorch stack
- Latency-sensitive inference pipelines
For PyTorch + Hugging Face, prefer Edmon02/speecht5_finetuned_voxpopuli_hy or Edmon02/TTS_NB_2.
Source checkpoint
Exported from Edmon02/TTS_NB_2 (verify export date in commit history).
Limitations
- ONNX export may not cover full HF
generate_speechAPI β validate your runtime graph - Speaker embeddings must be supplied consistently with training
- Re-export when
TTS_NB_2architecture or opset changes
License
MIT
- Downloads last month
- 22