phoonnx_an_dii_unicode

A text-to-speech voice model for Aragonese (an) using the phoonnx VITS architecture, exported to ONNX for efficient CPU inference. The voice persona is Dii — a female speaker adapted via voice conversion from an Asturian base model (phoonnx_ast_dii_unicode).

Training data

Trained on TigreGotico/tts_vc_mcv-scripted-v23.0_an_dii, a synthetic voice-conversion dataset derived from Mozilla Common Voice scripted prompts (release 23.0). Audio was synthesised with a donor TTS voice and then converted to the Dii speaker identity, producing a single-speaker corpus for Aragonese — a critically low-resource Ibero-Romance language.

Usage

import wave
from phoonnx.config import SynthesisConfig
from phoonnx.voice import TTSVoice

voice = TTSVoice.from_hf("OpenVoiceOS/phoonnx_an_dii_unicode")

synthesis_config = SynthesisConfig(noise_scale=0.667, length_scale=1.0, noise_w_scale=0.8)

with wave.open("output.wav", "wb") as wav_file:
    voice.synthesize_wav("Buenas días, ¿cómo estás?", wav_file, synthesis_config)

Or via the OVOS TTS plugin:

{
  "tts": {
    "module": "ovos-tts-plugin-phoonnx",
    "ovos-tts-plugin-phoonnx": {
      "voice": "OpenVoiceOS/phoonnx_an_dii_unicode"
    }
  }
}

License

Apache 2.0

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for OpenVoiceOS/phoonnx_an_dii_unicode

Quantized
(1)
this model
Quantizations
1 model

Dataset used to train OpenVoiceOS/phoonnx_an_dii_unicode