license: apache-2.0
language:
  - en
pipeline_tag: text-to-speech

DiFlow-TTS: Compact and Low-Latency Zero-Shot Text-to-Speech with Discrete Flow Matching

GitHub Paper Demo Interspeech 2026

DiFlow-TTS is trained on 470 hours of the LibriTTS dataset, which consists predominantly of neutral speech. As a result, it may perform poorly on prompts with strong emotional expression.

Download the DiFlow-TTS checkpoint and place it as follows:

root/
└── ckpts/
    └── diflow-tts.ckpt
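
One possible way to fetch the checkpoint into the layout above is a small Python sketch using `huggingface_hub`. The repo id `ngocson2002/DiFlowTTS` and the assumption that the file is published under the name `diflow-tts.ckpt` are guesses not confirmed by this README; adjust both to match the actual repository. The helper names are hypothetical.

```python
from pathlib import Path

# Assumptions (not stated in this README): the Hub repo id, and that the
# checkpoint is stored in that repo under this exact filename.
REPO_ID = "ngocson2002/DiFlowTTS"
CKPT_NAME = "diflow-tts.ckpt"


def expected_ckpt_path(root="."):
    """Return the location the layout above expects: <root>/ckpts/diflow-tts.ckpt."""
    return Path(root) / "ckpts" / CKPT_NAME


def ckpt_in_place(root="."):
    """True if the checkpoint file already exists at the expected location."""
    return expected_ckpt_path(root).is_file()


def download_ckpt(root="."):
    """Download the checkpoint into <root>/ckpts/ (requires `pip install huggingface_hub`)."""
    # Imported lazily so the path helpers work without the dependency installed.
    from huggingface_hub import hf_hub_download

    target_dir = Path(root) / "ckpts"
    target_dir.mkdir(parents=True, exist_ok=True)
    local_file = hf_hub_download(
        repo_id=REPO_ID, filename=CKPT_NAME, local_dir=target_dir
    )
    return Path(local_file)
```

Calling `download_ckpt()` from the project root and then checking `ckpt_in_place()` should confirm that the file landed where the directory tree above expects it. A manual download placed at the same path works equally well.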