How to use Fhrozen/tts_prodiff_jp_multispk with ESPnet:
from espnet2.bin.tts_inference import Text2Speech model = Text2Speech.from_pretrained("Fhrozen/tts_prodiff_jp_multispk") speech, *_ = model("text to generate speech from")