How to use pyf98/librispeech_branchformer_e18_linear3072 with ESPnet:
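import soundfile
from espnet2.bin.asr_inference import Speech2Text

# Fetch the pretrained model by its tag and build the inference wrapper.
model = Speech2Text.from_pretrained(
    "pyf98/librispeech_branchformer_e18_linear3072"
)
# Read the audio as a NumPy array; rate is the file's sample rate.
speech, rate = soundfile.read("speech.wav")
# The model returns an n-best list; keep the text of the top hypothesis.
text, *_ = model(speech)[0]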
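The snippet above reads `rate` but never checks it. A minimal sketch of an input check follows, assuming the model expects 16 kHz single-channel audio (the LibriSpeech recording format; this is an assumption, not something stated on this page). `prepare_waveform` and `EXPECTED_RATE` are hypothetical names introduced for illustration and are not part of ESPnet.

```python
import numpy as np

# Assumption: the model was trained on 16 kHz LibriSpeech audio.
EXPECTED_RATE = 16000


def prepare_waveform(speech, rate, expected_rate=EXPECTED_RATE):
    """Hypothetical helper: return a mono float32 waveform, or raise on a rate mismatch."""
    speech = np.asarray(speech, dtype=np.float32)
    if speech.ndim == 2:
        # soundfile returns shape (frames, channels) for multi-channel files;
        # average the channels to get a single-channel signal.
        speech = speech.mean(axis=1)
    elif speech.ndim != 1:
        raise ValueError(f"unexpected waveform shape: {speech.shape}")
    if rate != expected_rate:
        # Resampling is left to the caller; this sketch only detects the mismatch.
        raise ValueError(f"expected {expected_rate} Hz audio, got {rate} Hz")
    return speech
```

With this helper, the waveform could be validated before inference with `speech = prepare_waveform(speech, rate)` and then passed to `model(speech)` as before.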