Automatic Speech Recognition
ESPnet
Japanese
How to use from the ESPnet library
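import soundfile
from espnet2.bin.asr_inference import Speech2Text

# Download the pretrained model from the Hugging Face Hub
model = Speech2Text.from_pretrained(
  "Dallyana/EspnetASR"
)

# Read the waveform; the audio should be sampled at 16 kHz
speech, rate = soundfile.read("speech.wav")
# Take the top hypothesis; its first element is the recognized text
text, *_ = model(speech)[0]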

reazonspeech-espnet-v1

reazonspeech-espnet-v1 is an ESPnet model trained for Japanese automatic speech recognition (ASR).

  • This model was trained on 15,000 hours of the ReazonSpeech corpus.
  • Make sure that your audio file is sampled at 16 kHz when using this model.
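
One way to verify the 16 kHz requirement before running inference is to read the sample rate from the file header. The following is a minimal sketch using only Python's standard `wave` module, so it assumes an uncompressed PCM WAV file; the helper names `wav_sample_rate` and `is_16khz` are hypothetical and are not part of ESPnet or ReazonSpeech.

```python
import wave

# 16 kHz, the sample rate the model card asks for
EXPECTED_RATE = 16000


def wav_sample_rate(path):
    """Return the sample rate in Hz stored in a PCM WAV file header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def is_16khz(path):
    """Return True if the WAV file at `path` is sampled at 16 kHz."""
    return wav_sample_rate(path) == EXPECTED_RATE
```

If `is_16khz("speech.wav")` returns False, the audio should be resampled to 16 kHz with an audio tool of your choice before it is passed to the model.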

For more details, please visit the official project page.

