Dallyana/EspnetASR

Automatic Speech Recognition


reazonspeech-espnet-v1

reazonspeech-espnet-v1 is an ESPnet model trained for Japanese automatic speech recognition (ASR).

This model was trained on 15,000 hours of the ReazonSpeech corpus.
Make sure that your audio file is sampled at 16 kHz when using this model.
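
As a quick way to verify the sampling-rate requirement before running recognition, the following is a minimal sketch using only Python's standard `wave` module. The helper names `wav_sample_rate` and `is_16khz` are hypothetical and not part of ESPnet or ReazonSpeech, and the sketch assumes the input is an uncompressed PCM WAV file.

```python
import wave

# Sampling rate the model card asks for (16 kHz).
EXPECTED_RATE_HZ = 16_000


def wav_sample_rate(path):
    """Return the sampling rate in Hz stored in a PCM WAV file's header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def is_16khz(path):
    """Return True if the WAV file at `path` is sampled at 16 kHz."""
    return wav_sample_rate(path) == EXPECTED_RATE_HZ
```

If the check returns `False`, the file needs to be resampled first. One option, assuming ffmpeg is installed, is `ffmpeg -i input.wav -ar 16000 output.wav`, where the file names are placeholders.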

For more details, please visit the official project page.

