TanelAlumae's picture
Update README.md
82d2322 verified
metadata
license: apache-2.0
language:
  - et
pipeline_tag: automatic-speech-recognition

This model is ONNX version of an Icefall Zipformer streaming ASR model for Estonian. It is identical in training details to https://huggingface.co/TalTechNLP/streaming-zipformer-large.et-en, but it's trained on transcript where number expressions are converted from words to digits.

Note that it tends to struggle with more complex numerical expressions (exceeding four digits).

Use sherpa-onnx to do ASR:

E.g., under Linux, using card 3 as input device:

sherpa-onnx-alsa --encoder=encoder.onnx --decoder=decoder.onnx --joiner=joiner.onnx --tokens=tokens.txt --decoding-method=modified_beam_search plughw:3,0