How to use nvidia/diar_sortformer_4spk-v1 with NeMo:

    import nemo.collections.asr as nemo_asr

    # Load the pretrained Sortformer diarization checkpoint from the Hugging Face Hub
    diar_model = nemo_asr.models.SortformerEncLabelModel.from_pretrained("nvidia/diar_sortformer_4spk-v1")
    diar_model.eval()

    # Sortformer is a diarizer, so it exposes diarize() and returns speaker-labeled segments
    predicted_segments = diar_model.diarize(audio=["file.wav"], batch_size=1)
Japanese?
#7, opened by riken12
The information says non-English. I'm wondering how much Japanese data was used compared to English? What's the use if it's only accurate in English?
We did not use Japanese data to train this model. Nevertheless, the model is not highly dependent on the language, so it can work for Japanese and other languages.