Automatic Speech Recognition
NeMo
Safetensors
PyTorch
sortformer
speaker-diarization
speaker-recognition
speech
audio
Transformer
FastConformer
Conformer
NEST
NeMo
Eval Results (legacy)
Instructions to use nvidia/diar_sortformer_4spk-v1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- NeMo
How to use nvidia/diar_sortformer_4spk-v1 with NeMo:
import nemo.collections.asr as nemo_asr asr_model = nemo_asr.models.ASRModel.from_pretrained("nvidia/diar_sortformer_4spk-v1") transcriptions = asr_model.transcribe(["file.wav"]) - Notebooks
- Google Colab
- Kaggle
Model output returns multiple speaker_id every fraction of a second
3
#13 opened 3 months ago
by
doggydogger
Update Readme
#11 opened 6 months ago
by
jbalam-nv
using model on CPU
2
#10 opened 9 months ago
by
mooncy
Remove NC Mention?
2
#9 opened 12 months ago
by
mrrfr
issue about the finetuning of this model
👀 1
1
#8 opened about 1 year ago
by
nassairgnigni
Japanese?
1
#7 opened about 1 year ago
by
riken12