nvidia/diar_streaming_sortformer_4spk-v2

Tags: Automatic Speech Recognition, NeMo, PyTorch, speaker-diarization, speaker-recognition, speech, audio, Transformer, FastConformer, Conformer, NEST, Eval Results (legacy)

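    How to use nvidia/diar_streaming_sortformer_4spk-v2 with NeMo:

    # Sortformer is a speaker-diarization model: load it with its own class and call diarize(), not transcribe()
    from nemo.collections.asr.models import SortformerEncLabelModel
    diar_model = SortformerEncLabelModel.from_pretrained("nvidia/diar_streaming_sortformer_4spk-v2")
    diar_model.eval()

    # "file.wav" is a placeholder path to a local audio file
    predicted_segments = diar_model.diarize(audio=["file.wav"], batch_size=1)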
diar_streaming_sortformer_4spk-v2 / figures
20.6 MB
  • 5 contributors
History: 4 commits
Latest commit: taejinp, "Delete figures/streaming_sortformer_ani.gif" (bb7e1e8, verified), 10 months ago
  • aosc_3spk_example.gif (5.49 MB, xet): "Upload 2 files", 10 months ago
  • aosc_4spk_example.gif (13.6 MB, xet): "Upload 2 files", 10 months ago
  • fifo.png (123 kB, xet): "Upload 5 files", about 1 year ago
  • sortformer-v1-model.png (486 kB, xet): "Upload sortformer-v1-model.png", 10 months ago
  • sortformer_intro.png (325 kB, xet): "Upload 5 files", about 1 year ago
  • streaming_steps.png (637 kB, xet): "Upload 5 files", about 1 year ago