Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

zermok
/
segmentation

Voice Activity Detection
pyannote.audio
PyTorch
pyannote
pyannote-audio-model
audio
voice
speech
speaker
speaker-segmentation
overlapped-speech-detection
resegmentation
Model card Files Files and versions
xet
Community

Instructions to use zermok/segmentation with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • pyannote.audio

    How to use zermok/segmentation with pyannote.audio:

    from pyannote.audio import Model, Inference
    
    model = Model.from_pretrained("zermok/segmentation")
    inference = Inference(model)
    
    # inference on the whole file
    inference("file.wav")
    
    # inference on an excerpt
    from pyannote.core import Segment
    excerpt = Segment(start=2.0, end=5.0)
    inference.crop("file.wav", excerpt)
  • Notebooks
  • Google Colab
  • Kaggle
segmentation / reproducible_research
Ctrl+K
Ctrl+K
  • 4 contributors
History: 3 commits
Hervé Bredin
fix: update RR to match with latest code/paper version
d24d0d1 almost 5 years ago
  • dihard3_custom_split
    feat: rename /paper to /reproducible_research about 5 years ago
  • expected_outputs
    fix: update RR to match with latest code/paper version almost 5 years ago