philschmid/pyannote-segmentation

Voice Activity Detection
pyannote.audio
PyTorch
pyannote
pyannote-audio-model
audio
voice
speech
speaker
speaker-segmentation
overlapped-speech-detection
resegmentation

Instructions to use philschmid/pyannote-segmentation with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • pyannote.audio

    How to use philschmid/pyannote-segmentation with pyannote.audio:

    from pyannote.audio import Model, Inference
    from pyannote.core import Segment

    # load the pretrained segmentation model from the Hugging Face Hub
    model = Model.from_pretrained("philschmid/pyannote-segmentation")
    inference = Inference(model)

    # inference on the whole file
    scores = inference("file.wav")

    # inference on an excerpt (from 2.0 s to 5.0 s)
    excerpt = Segment(start=2.0, end=5.0)
    excerpt_scores = inference.crop("file.wav", excerpt)
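The usage snippet above yields model scores rather than ready-made speech regions. As a minimal sketch of how such scores could be turned into voice activity segments, the following assumes one speech probability per frame in the range 0 to 1 and a fixed frame step; neither is stated on this page. The function `scores_to_regions`, its parameters `frame_step`, `onset`, and `offset`, and the example values are all hypothetical and are not part of pyannote.audio. It uses plain hysteresis thresholding, not the library's own pipeline.

```python
from typing import List, Tuple


def scores_to_regions(
    scores: List[float],
    frame_step: float,
    onset: float = 0.5,
    offset: float = 0.5,
) -> List[Tuple[float, float]]:
    """Turn per-frame speech scores into (start, end) regions in seconds.

    A region opens when a score reaches `onset` or above and closes when
    a later score falls below `offset` (hysteresis thresholding).
    """
    regions: List[Tuple[float, float]] = []
    active = False
    start = 0.0
    for i, score in enumerate(scores):
        t = i * frame_step  # start time of frame i, in seconds
        if not active and score >= onset:
            active = True
            start = t
        elif active and score < offset:
            active = False
            regions.append((start, t))
    if active:
        # close a region that is still open at the end of the scores
        regions.append((start, len(scores) * frame_step))
    return regions


# Hypothetical scores, one every 0.5 s (values made up for illustration).
example_scores = [0.1, 0.2, 0.9, 0.8, 0.3, 0.1, 0.7, 0.9]
regions = scores_to_regions(example_scores, frame_step=0.5, onset=0.6, offset=0.4)
print(regions)
```

Using a higher `onset` than `offset` keeps a region from flickering open and closed when scores hover near a single threshold; setting both to the same value reduces this to simple thresholding.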
pyannote-segmentation / reproducible_research
19 MB
  • 1 contributor
History: 1 commit
philschmid
add
d13283c over 3 years ago
  • dihard3_custom_split
    add over 3 years ago
  • expected_outputs
    add over 3 years ago