drewThomasson/segmentation

Voice Activity Detection
pyannote.audio
PyTorch
pyannote
pyannote-audio-model
audio
voice
speech
speaker
speaker-segmentation
overlapped-speech-detection
resegmentation

Instructions to use drewThomasson/segmentation with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • pyannote.audio

    How to use drewThomasson/segmentation with pyannote.audio:

    from pyannote.audio import Model, Inference
    from pyannote.core import Segment
    
    # load the pretrained model from the Hugging Face Hub
    model = Model.from_pretrained("drewThomasson/segmentation")
    inference = Inference(model)
    
    # inference on the whole file
    inference("file.wav")
    
    # inference on an excerpt (from 2.0 s to 5.0 s)
    excerpt = Segment(start=2.0, end=5.0)
    inference.crop("file.wav", excerpt)
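
The usage snippet stops at raw inference output. Since the repository is tagged for voice activity detection, the sketch below shows one simple way frame-level speech scores could be turned into time segments. It is not pyannote.audio's own post-processing: `scores_to_segments`, the `frame_step` value, the 0.5 threshold, and the example scores are all hypothetical, and the scores are assumed to have already been extracted into a plain Python list with one value per frame.

```python
from typing import List, Optional, Tuple


def scores_to_segments(
    scores: List[float],
    frame_step: float,
    threshold: float = 0.5,
) -> List[Tuple[float, float]]:
    """Turn per-frame speech scores into (start, end) segments in seconds.

    Hypothetical helper for illustration only. A frame counts as speech
    when its score is >= threshold; consecutive speech frames are merged.
    """
    segments: List[Tuple[float, float]] = []
    start: Optional[int] = None  # first frame index of the open segment

    for i, score in enumerate(scores):
        active = score >= threshold
        if active and start is None:
            # speech begins at this frame
            start = i
        elif not active and start is not None:
            # speech ended just before this frame
            segments.append((start * frame_step, i * frame_step))
            start = None

    # close a segment that runs to the end of the scores
    if start is not None:
        segments.append((start * frame_step, len(scores) * frame_step))

    return segments


# Made-up scores, one per frame, with an assumed 0.5 s frame step.
example_scores = [0.1, 0.8, 0.9, 0.2, 0.1, 0.7, 0.6]
print(scores_to_segments(example_scores, frame_step=0.5))
# → [(0.5, 1.5), (2.5, 3.5)]
```

A single fixed threshold is the simplest choice. Real pipelines often use separate onset and offset thresholds and drop very short segments, which this sketch leaves out.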
segmentation / reproducible_research
19 MB
  • 3 contributors
History: 1 commit
Latest commit: "moved files" by Drew (9aee05d, 11 months ago)
  • dihard3_custom_split (moved files, 11 months ago)
  • expected_outputs (moved files, 11 months ago)