Tags: Voice Activity Detection, pyannote.audio, PyTorch, pyannote, pyannote-audio-model, audio, voice, speech, speaker, speaker-diarization, speaker-change-detection, speaker-segmentation, overlapped-speech-detection, resegmentation
How to use pyannote/segmentation-3.0 with pyannote.audio:
    from pyannote.audio import Model, Inference
    from pyannote.core import Segment

    model = Model.from_pretrained("pyannote/segmentation-3.0")
    inference = Inference(model)

    # inference on the whole file
    inference("file.wav")

    # inference on an excerpt
    excerpt = Segment(start=2.0, end=5.0)
    inference.crop("file.wav", excerpt)
Question: edge/mobile deployment, anyone tested?
#11
by 3morixd - opened
We benchmark models on 40 phones (Snapdragon 865) at Dispatch AI (FZE, UAE).
Question: has anyone tested this model on mobile/edge? Interested in:
- Inference speed (t/s)
- Model size after quantization
- RAM usage
Happy to share phone farm benchmark results.
- Dispatch AI (FZE), Sharjah UAE
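For anyone who wants to report comparable numbers, here is a minimal sketch of a timing harness using only the Python standard library. The names `benchmark` and `fake_infer` are hypothetical and not part of pyannote.audio; `fake_infer` is a stand-in workload that would be swapped for a real call such as `lambda: inference("file.wav")`. It reports median latency and a real-time factor (processing time per second of audio), which may be easier to compare for audio models than t/s. The memory figure covers Python-level allocations only, so it is not a substitute for measuring process RAM on the device.

```python
import statistics
import time
import tracemalloc


def benchmark(infer, audio_seconds, runs=5, warmup=1):
    """Time a zero-argument inference callable and return simple metrics."""
    for _ in range(warmup):
        infer()  # warm-up runs are not timed

    # Note: tracemalloc adds some overhead to the timed runs.
    tracemalloc.start()
    timings = []
    for _ in range(runs):
        start = time.perf_counter()
        infer()
        timings.append(time.perf_counter() - start)
    _, peak_bytes = tracemalloc.get_traced_memory()
    tracemalloc.stop()

    median_s = statistics.median(timings)
    return {
        "median_latency_s": median_s,
        # Real-time factor: seconds of compute per second of audio (< 1 is faster than real time).
        "real_time_factor": median_s / audio_seconds,
        # Python-level allocations only; native tensor memory is not tracked.
        "peak_python_mem_mib": peak_bytes / (1024 * 1024),
    }


def fake_infer():
    # Stand-in workload (assumption); replace with a real inference call.
    return sum(i * i for i in range(10_000))


if __name__ == "__main__":
    print(benchmark(fake_infer, audio_seconds=10.0))
```

The `audio_seconds` argument should match the duration of the file passed to the real inference call, otherwise the real-time factor is meaningless.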