Voice Activity Detection · pyannote.audio · PyTorch · pyannote · pyannote-audio-model · audio · voice · speech · speaker · speaker-segmentation · overlapped-speech-detection · resegmentation
Instructions to use anilbs/segmentation with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- pyannote.audio
How to use anilbs/segmentation with pyannote.audio:
```python
from pyannote.audio import Model, Inference

model = Model.from_pretrained("anilbs/segmentation")
inference = Inference(model)

# inference on the whole file
inference("file.wav")

# inference on an excerpt
from pyannote.core import Segment

excerpt = Segment(start=2.0, end=5.0)
inference.crop("file.wav", excerpt)
```
- Notebooks
- Google Colab
- Kaggle
Anil Battalahalli Sreenath committed
Commit f260c50 · 1 Parent(s): 39e3a80
modified README.md
README.md CHANGED
````diff
@@ -41,7 +41,7 @@ Relies on pyannote.audio 2.0 currently in development: see [installation instruc
 
 ```python
 from pyannote.audio.pipelines import VoiceActivityDetection
-pipeline = VoiceActivityDetection(segmentation="
+pipeline = VoiceActivityDetection(segmentation="anilbs/segmentation")
 HYPER_PARAMETERS = {
   # onset/offset activation thresholds
   "onset": 0.5, "offset": 0.5,
````
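The `onset` and `offset` entries in the hunk above are described only as activation thresholds. As a rough illustration of what such a pair of thresholds typically does, here is a minimal pure-Python sketch of hysteresis thresholding over per-frame speech scores. It is not pyannote.audio's implementation: the function name `hysteresis_segments`, the `frame_step` parameter, and the sample scores are all hypothetical and chosen only for this example.

```python
from typing import List, Sequence, Tuple


def hysteresis_segments(
    scores: Sequence[float],
    frame_step: float,
    onset: float = 0.5,
    offset: float = 0.5,
) -> List[Tuple[float, float]]:
    """Turn per-frame speech scores into (start, end) regions in seconds.

    Hypothetical helper for illustration only: a region opens when the
    score rises above `onset` and closes when it falls below `offset`.
    """
    segments: List[Tuple[float, float]] = []
    active = False
    start = 0.0
    for i, score in enumerate(scores):
        t = i * frame_step  # start time of frame i, in seconds
        if not active and score > onset:
            active = True
            start = t
        elif active and score < offset:
            active = False
            segments.append((start, t))
    if active:
        # Close a region that is still open at the end of the signal.
        segments.append((start, len(scores) * frame_step))
    return segments


# Made-up scores, one per 0.5 s frame.
scores = [0.1, 0.6, 0.8, 0.45, 0.7, 0.2, 0.1]
print(hysteresis_segments(scores, frame_step=0.5, onset=0.5, offset=0.5))
# [(0.5, 1.5), (2.0, 2.5)]
print(hysteresis_segments(scores, frame_step=0.5, onset=0.5, offset=0.3))
# [(0.5, 2.5)]
```

With `onset` equal to `offset`, the brief dip to 0.45 splits the speech into two regions. Lowering `offset` to 0.3 keeps the region open through the dip, which is the usual reason for setting the two thresholds separately.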