Automatic Speech Recognition
pyannote.audio
pyannote
pyannote-audio-pipeline
audio
voice
speech
speaker
speaker-diarization
speaker-change-detection
voice-activity-detection
overlapped-speech-detection
- Libraries
- pyannote.audio
How to use eek/speaker-diarization with pyannote.audio:
    from pyannote.audio import Pipeline
    pipeline = Pipeline.from_pretrained("eek/speaker-diarization")

    # inference on the whole file
    pipeline("file.wav")

    # inference on an excerpt
    from pyannote.core import Segment
    excerpt = Segment(start=2.0, end=5.0)

    from pyannote.audio import Audio
    waveform, sample_rate = Audio().crop("file.wav", excerpt)
    pipeline({"waveform": waveform, "sample_rate": sample_rate})
- Notebooks
- Google Colab
- Kaggle
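The pyannote.audio snippet above runs the pipeline but does not show how to read its result. A minimal sketch follows, assuming the pipeline returns a `pyannote.core.Annotation` (the case for the pyannote.audio releases this config appears to target; newer releases may wrap the result differently). `format_turns` is a hypothetical helper name, not part of pyannote.

```python
def format_turns(diarization):
    """Return one 'start - end: speaker' line per speaker turn.

    `diarization` is assumed to behave like pyannote.core.Annotation:
    itertracks(yield_label=True) yields (segment, track, label) triples,
    and each segment exposes .start and .end in seconds.
    """
    lines = []
    for turn, _track, speaker in diarization.itertracks(yield_label=True):
        lines.append(f"{turn.start:.1f}s - {turn.end:.1f}s: {speaker}")
    return lines


# Usage (requires pyannote.audio and access to the model, so left commented):
# from pyannote.audio import Pipeline
# pipeline = Pipeline.from_pretrained("eek/speaker-diarization")
# print("\n".join(format_turns(pipeline("file.wav"))))
```

Keeping the formatting in a small function that only relies on `itertracks` makes it easy to check without downloading any model.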
Committed by Radu-Sebastian Amarie
Commit · a3288b2
Parent(s): 6fac4b0
Update: speaker segmentation
Files changed: config.yaml (+2 -2)
config.yaml CHANGED
@@ -6,8 +6,8 @@ pipeline:
 clustering: AgglomerativeClustering
 embedding: speechbrain/spkrec-ecapa-voxceleb
 embedding_batch_size: 32
-embedding_exclude_overlap:
-segmentation:
+embedding_exclude_overlap: false
+segmentation: eek/segmentation
 segmentation_batch_size: 32
 
 params:
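The change above fills in two keys that previously had no value; a YAML parser reads a key with an empty value as null. The sketch below illustrates the difference with a deliberately simple, stdlib-only check. `empty_keys` is a hypothetical helper that only handles flat `key: value` lines (it is not a YAML parser), and the fragments reproduce only the six parameter lines from the diff, without their original indentation.

```python
def empty_keys(fragment: str) -> list:
    """Return the keys of a flat `key: value` fragment whose value is empty."""
    missing = []
    for line in fragment.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or ":" not in line:
            continue
        # Split on the first colon only, so values may themselves contain colons.
        key, _, value = line.partition(":")
        if not value.strip():
            missing.append(key.strip())
    return missing


BEFORE = """\
clustering: AgglomerativeClustering
embedding: speechbrain/spkrec-ecapa-voxceleb
embedding_batch_size: 32
embedding_exclude_overlap:
segmentation:
segmentation_batch_size: 32
"""

AFTER = """\
clustering: AgglomerativeClustering
embedding: speechbrain/spkrec-ecapa-voxceleb
embedding_batch_size: 32
embedding_exclude_overlap: false
segmentation: eek/segmentation
segmentation_batch_size: 32
"""

print(empty_keys(BEFORE))  # ['embedding_exclude_overlap', 'segmentation']
print(empty_keys(AFTER))   # []
```

How pyannote.audio reacts to a null `segmentation` entry is not stated in the commit; the sketch only shows which keys were unset before the update.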