Add model card (README)
README.md
---
license: mit
tags:
- audio
- speaker-diarization
- onnx
- pyannote
---

# segmentation-3.0 ONNX

ONNX export of [pyannote/segmentation-3.0](https://huggingface.co/pyannote/segmentation-3.0) for speaker diarization (voice activity detection and speaker segmentation).

- **Input**: waveform `[batch, channels, samples]`, 16 kHz mono, e.g. `[1, 1, 160000]` for 10 seconds.
- **Output**: logits `[batch, num_frames, num_classes]` (7 classes, powerset encoding; see the decoding sketch below).
- Exported with opset 14. Use ONNX Runtime to run the model on device (Core ML conversion is not supported for this model due to control-flow ops).
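
Below is a minimal inference and decoding sketch with ONNX Runtime. The file name `model.onnx` and the powerset class order (`{}`, `{0}`, `{1}`, `{2}`, `{0,1}`, `{0,2}`, `{1,2}`) are assumptions, not guarantees from this export; check them against the files in this repository and pyannote.audio's powerset mapping before relying on the decoded output.

```python
# Sketch only: file name and powerset class order are assumptions (see note above).
import numpy as np
import onnxruntime as ort

session = ort.InferenceSession("model.onnx")            # assumed file name
input_name = session.get_inputs()[0].name

# 10 seconds of 16 kHz mono audio, shaped [batch, channels, samples].
waveform = np.zeros((1, 1, 160000), dtype=np.float32)

logits = session.run(None, {input_name: waveform})[0]   # [1, num_frames, 7]

# Assumed powerset order: {}, {0}, {1}, {2}, {0,1}, {0,2}, {1,2} (up to 3 local speakers).
POWERSET = [[], [0], [1], [2], [0, 1], [0, 2], [1, 2]]

# Hard powerset decoding: pick the most likely class per frame and mark its speakers active.
best = logits.argmax(axis=-1)                            # [1, num_frames]
activity = np.zeros(best.shape + (3,), dtype=np.float32)
for class_idx, speakers in enumerate(POWERSET):
    for speaker in speakers:
        activity[best == class_idx, speaker] = 1.0       # [1, num_frames, 3] binary activity
```

The resulting `activity` array gives per-frame, per-speaker on/off decisions for a single 10-second window; turning many windows into a full diarization still requires the stitching and clustering steps of the pyannote.audio pipeline.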

Derived from pyannote.audio; see [pyannote/segmentation-3.0](https://huggingface.co/pyannote/segmentation-3.0) for the original model and license.