card: unified LiteRT model card with soniqo.audio + ecosystem links
README.md
CHANGED
|
@@ -1,20 +1,39 @@
|
|
| 1 |
---
|
| 2 |
license: mit
|
| 3 |
-
language:
|
| 4 |
tags:
|
| 5 |
- speaker-diarization
|
| 6 |
- voice-activity-detection
|
| 7 |
- pyannote
|
| 8 |
- litert
|
| 9 |
- tflite
|
| 10 |
- on-device
|
| 11 |
-
-
|
| 12 |
base_model: pyannote/segmentation-3.0
|
| 13 |
library_name: litert
|
| 14 |
-
pipeline_tag:
|
| 15 |
---
|
| 16 |
|
| 17 |
-
# Pyannote Segmentation 3.0 - LiteRT
|
| 18 |
|
| 19 |
Powerset speaker segmentation (up to 3 local speakers) for Android,
|
| 20 |
exported in a streaming 1-second chunk configuration.
|
|
@@ -99,3 +118,35 @@ Upstream: [pyannote/segmentation-3.0](https://huggingface.co/pyannote/segmentati
|
|
| 99 |
- [speech-android](https://github.com/soniqo/speech-android): Android SDK
|
| 100 |
- [soniqo.audio](https://soniqo.audio): website
|
| 101 |
- [blog](https://soniqo.audio/blog): blog
|
| 1 |
---
|
| 2 |
license: mit
|
| 3 |
+
language:
|
| 4 |
+
- multilingual
|
| 5 |
tags:
|
| 6 |
- speaker-diarization
|
| 7 |
- voice-activity-detection
|
| 8 |
- pyannote
|
| 9 |
+
- diarization
|
| 10 |
- litert
|
| 11 |
- tflite
|
| 12 |
- on-device
|
| 13 |
+
- soniqo
|
| 14 |
+
- speech-cloud
|
| 15 |
+
- speech-core
|
| 16 |
base_model: pyannote/segmentation-3.0
|
| 17 |
library_name: litert
|
| 18 |
+
pipeline_tag: voice-activity-detection
|
| 19 |
---
|
| 20 |
|
| 21 |
+
# Pyannote Segmentation 3.0 - LiteRT
|
| 22 |
+
|
| 23 |
+
Speaker-aware segmentation for diarization pipelines. 16 kHz, 5-second windows.
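
Taking the 16 kHz and 5-second figures above at face value, each window holds 16,000 × 5 = 80,000 samples. The sketch below shows one way to cut a mono sample buffer into windows of that size. The function name `split_into_windows`, the non-overlapping stride, and the zero-padding of the last window are assumptions made for illustration; the card does not specify how the bundle expects audio to be framed.

```python
SAMPLE_RATE = 16_000          # Hz, from the card
WINDOW_SECONDS = 5            # seconds, from the card
WINDOW_SAMPLES = SAMPLE_RATE * WINDOW_SECONDS  # 80,000 samples per window


def split_into_windows(samples, window=WINDOW_SAMPLES):
    """Cut a mono sample sequence into fixed-size windows.

    Assumption: windows do not overlap and the final partial window
    is zero-padded to full length.
    """
    windows = []
    for start in range(0, len(samples), window):
        chunk = list(samples[start:start + window])
        chunk.extend([0.0] * (window - len(chunk)))  # pad the tail window
        windows.append(chunk)
    return windows
```

A real diarization pipeline would often use overlapping windows and stitch the results; the stride here is kept equal to the window length only to keep the arithmetic visible.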
|
| 24 |
+
|
| 25 |
+
> Part of the [**soniqo.audio**](https://soniqo.audio) speech toolkit,
|
| 26 |
+
> an open, runtime-portable stack for speech AI. This bundle is the
|
| 27 |
+
> **LiteRT** export; served from cloud by
|
| 28 |
+
> [`speech-cloud`](https://github.com/soniqo/speech-cloud) and embeddable
|
| 29 |
+
> on-device through [`speech-core`](https://github.com/soniqo/speech-core).
|
| 30 |
+
> Browse all LiteRT bundles in the
|
| 31 |
+
> [**soniqo LiteRT collection**](https://huggingface.co/collections/soniqo/litert).
|
| 32 |
+
|
| 33 |
+
## Use cases on soniqo.audio
|
| 34 |
+
|
| 35 |
+
- [Meeting transcription](https://soniqo.audio/transcription/)
|
| 36 |
+
- [Long-form transcription](https://soniqo.audio/long-form-speech/)
|
| 37 |
|
| 38 |
Powerset speaker segmentation (up to 3 local speakers) for Android,
|
| 39 |
exported in a streaming 1-second chunk configuration.
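
To make "powerset" concrete: instead of one independent output per speaker, each frame is assigned a single class that names a set of active speakers. The sketch below enumerates those sets and decodes per-frame class indices into per-speaker activity flags. It assumes at most two speakers overlap at once, which gives 7 classes for 3 speakers, and it assumes classes are ordered by subset size; both assumptions follow the upstream pyannote/segmentation-3.0 convention as I understand it and should be checked against this export. The function names are hypothetical.

```python
from itertools import combinations


def powerset_classes(num_speakers=3, max_overlap=2):
    """List speaker subsets in size order: (), (0,), (1,), (2,), (0, 1), ...

    Assumption: this ordering matches the model's output class order.
    """
    classes = []
    for size in range(max_overlap + 1):
        classes.extend(combinations(range(num_speakers), size))
    return classes


def decode_frames(class_ids, num_speakers=3, max_overlap=2):
    """Turn per-frame powerset class indices into per-speaker 0/1 flags."""
    classes = powerset_classes(num_speakers, max_overlap)
    return [
        [1 if spk in classes[cid] else 0 for spk in range(num_speakers)]
        for cid in class_ids
    ]
```

For example, `decode_frames([0, 2, 4])` yields `[[0, 0, 0], [0, 1, 0], [1, 1, 0]]`: silence, then the second local speaker alone, then the first and second speakers overlapping. The class index per frame would typically come from an argmax over the model's per-frame scores.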
|
|
|
|
| 118 |
- [speech-android](https://github.com/soniqo/speech-android): Android SDK
|
| 119 |
- [soniqo.audio](https://soniqo.audio): website
|
| 120 |
- [blog](https://soniqo.audio/blog): blog
|
| 121 |
+
|
| 122 |
+
## Ecosystem
|
| 123 |
+
|
| 124 |
+
- [**soniqo.audio**](https://soniqo.audio): use-case explorer (transcription, voice cloning, live ASR, voice agents).
|
| 125 |
+
- [**speech-cloud**](https://github.com/soniqo/speech-cloud): C++ cloud API server. Runs LiteRT models behind `/v1/transcribe`, `/v1/realtime`, and (planned) `/v1/audio/speech`.
|
| 126 |
+
- [**speech-core**](https://github.com/soniqo/speech-core): C++ orchestration library for voice agents. Defines the abstract `STTInterface`, `TTSInterface`, `VADInterface`, and `EnhancerInterface`; LiteRT implementations plug straight into them.
|
| 127 |
+
- [**speech-models**](https://github.com/soniqo/speech-models): the exporters that produced this bundle.
|
| 128 |
+
- [**speech-swift**](https://github.com/soniqo/speech-swift): Apple Silicon MLX companion runtime (model-specific MLX bundles linked above where applicable).
|
| 129 |
+
|
| 130 |
+
## Other LiteRT models in this collection
|
| 131 |
+
|
| 132 |
+
**ASR / Transcription**
|
| 133 |
+
|
| 134 |
+
- [Parakeet TDT 0.6B v3 - LiteRT (INT8)](https://huggingface.co/soniqo/Parakeet-TDT-0.6B-v3-LiteRT-INT8)
|
| 135 |
+
- [Nemotron Speech Streaming 0.6B - LiteRT](https://huggingface.co/soniqo/Nemotron-Speech-Streaming-LiteRT)
|
| 136 |
+
- [Omnilingual ASR CTC 300M - LiteRT](https://huggingface.co/soniqo/Omnilingual-ASR-CTC-300M-LiteRT)
|
| 137 |
+
- [Omnilingual ASR CTC 300M - LiteRT (INT8)](https://huggingface.co/soniqo/Omnilingual-ASR-CTC-300M-LiteRT-INT8)
|
| 138 |
+
- [Qwen3 ASR 0.6B Encoder - LiteRT (INT8)](https://huggingface.co/soniqo/Qwen3-ASR-0.6B-Encoder-LiteRT-INT8)
|
| 139 |
+
|
| 140 |
+
**VAD / Diarization**
|
| 141 |
+
|
| 142 |
+
- [Silero VAD v5 - LiteRT](https://huggingface.co/soniqo/Silero-VAD-v5-LiteRT)
|
| 143 |
+
- [WeSpeaker ResNet34-LM - LiteRT](https://huggingface.co/soniqo/WeSpeaker-ResNet34-LM-LiteRT)
|
| 144 |
+
|
| 145 |
+
**TTS / Voice Cloning**
|
| 146 |
+
|
| 147 |
+
- [VoxCPM2 - LiteRT (INT8)](https://huggingface.co/soniqo/VoxCPM2-LiteRT-INT8)
|
| 148 |
+
|
| 149 |
+
## License
|
| 150 |
+
|
| 151 |
+
This bundle inherits the upstream model license (**MIT**). See the
|
| 152 |
+
linked `base_model` repository for the full terms.
|