aufklarer commited on
Commit
615cbc5
Β·
verified Β·
1 Parent(s): 97d0228

card: unified LiteRT model card with soniqo.audio + ecosystem links

Browse files
Files changed (1) hide show
  1. README.md +55 -4
README.md CHANGED
@@ -1,20 +1,39 @@
1
  ---
2
  license: mit
3
- language: multilingual
 
4
  tags:
5
  - speaker-diarization
6
  - voice-activity-detection
7
  - pyannote
 
8
  - litert
9
  - tflite
10
  - on-device
11
- - android
 
 
12
  base_model: pyannote/segmentation-3.0
13
  library_name: litert
14
- pipeline_tag: automatic-speech-recognition
15
  ---
16
 
17
- # Pyannote Segmentation 3.0 β€” LiteRT (streaming)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
18
 
19
  Powerset speaker segmentation (up to 3 local speakers) for Android,
20
  exported in a streaming 1-second chunk configuration.
@@ -99,3 +118,35 @@ Upstream: [pyannote/segmentation-3.0](https://huggingface.co/pyannote/segmentati
99
  - [speech-android](https://github.com/soniqo/speech-android) β€” Android SDK
100
  - [soniqo.audio](https://soniqo.audio) β€” website
101
  - [blog](https://soniqo.audio/blog) β€” blog
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: mit
3
+ language:
4
+ - multilingual
5
  tags:
6
  - speaker-diarization
7
  - voice-activity-detection
8
  - pyannote
9
+ - diarization
10
  - litert
11
  - tflite
12
  - on-device
13
+ - soniqo
14
+ - speech-cloud
15
+ - speech-core
16
  base_model: pyannote/segmentation-3.0
17
  library_name: litert
18
+ pipeline_tag: voice-activity-detection
19
  ---
20
 
21
+ # Pyannote Segmentation 3.0 β€” LiteRT
22
+
23
+ Speaker-aware segmentation for diarization pipelines. 16 kHz, 5-second windows.
24
+
25
+ > Part of the [**soniqo.audio**](https://soniqo.audio) speech toolkit β€”
26
+ > an open, runtime-portable stack for speech AI. This bundle is the
27
+ > **LiteRT** export; served from cloud by
28
+ > [`speech-cloud`](https://github.com/soniqo/speech-cloud) and embeddable
29
+ > on-device through [`speech-core`](https://github.com/soniqo/speech-core).
30
+ > Browse all LiteRT bundles in the
31
+ > [**soniqo LiteRT collection**](https://huggingface.co/collections/soniqo/litert).
32
+
33
+ ## Use cases on soniqo.audio
34
+
35
+ - [Meeting transcription](https://soniqo.audio/transcription/)
36
+ - [Long-form transcription](https://soniqo.audio/long-form-speech/)
37
 
38
  Powerset speaker segmentation (up to 3 local speakers) for Android,
39
  exported in a streaming 1-second chunk configuration.
 
118
  - [speech-android](https://github.com/soniqo/speech-android) β€” Android SDK
119
  - [soniqo.audio](https://soniqo.audio) β€” website
120
  - [blog](https://soniqo.audio/blog) β€” blog
121
+
122
+ ## Ecosystem
123
+
124
+ - [**soniqo.audio**](https://soniqo.audio) β€” use-case explorer (transcription, voice cloning, live ASR, voice agents).
125
+ - [**speech-cloud**](https://github.com/soniqo/speech-cloud) β€” C++ cloud API server. Runs LiteRT models behind `/v1/transcribe`, `/v1/realtime`, and (planned) `/v1/audio/speech`.
126
+ - [**speech-core**](https://github.com/soniqo/speech-core) β€” C++ orchestration library for voice agents. Abstract `STTInterface` / `TTSInterface` / `VADInterface` / `EnhancerInterface`; LiteRT implementations plug straight into the interfaces.
127
+ - [**speech-models**](https://github.com/soniqo/speech-models) β€” the exporters that produced this bundle.
128
+ - [**speech-swift**](https://github.com/soniqo/speech-swift) β€” Apple Silicon MLX companion runtime (model-specific MLX bundles linked above where applicable).
129
+
130
+ ## Other LiteRT models in this collection
131
+
132
+ **ASR / Transcription**
133
+
134
+ - [Parakeet TDT 0.6B v3 β€” LiteRT (INT8)](https://huggingface.co/soniqo/Parakeet-TDT-0.6B-v3-LiteRT-INT8)
135
+ - [Nemotron Speech Streaming 0.6B β€” LiteRT](https://huggingface.co/soniqo/Nemotron-Speech-Streaming-LiteRT)
136
+ - [Omnilingual ASR CTC 300M β€” LiteRT](https://huggingface.co/soniqo/Omnilingual-ASR-CTC-300M-LiteRT)
137
+ - [Omnilingual ASR CTC 300M β€” LiteRT (INT8)](https://huggingface.co/soniqo/Omnilingual-ASR-CTC-300M-LiteRT-INT8)
138
+ - [Qwen3 ASR 0.6B Encoder β€” LiteRT (INT8)](https://huggingface.co/soniqo/Qwen3-ASR-0.6B-Encoder-LiteRT-INT8)
139
+
140
+ **VAD / Diarization**
141
+
142
+ - [Silero VAD v5 β€” LiteRT](https://huggingface.co/soniqo/Silero-VAD-v5-LiteRT)
143
+ - [WeSpeaker ResNet34-LM β€” LiteRT](https://huggingface.co/soniqo/WeSpeaker-ResNet34-LM-LiteRT)
144
+
145
+ **TTS / Voice Cloning**
146
+
147
+ - [VoxCPM2 β€” LiteRT (INT8)](https://huggingface.co/soniqo/VoxCPM2-LiteRT-INT8)
148
+
149
+ ## License
150
+
151
+ This bundle inherits the upstream model license (**mit**). See the
152
+ linked `base_model` repository for the full terms.