Spaces:
Running
Running
Update org card content
Browse files
README.md
CHANGED
|
@@ -1,10 +1,49 @@
|
|
| 1 |
---
|
| 2 |
title: README
|
| 3 |
-
emoji:
|
| 4 |
-
colorFrom:
|
| 5 |
-
colorTo:
|
| 6 |
sdk: static
|
| 7 |
pinned: false
|
| 8 |
---
|
| 9 |
|
| 10 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
title: README
|
| 3 |
+
emoji: "📡"
|
| 4 |
+
colorFrom: gray
|
| 5 |
+
colorTo: blue
|
| 6 |
sdk: static
|
| 7 |
pinned: false
|
| 8 |
---
|
| 9 |
|
| 10 |
+
# trunk-reporter
|
| 11 |
+
|
| 12 |
+
Open-source tools for P25 trunked radio transcription and monitoring.
|
| 13 |
+
|
| 14 |
+
We build speech recognition models and infrastructure for public safety radio -- law enforcement, fire, EMS, public works, transit, and more. Everything from real-time transcription to edge deployment on a Raspberry Pi.
|
| 15 |
+
|
| 16 |
+
## Models
|
| 17 |
+
|
| 18 |
+
### IMBE-ASR -- Speech Recognition from Vocoder Parameters
|
| 19 |
+
|
| 20 |
+
Skip audio reconstruction entirely. ASR directly from P25 IMBE codec parameters (4.4 kbps). **1.9% WER** on LibriSpeech-IMBE with beam search.
|
| 21 |
+
|
| 22 |
+
| Model | Params | WER | Use Case |
|
| 23 |
+
|-------|--------|-----|----------|
|
| 24 |
+
| [imbe-asr-large-1024d](https://huggingface.co/trunk-reporter/imbe-asr-large-1024d) | 290M | 1.9% (beam) | Best accuracy, server/desktop |
|
| 25 |
+
| [imbe-asr-base-512d](https://huggingface.co/trunk-reporter/imbe-asr-base-512d) | 48.6M | 18.9% (greedy) | Edge deployment (Pi 5, 15x RT) |
|
| 26 |
+
| [imbe-asr-base-512d-p25](https://huggingface.co/trunk-reporter/imbe-asr-base-512d-p25) | 48.6M | -- | Real P25 dispatch fine-tuned |
|
| 27 |
+
|
| 28 |
+
All models available in SafeTensors, ONNX fp32, and ONNX int8/uint8 for edge deployment. The 290M model runs at 3.6x real-time on a Raspberry Pi 5 via a 70KB C binary.
|
| 29 |
+
|
| 30 |
+
[View collection](https://huggingface.co/collections/trunk-reporter/imbe-asr-speech-recognition-from-vocoder-parameters-69c0a4f68ef670b5bf68449d) | [GitHub](https://github.com/trunk-reporter/imbe-asr)
|
| 31 |
+
|
| 32 |
+
### Qwen3-ASR P25 -- Fine-tuned Audio ASR
|
| 33 |
+
|
| 34 |
+
| Model | Base | Use Case |
|
| 35 |
+
|-------|------|----------|
|
| 36 |
+
| [qwen3-asr-p25-0.6B](https://huggingface.co/trunk-reporter/qwen3-asr-p25-0.6B) | Qwen3-ASR 0.6B | P25 audio transcription, pseudo-labeling |
|
| 37 |
+
|
| 38 |
+
Fine-tuned on P25 dispatch audio for conventional audio-based ASR. Used in our pseudo-labeling pipeline alongside Whisper large-v3.
|
| 39 |
+
|
| 40 |
+
[GitHub](https://github.com/trunk-reporter/qwen3-asr-server)
|
| 41 |
+
|
| 42 |
+
### Coming Soon
|
| 43 |
+
|
| 44 |
+
- **IMBENet** -- Neural IMBE vocoder for high-quality P25 audio reconstruction
|
| 45 |
+
|
| 46 |
+
## Links
|
| 47 |
+
|
| 48 |
+
- [GitHub](https://github.com/trunk-reporter)
|
| 49 |
+
- [trunk-recorder](https://github.com/trunk-reporter/trunk-recorder) (fork with IMBE tap support)
|