cstr's picture
Add model card README
56e0ac3 verified
---
license: mit
language:
- en
pipeline_tag: automatic-speech-recognition
tags:
- audio
- speech-recognition
- transcription
- gguf
- moonshine
- streaming
- lightweight
library_name: ggml
base_model: UsefulSensors/moonshine-streaming-tiny
---
# Moonshine Streaming Tiny -- GGUF
GGUF conversions and quantisations of [`UsefulSensors/moonshine-streaming-tiny`](https://huggingface.co/UsefulSensors/moonshine-streaming-tiny) for use with **[CrispStrobe/CrispASR](https://github.com/CrispStrobe/CrispASR)**.
## Available variants
| File | Quant | Size | Notes |
|---|---|---|---|
| `moonshine-streaming-tiny.gguf` | F32 | 168 MB | Full precision |
| `moonshine-streaming-tiny-q4_k.gguf` | Q4_K | 31 MB | Quantized |
## Model details
- **Architecture:** Streaming encoder-decoder ASR. Raw-waveform audio frontend (no mel) + sliding-window transformer encoder (6L, 320d) + autoregressive transformer decoder (6L, 320d, SiLU-gated MLP, partial RoPE)
- **Parameters:** 34M
- **Languages:** English
- **License:** MIT
- **Source:** [`UsefulSensors/moonshine-streaming-tiny`](https://huggingface.co/UsefulSensors/moonshine-streaming-tiny)
- **Designed for:** Low-latency streaming ASR on edge devices
## Usage with CrispASR
```bash
./build/bin/crispasr --backend moonshine-streaming -m moonshine-streaming-tiny-q4_k.gguf -f audio.wav
```
## Notes
- Tokenizer (`tokenizer.bin`) must be in the same directory as the model file
- Streaming architecture: sliding-window attention with 80ms lookahead
- Audio frontend processes raw waveform (no mel spectrogram needed)