cstr's picture
Add model card README
56e0ac3 verified
metadata
license: mit
language:
  - en
pipeline_tag: automatic-speech-recognition
tags:
  - audio
  - speech-recognition
  - transcription
  - gguf
  - moonshine
  - streaming
  - lightweight
library_name: ggml
base_model: UsefulSensors/moonshine-streaming-tiny

Moonshine Streaming Tiny -- GGUF

GGUF conversions and quantisations of UsefulSensors/moonshine-streaming-tiny for use with CrispStrobe/CrispASR.

Available variants

File Quant Size Notes
moonshine-streaming-tiny.gguf F32 168 MB Full precision
moonshine-streaming-tiny-q4_k.gguf Q4_K 31 MB Quantized

Model details

  • Architecture: Streaming encoder-decoder ASR. Raw-waveform audio frontend (no mel) + sliding-window transformer encoder (6L, 320d) + autoregressive transformer decoder (6L, 320d, SiLU-gated MLP, partial RoPE)
  • Parameters: 34M
  • Languages: English
  • License: MIT
  • Source: UsefulSensors/moonshine-streaming-tiny
  • Designed for: Low-latency streaming ASR on edge devices

Usage with CrispASR

./build/bin/crispasr --backend moonshine-streaming -m moonshine-streaming-tiny-q4_k.gguf -f audio.wav

Notes

  • Tokenizer (tokenizer.bin) must be in the same directory as the model file
  • Streaming architecture: sliding-window attention with 80ms lookahead
  • Audio frontend processes raw waveform (no mel spectrogram needed)