Add model card README

56e0ac3 verified about 1 month ago

1.57 kB

license: mit
language:
  - en
pipeline_tag: automatic-speech-recognition
tags:
  - audio
  - speech-recognition
  - transcription
  - gguf
  - moonshine
  - streaming
  - lightweight
library_name: ggml
base_model: UsefulSensors/moonshine-streaming-tiny

Moonshine Streaming Tiny -- GGUF

GGUF conversions and quantisations of UsefulSensors/moonshine-streaming-tiny for use with CrispStrobe/CrispASR.

Available variants

File	Quant	Size	Notes
`moonshine-streaming-tiny.gguf`	F32	168 MB	Full precision
`moonshine-streaming-tiny-q4_k.gguf`	Q4_K	31 MB	Quantized

Model details

Architecture: Streaming encoder-decoder ASR. Raw-waveform audio frontend (no mel) + sliding-window transformer encoder (6L, 320d) + autoregressive transformer decoder (6L, 320d, SiLU-gated MLP, partial RoPE)
Parameters: 34M
Languages: English
License: MIT
Source: UsefulSensors/moonshine-streaming-tiny
Designed for: Low-latency streaming ASR on edge devices

Usage with CrispASR

./build/bin/crispasr --backend moonshine-streaming -m moonshine-streaming-tiny-q4_k.gguf -f audio.wav

Notes

Tokenizer (tokenizer.bin) must be in the same directory as the model file
Streaming architecture: sliding-window attention with 80ms lookahead
Audio frontend processes raw waveform (no mel spectrogram needed)