GGUF + pure-C++ runtime in CrispASR — Moonshine streaming small

by cstr - opened May 1


The streaming Moonshine variants run through CrispASR's moonshine-streaming backend (src/moonshine_streaming.cpp). This is a separate runtime from offline Moonshine because the encoder is sliding-window and takes the raw waveform as input.
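To illustrate what "sliding-window" usually means for an encoder, here is a minimal sketch of a banded attention mask in which each frame only sees a bounded neighbourhood. This is the general technique, not code from src/moonshine_streaming.cpp: the function name and the left/right window sizes are hypothetical, and the real backend may organise this quite differently.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical helper: mask[q][k] is true when query frame q may attend to
// key frame k, i.e. when k lies within [q - left, q + right] (clamped to
// the valid frame range). The window sizes are illustrative assumptions.
std::vector<std::vector<bool>> sliding_window_mask(std::size_t n_frames,
                                                   std::size_t left,
                                                   std::size_t right) {
    std::vector<std::vector<bool>> mask(n_frames,
                                        std::vector<bool>(n_frames, false));
    for (std::size_t q = 0; q < n_frames; ++q) {
        const std::size_t lo = q > left ? q - left : 0;           // clamp at 0
        const std::size_t hi = std::min(n_frames - 1, q + right); // clamp at end
        for (std::size_t k = lo; k <= hi; ++k) {
            mask[q][k] = true;
        }
    }
    return mask;
}
```

Because each frame needs only a fixed amount of past and future context, audio can be encoded as it arrives instead of waiting for the whole utterance, which is what makes such an encoder a fit for --mic input.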

Auto-detected from GGUF metadata; the same backend handles tiny (~34M), small (~110M), and medium (~245M). The decoder is autoregressive with a KV cache, and the companion tokenizer is fetched alongside the GGUF.
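For readers unfamiliar with GGUF: the metadata lives in a key/value section that follows a small fixed header. A minimal sketch of parsing that header, assuming the public little-endian GGUF layout (4-byte magic "GGUF", uint32 version, uint64 tensor count, uint64 metadata KV count). The struct and function names here are hypothetical; CrispASR presumably relies on a full GGUF loader rather than anything like this.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical struct mirroring the fixed-size GGUF header fields.
struct GgufHeader {
    std::uint32_t version;
    std::uint64_t tensor_count;
    std::uint64_t metadata_kv_count;
};

// Read a little-endian unsigned integer of type T from raw bytes.
template <typename T>
static T read_le(const std::uint8_t* p) {
    T v = 0;
    for (std::size_t i = 0; i < sizeof(T); ++i) {
        v |= static_cast<T>(p[i]) << (8 * i);
    }
    return v;
}

// Returns true and fills `out` if `buf` starts with a plausible GGUF header.
bool parse_gguf_header(const std::vector<std::uint8_t>& buf, GgufHeader& out) {
    constexpr std::size_t kHeaderSize = 4 + 4 + 8 + 8;
    if (buf.size() < kHeaderSize) return false;
    if (std::memcmp(buf.data(), "GGUF", 4) != 0) return false; // magic check
    out.version           = read_le<std::uint32_t>(buf.data() + 4);
    out.tensor_count      = read_le<std::uint64_t>(buf.data() + 8);
    out.metadata_kv_count = read_le<std::uint64_t>(buf.data() + 16);
    return true;
}
```

The key/value pairs that come after this header are what a loader would inspect to decide which model variant it has been handed.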

Pre-quantised GGUFs (MIT): cstr/moonshine-streaming-small-GGUF

./build/bin/crispasr --backend moonshine-streaming \
    -m moonshine-streaming-small-q4_k.gguf --mic

Sibling sizes: -tiny, -medium. Offline siblings at cstr/moonshine-{tiny,base}-GGUF + 6 language variants.
