GGUF + pure-C++ runtime in CrispASR — Moonshine streaming small

#2
by cstr - opened

The streaming Moonshine variants run through CrispASR's moonshine-streaming backend (src/moonshine_streaming.cpp). It is a separate runtime from offline Moonshine because the streaming encoder is sliding-window and operates on the raw waveform.
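To make "sliding-window" concrete, here is a minimal sketch of a local attention mask, assuming the term means each encoder frame attends only to a bounded neighbourhood of frames. The function names and the `left`/`right` window sizes are hypothetical and are not taken from src/moonshine_streaming.cpp or from the model's configuration.

```cpp
#include <cstddef>
#include <vector>

// Returns true if query frame i may attend to key frame j when the window
// covers `left` past frames and `right` future frames.
// Both window sizes are hypothetical parameters, not values from the model.
inline bool in_window(std::size_t i, std::size_t j,
                      std::size_t left, std::size_t right) {
    if (j <= i) {
        return i - j <= left;   // j is the current frame or in the past
    }
    return j - i <= right;      // j is in the future (lookahead)
}

// Builds a dense n x n mask in row-major order:
// mask[i * n + j] is true when frame i is allowed to attend to frame j.
std::vector<bool> sliding_window_mask(std::size_t n,
                                      std::size_t left,
                                      std::size_t right) {
    std::vector<bool> mask(n * n, false);
    for (std::size_t i = 0; i < n; ++i) {
        for (std::size_t j = 0; j < n; ++j) {
            mask[i * n + j] = in_window(i, j, left, right);
        }
    }
    return mask;
}
```

With `right = 0` the mask is fully causal. A small non-zero `right` trades latency for a little future context, which is the usual knob in streaming encoders.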

The model is auto-detected from GGUF metadata; the same backend handles tiny (34M), small (110M), and medium (~245M). The decoder is autoregressive with a KV cache, and the companion tokenizer is fetched alongside the GGUF.
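For readers unfamiliar with the container, this is a rough sketch of reading the fixed GGUF header that precedes the metadata key/value section. It assumes GGUF version 2 or later (64-bit counts) and parses from an in-memory buffer. `GgufHeader`, `read_le`, and `parse_gguf_header` are illustrative names, not CrispASR functions, and the post does not say which metadata keys the backend inspects.

```cpp
#include <cstddef>
#include <cstdint>
#include <optional>
#include <vector>

// Fixed-size fields at the start of a GGUF file (version 2 or later).
struct GgufHeader {
    std::uint32_t version;
    std::uint64_t tensor_count;
    std::uint64_t metadata_kv_count;
};

// Reads an n-byte little-endian unsigned integer starting at p.
static std::uint64_t read_le(const std::uint8_t* p, std::size_t n) {
    std::uint64_t v = 0;
    for (std::size_t k = 0; k < n; ++k) {
        v |= static_cast<std::uint64_t>(p[k]) << (8 * k);
    }
    return v;
}

// Returns the header, or std::nullopt if the buffer is too short
// or does not begin with the "GGUF" magic bytes.
std::optional<GgufHeader> parse_gguf_header(const std::vector<std::uint8_t>& buf) {
    constexpr std::size_t kHeaderSize = 4 + 4 + 8 + 8;  // magic, version, two counts
    if (buf.size() < kHeaderSize) {
        return std::nullopt;
    }
    if (buf[0] != 'G' || buf[1] != 'G' || buf[2] != 'U' || buf[3] != 'F') {
        return std::nullopt;
    }
    GgufHeader h{};
    h.version = static_cast<std::uint32_t>(read_le(buf.data() + 4, 4));
    h.tensor_count = read_le(buf.data() + 8, 8);
    h.metadata_kv_count = read_le(buf.data() + 16, 8);
    return h;
}
```

A loader would then walk `metadata_kv_count` key/value pairs after the header and could choose a backend based on one of them; that part is omitted here because the relevant keys are not given in the post.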

Pre-quantised GGUFs (MIT): cstr/moonshine-streaming-small-GGUF

./build/bin/crispasr --backend moonshine-streaming \
    -m moonshine-streaming-small-q4_k.gguf --mic

Sibling sizes: -tiny, -medium. Offline siblings are at cstr/moonshine-{tiny,base}-GGUF, plus 6 language variants.
