GGUF + pure-C++ runtime in CrispASR — Moonshine streaming medium

#4
by cstr - opened

The streaming Moonshine variants run through CrispASR's moonshine-streaming backend (src/moonshine_streaming.cpp) — separate from offline Moonshine because the encoder is sliding-window + raw-waveform.

Same C++ binary, no Python. Auto-detected from GGUF metadata across tiny / small / medium sizes. KV-cached autoregressive decoder.

Pre-quantised GGUFs (MIT): cstr/moonshine-streaming-medium-GGUF

./build/bin/crispasr --backend moonshine-streaming \
    -m moonshine-streaming-medium-q4_k.gguf --mic

Sibling sizes: -tiny, -small. Offline siblings at cstr/moonshine-{tiny,base}-GGUF + 6 language variants.

Sign up or log in to comment