	---
	license: mit
	language:
	- en
	pipeline_tag: automatic-speech-recognition
	tags:
	- audio
	- speech-recognition
	- transcription
	- gguf
	- moonshine
	- streaming
	- lightweight
	library_name: ggml
	base_model: UsefulSensors/moonshine-streaming-tiny
	---

	# Moonshine Streaming Tiny -- GGUF

	GGUF conversions and quantisations of [`UsefulSensors/moonshine-streaming-tiny`](https://huggingface.co/UsefulSensors/moonshine-streaming-tiny) for use with [CrispStrobe/CrispASR](https://github.com/CrispStrobe/CrispASR).

	## Available variants

	| File | Quant | Size | Notes |
	|---|---|---|---|
	| `moonshine-streaming-tiny.gguf` | F32 | 168 MB | Full precision |
	| `moonshine-streaming-tiny-q4_k.gguf` | Q4_K | 31 MB | 4-bit k-quant, about 5.4x smaller than F32 |
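
One way to fetch a variant is sketched below. It builds direct-download URLs using the Hub's standard `resolve/main` pattern and pulls the files with `curl`. The helper names `hf_url` and `fetch_model` are made up for this example, and it assumes `tokenizer.bin` is published in this repository alongside the model files.

```bash
# Sketch: fetch one model variant plus the tokenizer into the same directory.
# Assumption: tokenizer.bin is hosted in the same repository as the GGUF files.
REPO="cstr/moonshine-streaming-tiny-GGUF"

# Print the direct-download URL for a file on the repository's main branch.
hf_url() {
  printf 'https://huggingface.co/%s/resolve/main/%s\n' "$REPO" "$1"
}

# Download a model file and tokenizer.bin into a destination directory.
#   $1: model file name (default: the Q4_K variant)
#   $2: destination directory (default: current directory)
fetch_model() {
  local model="${1:-moonshine-streaming-tiny-q4_k.gguf}"
  local dest="${2:-.}"
  local f
  mkdir -p "$dest" || return 1
  for f in "$model" tokenizer.bin; do
    curl -fL --retry 3 -o "$dest/$f" "$(hf_url "$f")" || return 1
  done
}
```

Calling `fetch_model` with no arguments downloads the Q4_K file into the current directory; `fetch_model moonshine-streaming-tiny.gguf models` would fetch the F32 file into `models/` instead.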

	## Model details

	- Architecture: Streaming encoder-decoder ASR, made up of a raw-waveform audio frontend (no mel spectrogram), a sliding-window transformer encoder (6 layers, 320-dim), and an autoregressive transformer decoder (6 layers, 320-dim, SiLU-gated MLP, partial RoPE)
	- Parameters: 34M
	- Languages: English
	- License: MIT
	- Source: [`UsefulSensors/moonshine-streaming-tiny`](https://huggingface.co/UsefulSensors/moonshine-streaming-tiny)
	- Designed for: Low-latency streaming ASR on edge devices

	## Usage with CrispASR

	```bash
	./build/bin/crispasr --backend moonshine-streaming -m moonshine-streaming-tiny-q4_k.gguf -f audio.wav
	```
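
To transcribe several files, the same command can be wrapped in a loop. The following is a minimal sketch: `transcribe_dir` is a hypothetical helper, and `CRISPASR_BIN` and `MODEL` are environment overrides invented for this example, not CrispASR options. Only the flags shown in the command above are used.

```bash
# Sketch: run the command above on every .wav file in a directory.
# CRISPASR_BIN and MODEL are hypothetical overrides for the binary and model paths.
transcribe_dir() {
  local dir="${1:-.}"
  local bin="${CRISPASR_BIN:-./build/bin/crispasr}"
  local model="${MODEL:-moonshine-streaming-tiny-q4_k.gguf}"
  local f
  for f in "$dir"/*.wav; do
    [ -e "$f" ] || continue   # skip the unexpanded glob when no .wav files exist
    echo "== $f"              # header line so each result can be matched to its file
    "$bin" --backend moonshine-streaming -m "$model" -f "$f" || return 1
  done
}
```

For example, `transcribe_dir recordings` processes `recordings/*.wav` in glob order and stops at the first file for which the binary returns a non-zero exit status.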

	## Notes

	- Tokenizer (`tokenizer.bin`) must be in the same directory as the model file
	- Streaming architecture: sliding-window attention with 80 ms of lookahead
	- Audio frontend processes raw waveform (no mel spectrogram needed)
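
The tokenizer requirement in the notes above can be checked before launching a run. Here is a small sketch of such a pre-flight check; `check_model` is a hypothetical helper and is not part of CrispASR.

```bash
# Sketch: verify that tokenizer.bin sits beside the model file.
# Returns 0 when both files exist, 1 otherwise (with a message on stderr).
check_model() {
  local model="$1"
  local dir
  dir="$(dirname "$model")"
  if [ ! -f "$model" ]; then
    echo "missing model file: $model" >&2
    return 1
  fi
  if [ ! -f "$dir/tokenizer.bin" ]; then
    echo "missing tokenizer: $dir/tokenizer.bin" >&2
    return 1
  fi
}
```

It can gate the actual invocation, for instance `check_model moonshine-streaming-tiny-q4_k.gguf && ./build/bin/crispasr --backend moonshine-streaming -m moonshine-streaming-tiny-q4_k.gguf -f audio.wav`.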