cstr
/

moonshine-tiny-GGUF

Automatic Speech Recognition

speech-recognition

Model card Files Files and versions

moonshine-tiny-GGUF / README.md

cstr's picture

Upload README.md with huggingface_hub

a476adc verified about 1 month ago

|

history blame contribute delete

1.36 kB

	---
	license: mit
	language:
	- en
	pipeline_tag: automatic-speech-recognition
	tags:
	- audio
	- speech-recognition
	- transcription
	- gguf
	- moonshine
	- lightweight
	library_name: ggml
	base_model: UsefulSensors/moonshine-tiny
	---

	# Moonshine Tiny -- GGUF

	GGUF conversions and quantisations of [`UsefulSensors/moonshine-tiny`](https://huggingface.co/UsefulSensors/moonshine-tiny) for use with [CrispStrobe/CrispASR](https://github.com/CrispStrobe/CrispASR).

	## Available variants

	\| File \| Quant \| Size \| Notes \|
	\|---\|---\|---\|---\|
	\| `moonshine-tiny.gguf` \| F32 \| 104 MB \| Full precision \|
	\| `moonshine-tiny-q8_0.gguf` \| Q8_0 \| 33 MB \| High quality \|
	\| `moonshine-tiny-q4_k.gguf` \| Q4_K \| 21 MB \| Best size/quality tradeoff \|

	All variants produce correct transcription on test audio.

	## Model details

	- Architecture: Conv1d stem + 6L transformer encoder + 6L transformer decoder (288d, 8 heads, partial RoPE, SiLU/GELU)
	- Parameters: 27M
	- Languages: English only
	- WER: 4.55% (LibriSpeech clean), 11.68% (Other)
	- Performance: 11.2x realtime on CPU (F32)
	- License: MIT
	- Source: [moonshine.cpp](https://github.com/csexton-ua/moonshine.cpp) (MIT)

	## Usage with CrispASR

	```bash
	./build/bin/crispasr -m moonshine-tiny-q4_k.gguf -f audio.wav
	./build/bin/crispasr --backend moonshine -m moonshine-tiny-q4_k.gguf -f audio.wav -osrt
	```