OpenASR commited on
Commit
06c21be
Β·
verified Β·
1 Parent(s): a8ebe76

Publish moonshine-tiny OpenASR packs (fp16/q8_0/q4_k)

Browse files
.gitattributes CHANGED
@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ moonshine-tiny-fp16.oasr filter=lfs diff=lfs merge=lfs -text
37
+ moonshine-tiny-q8_0.oasr filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,115 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ base_model: UsefulSensors/moonshine-tiny
4
+ pipeline_tag: automatic-speech-recognition
5
+ library_name: openasr
6
+ tags:
7
+ - automatic-speech-recognition
8
+ - speech-to-text
9
+ - openasr
10
+ - oasr
11
+ - moonshine
12
+ ---
13
+
14
+ <div align="center">
15
+
16
+ # Moonshine Tiny Β· OpenASR
17
+
18
+ **Tiny 27M-parameter English ASR built for real-time, on-device transcription**
19
+
20
+ [![License](https://img.shields.io/badge/license-MIT-2563eb.svg)](https://huggingface.co/UsefulSensors/moonshine-tiny)
21
+ [![Format](https://img.shields.io/badge/format-.oasr-7c3aed.svg)](https://github.com/QuintinShaw/OpenASR)
22
+ [![Runtime](https://img.shields.io/badge/runtime-OpenASR-111827.svg)](https://openasr.org)
23
+ [![Base model](https://img.shields.io/badge/base-moonshine--tiny-f59e0b.svg)](https://huggingface.co/UsefulSensors/moonshine-tiny)
24
+
25
+ Native speech-to-text in the **[OpenASR](https://github.com/QuintinShaw/OpenASR)** runtime β€”
26
+ engineered for peak performance on CPU & GPU, **no Python at inference time**.
27
+
28
+ </div>
29
+
30
+ ---
31
+
32
+ ## ✨ Highlights
33
+
34
+ - πŸͺΆ **Just 27M parameters** β€” the smallest Moonshine, sized for memory- and compute-constrained edge hardware
35
+ - ⚑ **Real-time on-device** β€” engineered by Useful Sensors for live transcription and voice commands on low-cost devices
36
+ - 🎯 **Accurate for its size** β€” beats similarly-sized ASR systems on standard English benchmarks (per the Moonshine paper)
37
+ - πŸ—£οΈ **English speech-to-text** β€” sequence-to-sequence ASR trained on 200K hours of audio
38
+ - πŸ¦€ **Native in OpenASR** β€” `.oasr` packs run with no Python at inference, engineered for peak performance on CPU & GPU
39
+
40
+ ## πŸš€ Quickstart
41
+
42
+ ```bash
43
+ # 1. Install the OpenASR CLI Β· https://openasr.org
44
+ # 2. Pull a build (pick a quant β€” see the table below)
45
+ openasr pull moonshine:tiny-q8
46
+
47
+ # 3. Transcribe
48
+ openasr transcribe audio.wav --model moonshine-tiny
49
+ ```
50
+
51
+ All builds for this model:
52
+
53
+ ```bash
54
+ openasr pull moonshine:tiny-fp16
55
+ openasr pull moonshine:tiny-q8
56
+ ```
57
+
58
+ ## πŸ“¦ Available builds
59
+
60
+ | Quant | File (`.oasr`) | Size | RAM peak | RTF Β· M1 CPU | RTF Β· M1 GPU | JFK Ξ”WER vs fp16 |
61
+ |:------|:---------------|-----:|---------:|-------------:|-------------:|-----------------:|
62
+ | fp16 | `moonshine-tiny-fp16.oasr` | 109 MB | 322 MB | 0.02Γ— | 0.03Γ— | 0.0% |
63
+ | q8_0 | `moonshine-tiny-q8_0.oasr` | 34 MB | 305 MB | 0.02Γ— | 0.03Γ— | 0.0% |
64
+
65
+ <sub>RTF = real-time factor on the fixed 11s JFK clip (**lower is faster**); RAM peak measured per pack
66
+ in an isolated subprocess. JFK Ξ”WER compares each quantized build's JFK transcript to this model's
67
+ fp16 JFK transcript, so it measures quantization drift rather than absolute recognition accuracy.
68
+ **q8_0** is the recommended default β€” near-reference quality at a fraction of the
69
+ footprint.</sub>
70
+
71
+ ## 🧠 About Moonshine Tiny
72
+
73
+ Moonshine Tiny is the smallest model in Useful Sensors' **Moonshine** family β€” a 27M-parameter,
74
+ sequence-to-sequence English speech-recognition model designed for **real-time, on-device
75
+ transcription** on hardware that is severely constrained in memory and compute. Trained on 200,000
76
+ hours of audio, it transcribes English speech to text and, despite its size, reports greater accuracy
77
+ than existing ASR systems of comparable scale on standard benchmarks. It targets developers building
78
+ live transcription and voice-command experiences on low-cost devices. Like other autoregressive ASR
79
+ models it can occasionally hallucinate or repeat on very short or clipped segments, so robust
80
+ in-domain evaluation is recommended before deployment. This OpenASR repo repackages the original
81
+ weights as `.oasr` packs that run natively in the OpenASR runtime β€” no Python at inference time. The
82
+ **q8_0** build is the recommended default (near-reference accuracy at roughly a third of the
83
+ footprint); **fp16** is for verification or maximum fidelity.
84
+
85
+ ## βš™οΈ How these packs were made
86
+
87
+ Converted from [UsefulSensors/moonshine-tiny](https://huggingface.co/UsefulSensors/moonshine-tiny) with the OpenASR importer:
88
+
89
+ ```bash
90
+ openasr model-pack import-moonshine-local <src> <out>.oasr \
91
+ --package-id moonshine-tiny --quantization {fp16,q8-0,q4-k}
92
+ ```
93
+
94
+ The `.oasr` container is GGUF-backed; packs use zero-copy mmap weight binding and graph
95
+ buffer reuse to keep peak memory low.
96
+
97
+ ## βš–οΈ License
98
+
99
+ These packs **inherit the upstream model's license: MIT**
100
+ ([source](https://huggingface.co/UsefulSensors/moonshine-tiny)). OpenASR packaging retains the upstream copyright and
101
+ NOTICE; the only modifications are format conversion and quantization.
102
+
103
+ ## πŸ™ Acknowledgements
104
+
105
+ This pack is a redistribution of **Moonshine Tiny**, created and open-sourced by **Useful Sensors**
106
+ ([UsefulSensors/moonshine-tiny](https://huggingface.co/UsefulSensors/moonshine-tiny)). All credit for
107
+ the original architecture, training, and weights belongs to them; the license is inherited from and
108
+ identical to the upstream model (MIT). Thank you to the Moonshine authors β€” Nat Jeffries, Evan King,
109
+ Manjunath Kudlur, Guy Nicholson, James Wang, and Pete Warden β€” for releasing their work openly.
110
+
111
+ ## πŸ”— Links
112
+
113
+ - πŸ¦€ **OpenASR** β€” <https://github.com/QuintinShaw/OpenASR>
114
+ - 🌐 **Website** β€” <https://openasr.org>
115
+ - πŸ€— **Upstream model** β€” [UsefulSensors/moonshine-tiny](https://huggingface.co/UsefulSensors/moonshine-tiny)
moonshine-tiny-fp16.oasr ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:08dc692fcfaec26ce597bb21189406052a68d6c428ea5c38e72a4e73afd3a21d
3
+ size 108862976
moonshine-tiny-q8_0.oasr ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5df858db11aca1aca8d4bb8ba1230e9e683da8292a46137f0993981b89c655a3
3
+ size 34360832