Publish moonshine-tiny OpenASR packs (fp16/q8_0/q4_k)
Browse files- .gitattributes +2 -0
- README.md +115 -0
- moonshine-tiny-fp16.oasr +3 -0
- moonshine-tiny-q8_0.oasr +3 -0
.gitattributes
CHANGED
|
@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
moonshine-tiny-fp16.oasr filter=lfs diff=lfs merge=lfs -text
|
| 37 |
+
moonshine-tiny-q8_0.oasr filter=lfs diff=lfs merge=lfs -text
|
README.md
ADDED
|
@@ -0,0 +1,115 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
base_model: UsefulSensors/moonshine-tiny
|
| 4 |
+
pipeline_tag: automatic-speech-recognition
|
| 5 |
+
library_name: openasr
|
| 6 |
+
tags:
|
| 7 |
+
- automatic-speech-recognition
|
| 8 |
+
- speech-to-text
|
| 9 |
+
- openasr
|
| 10 |
+
- oasr
|
| 11 |
+
- moonshine
|
| 12 |
+
---
|
| 13 |
+
|
| 14 |
+
<div align="center">
|
| 15 |
+
|
| 16 |
+
# Moonshine Tiny Β· OpenASR
|
| 17 |
+
|
| 18 |
+
**Tiny 27M-parameter English ASR built for real-time, on-device transcription**
|
| 19 |
+
|
| 20 |
+
[](https://huggingface.co/UsefulSensors/moonshine-tiny)
|
| 21 |
+
[](https://github.com/QuintinShaw/OpenASR)
|
| 22 |
+
[](https://openasr.org)
|
| 23 |
+
[](https://huggingface.co/UsefulSensors/moonshine-tiny)
|
| 24 |
+
|
| 25 |
+
Native speech-to-text in the **[OpenASR](https://github.com/QuintinShaw/OpenASR)** runtime β
|
| 26 |
+
engineered for peak performance on CPU & GPU, **no Python at inference time**.
|
| 27 |
+
|
| 28 |
+
</div>
|
| 29 |
+
|
| 30 |
+
---
|
| 31 |
+
|
| 32 |
+
## β¨ Highlights
|
| 33 |
+
|
| 34 |
+
- πͺΆ **Just 27M parameters** β the smallest Moonshine, sized for memory- and compute-constrained edge hardware
|
| 35 |
+
- β‘ **Real-time on-device** β engineered by Useful Sensors for live transcription and voice commands on low-cost devices
|
| 36 |
+
- π― **Accurate for its size** β beats similarly-sized ASR systems on standard English benchmarks (per the Moonshine paper)
|
| 37 |
+
- π£οΈ **English speech-to-text** β sequence-to-sequence ASR trained on 200K hours of audio
|
| 38 |
+
- π¦ **Native in OpenASR** β `.oasr` packs run with no Python at inference, engineered for peak performance on CPU & GPU
|
| 39 |
+
|
| 40 |
+
## π Quickstart
|
| 41 |
+
|
| 42 |
+
```bash
|
| 43 |
+
# 1. Install the OpenASR CLI Β· https://openasr.org
|
| 44 |
+
# 2. Pull a build (pick a quant β see the table below)
|
| 45 |
+
openasr pull moonshine:tiny-q8
|
| 46 |
+
|
| 47 |
+
# 3. Transcribe
|
| 48 |
+
openasr transcribe audio.wav --model moonshine-tiny
|
| 49 |
+
```
|
| 50 |
+
|
| 51 |
+
All builds for this model:
|
| 52 |
+
|
| 53 |
+
```bash
|
| 54 |
+
openasr pull moonshine:tiny-fp16
|
| 55 |
+
openasr pull moonshine:tiny-q8
|
| 56 |
+
```
|
| 57 |
+
|
| 58 |
+
## π¦ Available builds
|
| 59 |
+
|
| 60 |
+
| Quant | File (`.oasr`) | Size | RAM peak | RTF Β· M1 CPU | RTF Β· M1 GPU | JFK ΞWER vs fp16 |
|
| 61 |
+
|:------|:---------------|-----:|---------:|-------------:|-------------:|-----------------:|
|
| 62 |
+
| fp16 | `moonshine-tiny-fp16.oasr` | 109 MB | 322 MB | 0.02Γ | 0.03Γ | 0.0% |
|
| 63 |
+
| q8_0 | `moonshine-tiny-q8_0.oasr` | 34 MB | 305 MB | 0.02Γ | 0.03Γ | 0.0% |
|
| 64 |
+
|
| 65 |
+
<sub>RTF = real-time factor on the fixed 11s JFK clip (**lower is faster**); RAM peak measured per pack
|
| 66 |
+
in an isolated subprocess. JFK ΞWER compares each quantized build's JFK transcript to this model's
|
| 67 |
+
fp16 JFK transcript, so it measures quantization drift rather than absolute recognition accuracy.
|
| 68 |
+
**q8_0** is the recommended default β near-reference quality at a fraction of the
|
| 69 |
+
footprint.</sub>
|
| 70 |
+
|
| 71 |
+
## π§ About Moonshine Tiny
|
| 72 |
+
|
| 73 |
+
Moonshine Tiny is the smallest model in Useful Sensors' **Moonshine** family β a 27M-parameter,
|
| 74 |
+
sequence-to-sequence English speech-recognition model designed for **real-time, on-device
|
| 75 |
+
transcription** on hardware that is severely constrained in memory and compute. Trained on 200,000
|
| 76 |
+
hours of audio, it transcribes English speech to text and, despite its size, reports greater accuracy
|
| 77 |
+
than existing ASR systems of comparable scale on standard benchmarks. It targets developers building
|
| 78 |
+
live transcription and voice-command experiences on low-cost devices. Like other autoregressive ASR
|
| 79 |
+
models it can occasionally hallucinate or repeat on very short or clipped segments, so robust
|
| 80 |
+
in-domain evaluation is recommended before deployment. This OpenASR repo repackages the original
|
| 81 |
+
weights as `.oasr` packs that run natively in the OpenASR runtime β no Python at inference time. The
|
| 82 |
+
**q8_0** build is the recommended default (near-reference accuracy at roughly a third of the
|
| 83 |
+
footprint); **fp16** is for verification or maximum fidelity.
|
| 84 |
+
|
| 85 |
+
## βοΈ How these packs were made
|
| 86 |
+
|
| 87 |
+
Converted from [UsefulSensors/moonshine-tiny](https://huggingface.co/UsefulSensors/moonshine-tiny) with the OpenASR importer:
|
| 88 |
+
|
| 89 |
+
```bash
|
| 90 |
+
openasr model-pack import-moonshine-local <src> <out>.oasr \
|
| 91 |
+
--package-id moonshine-tiny --quantization {fp16,q8-0,q4-k}
|
| 92 |
+
```
|
| 93 |
+
|
| 94 |
+
The `.oasr` container is GGUF-backed; packs use zero-copy mmap weight binding and graph
|
| 95 |
+
buffer reuse to keep peak memory low.
|
| 96 |
+
|
| 97 |
+
## βοΈ License
|
| 98 |
+
|
| 99 |
+
These packs **inherit the upstream model's license: MIT**
|
| 100 |
+
([source](https://huggingface.co/UsefulSensors/moonshine-tiny)). OpenASR packaging retains the upstream copyright and
|
| 101 |
+
NOTICE; the only modifications are format conversion and quantization.
|
| 102 |
+
|
| 103 |
+
## π Acknowledgements
|
| 104 |
+
|
| 105 |
+
This pack is a redistribution of **Moonshine Tiny**, created and open-sourced by **Useful Sensors**
|
| 106 |
+
([UsefulSensors/moonshine-tiny](https://huggingface.co/UsefulSensors/moonshine-tiny)). All credit for
|
| 107 |
+
the original architecture, training, and weights belongs to them; the license is inherited from and
|
| 108 |
+
identical to the upstream model (MIT). Thank you to the Moonshine authors β Nat Jeffries, Evan King,
|
| 109 |
+
Manjunath Kudlur, Guy Nicholson, James Wang, and Pete Warden β for releasing their work openly.
|
| 110 |
+
|
| 111 |
+
## π Links
|
| 112 |
+
|
| 113 |
+
- π¦ **OpenASR** β <https://github.com/QuintinShaw/OpenASR>
|
| 114 |
+
- π **Website** β <https://openasr.org>
|
| 115 |
+
- π€ **Upstream model** β [UsefulSensors/moonshine-tiny](https://huggingface.co/UsefulSensors/moonshine-tiny)
|
moonshine-tiny-fp16.oasr
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:08dc692fcfaec26ce597bb21189406052a68d6c428ea5c38e72a4e73afd3a21d
|
| 3 |
+
size 108862976
|
moonshine-tiny-q8_0.oasr
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5df858db11aca1aca8d4bb8ba1230e9e683da8292a46137f0993981b89c655a3
|
| 3 |
+
size 34360832
|