Update README
README.md
CHANGED
|
@@ -5,12 +5,50 @@ language:
|
|
| 5 |
base_model:
|
| 6 |
- UsefulSensors/moonshine
|
| 7 |
pipeline_tag: automatic-speech-recognition
| 8 |
---
|
| 9 |
|
| 10 |
-
# Moonshine
| 11 |
|
| 12 |
-
Moonshine models for regression/performance testing. Derived from `UsefulSensors/moonshine`.
|
| 13 |
|
| 14 |
## Repository contents
|
| 15 |
- `models/{bf16,quantized}/onnx`: Static ONNX models exported from [torq-tools](https://github.com/synaptics-torq/torq-tools).
|
| 16 |
- `models/{bf16}/vmfb`: Compiled VMFB binaries for use in applications.
| 5 |
base_model:
|
| 6 |
- UsefulSensors/moonshine
|
| 7 |
pipeline_tag: automatic-speech-recognition
|
| 8 |
+
library_name: torq
|
| 9 |
+
tags:
|
| 10 |
+
- Automatic Speech Recognition
|
| 11 |
+
- Astra SL
|
| 12 |
+
- SL2600
|
| 13 |
+
- MPU
|
| 14 |
---
|
| 15 |
|
| 16 |
+
# Moonshine Tiny
|
| 17 |
+
|
| 18 |
+
## Model Overview
|
| 19 |
+
|
| 20 |
+
Moonshine is a high-efficiency automatic speech recognition (ASR) model designed for real-time speech recognition. Unlike Whisper, which processes audio in fixed 30-second chunks, Moonshine uses a variable-length architecture, so compute scales with the actual duration of the speech received.
|
| 21 |
+
|
| 22 |
+
Useful Sensors developed Moonshine and released the English model as open source. Two sizes are available, Base and Tiny; the Tiny version has 27M parameters.
|
| 23 |
+
|
| 24 |
+
|
| 25 |
+
Moonshine Tiny has been optimized for the Synaptics Astra™ **SL2610-Series processors** with Torq NPU.
|
| 26 |
|
| 27 |
|
| 28 |
## Repository contents
|
| 29 |
- `models/{bf16,quantized}/onnx`: Static ONNX models exported from [torq-tools](https://github.com/synaptics-torq/torq-tools).
|
| 30 |
- `models/{bf16}/vmfb`: Compiled VMFB binaries for use in applications.
|
| 31 |
+
|
| 32 |
+
|
| 33 |
+
## Model Features
|
| 34 |
+
- **Model Type:** Automatic Speech Recognition
|
| 35 |
+
- **Input:** Raw 16 kHz mono waveform (1D array of floats), up to 30 seconds long
|
| 36 |
+
- **Output:** Sequence of token IDs (integers)
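
The input format listed above can be illustrated with a minimal sketch that converts 16-bit PCM bytes into a float waveform and caps it at 30 seconds. The helper name `prepare_waveform` and the scaling by 32768 are assumptions made for illustration, not part of this repository. The preprocessing the compiled model actually expects may differ.

```python
import array
import sys

SAMPLE_RATE_HZ = 16_000   # input sample rate stated in this README
MAX_DURATION_S = 30       # maximum clip length stated in this README
MAX_SAMPLES = SAMPLE_RATE_HZ * MAX_DURATION_S


def prepare_waveform(pcm16_bytes: bytes) -> list[float]:
    """Hypothetical helper: little-endian 16-bit mono PCM -> floats in [-1.0, 1.0)."""
    samples = array.array("h")        # signed 16-bit integers
    samples.frombytes(pcm16_bytes)    # raises ValueError on an odd byte count
    if sys.byteorder == "big":
        samples.byteswap()            # the input is assumed little-endian
    # Keep at most 30 seconds of audio, then scale to floats.
    return [s / 32768.0 for s in samples[:MAX_SAMPLES]]
```

Audio recorded at another sample rate or with more than one channel would need resampling and downmixing first, which this sketch does not cover.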
|
| 37 |
+
|
| 38 |
+
|
| 39 |
+
## Deployment
|
| 40 |
+
|
| 41 |
+
The compiled model files are available for download on Hugging Face at [Synaptics/Moonshine](https://huggingface.co/Synaptics/Moonshine).
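
As a rough sketch, the compiled binaries could be fetched with the `huggingface_hub` Python package. It assumes the package is installed and that the files follow the `models/bf16/vmfb` layout listed under "Repository contents". The helper names `vmfb_patterns` and `download_vmfb` are hypothetical.

```python
def vmfb_patterns(precision: str = "bf16") -> list[str]:
    """Hypothetical helper: glob patterns for the compiled VMFB files."""
    # Assumes the models/<precision>/vmfb layout described in this README.
    return [f"models/{precision}/vmfb/*"]


def download_vmfb(repo_id: str = "Synaptics/Moonshine", precision: str = "bf16") -> str:
    """Download only the VMFB binaries and return the local snapshot path."""
    # Imported lazily so this module loads even without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=repo_id, allow_patterns=vmfb_patterns(precision))
```

Calling `download_vmfb()` requires network access. Restricting the download with `allow_patterns` avoids pulling the ONNX exports when only the compiled binaries are needed.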
|
| 42 |
+
|
| 43 |
+
A usage tutorial will be made available at the [Synaptics AI Developer Zone](https://developer.synaptics.com/docs/sl/sl2600/introduction).
|
| 44 |
+
|
| 45 |
+
|
| 46 |
+
## License
|
| 47 |
+
|
| 48 |
+
Both the source model and the compiled model for on-device deployment are licensed under the [MIT License](https://huggingface.co/datasets/choosealicense/licenses/blob/main/markdown/mit.md).
|
| 49 |
+
|
| 50 |
+
## Learn More
|
| 51 |
+
|
| 52 |
+
- [Synaptics AI Developer Zone](https://developer.synaptics.com?utm_source=hf): Get started with documentation, tutorials and resources for your Edge AI journey.
|
| 53 |
+
- [Astra Support Portal](https://synacsm.atlassian.net/servicedesk/customer/portal/543?utm_source=hf): Connect with our engineering team and community.
|
| 54 |
+
|