lkeller-synaptics commited on
Commit
fc05284
·
verified ·
1 Parent(s): 83c119e

Update README

Browse files
Files changed (1) hide show
  1. README.md +40 -2
README.md CHANGED
@@ -5,12 +5,50 @@ language:
5
  base_model:
6
  - UsefulSensors/moonshine
7
  pipeline_tag: automatic-speech-recognition
 
 
 
 
 
 
8
  ---
9
 
10
- # Moonshine Torq test models
 
 
 
 
 
 
 
 
 
11
 
12
- Moonshine models for regression/performance testing. Derived from `UsefulSensors/moonshine`.
13
 
14
  ## Repository contents
15
  - `models/{bf16,quantized}/onnx`: Static ONNX models exported from [torq-tools](https://github.com/synaptics-torq/torq-tools).
16
  - `models/{bf16}/vmfb`: Compiled VMFB binaries for use in applications.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5
  base_model:
6
  - UsefulSensors/moonshine
7
  pipeline_tag: automatic-speech-recognition
8
+ library_name: torq
9
+ tags:
10
+ - Automatic Speech Recognition
11
+ - Astra SL
12
+ - SL2600
13
+ - MPU
14
  ---
15
 
16
+ # Moonshine Tiny
17
+
18
+ ## Model Overview
19
+
20
+ Moonshine is a high-efficiency automatic speech recognition (ASR) model designed specifically for real-time speech recognition. Unlike Whisper, which processes audio in fixed 30-second chunks, Moonshine uses a variable-length architecture that only computes the actual duration of the speech received.
21
+
22
+ Useful Sensors developed Moonshine and released the English model as open-source. There are 2 models of different sizes and capabilities - base and tiny. The tiny version utilizes 27M parameters.
23
+
24
+
25
+ Moonshine Tiny has been optimized for the Synaptics Astra™ **SL2610-Series processors** with Torq NPU.
26
 
 
27
 
28
  ## Repository contents
29
  - `models/{bf16,quantized}/onnx`: Static ONNX models exported from [torq-tools](https://github.com/synaptics-torq/torq-tools).
30
  - `models/{bf16}/vmfb`: Compiled VMFB binaries for use in applications.
31
+
32
+
33
+ ## Model Features
34
+ - **Model Type:** Automatic Speech Recognition
35
+ - **Input:** Raw waveform (1D array of floats) 16kHz mono audio up to 30 seconds
36
+ - **Output:** Sequence of token IDs (integers)
37
+
38
+
39
+ ## Deployment
40
+
41
+ The compiled model files are available for download on Huggingface at [Synaptics/Moonshine](https://huggingface.co/Synaptics/Moonshine).
42
+
43
+ Usage tutorial to be available in the future at [Synaptics AI Developer Zone](https://developer.synaptics.com/docs/sl/sl2600/introduction).
44
+
45
+
46
+ ## License
47
+
48
+ Both the source model and the compiled model for on-device deployment are licensed under [MIT License](https://huggingface.co/datasets/choosealicense/licenses/blob/main/markdown/mit.md).
49
+
50
+ ## Learn More
51
+
52
+ - [Synaptics AI Developer Zone](https://developer.synaptics.com?utm_source=hf): Get started with documentation, tutorials and resources for your Edge AI journey.
53
+ - [Astra Support Portal](https://synacsm.atlassian.net/servicedesk/customer/portal/543?utm_source=hf): Connect with our engineering team and community.
54
+