v0.34.0
See https://github.com/quic/ai-hub-models/releases/v0.34.0 for changelog.
README.md
CHANGED
|
@@ -24,6 +24,7 @@ More details on model performance across various devices, can be found
|
|
| 24 |
[here](https://aihub.qualcomm.com/models/whisper_small_en).
|
| 25 |
|
| 26 |
|
|
|
|
| 27 |
### Model Details
|
| 28 |
|
| 29 |
- **Model Type:** Model_use_case.speech_recognition
|
|
@@ -38,28 +39,28 @@ More details on model performance across various devices, can be found
|
|
| 38 |
|
| 39 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
| 40 |
|---|---|---|---|---|---|---|---|---|
|
| 41 |
-
| WhisperEncoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE |
|
| 42 |
-
| WhisperEncoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE |
|
| 43 |
-
| WhisperEncoderInf | float |
|
| 44 |
-
| WhisperEncoderInf | float |
|
| 45 |
-
| WhisperEncoderInf | float |
|
| 46 |
-
| WhisperEncoderInf | float |
|
| 47 |
-
| WhisperEncoderInf | float |
|
| 48 |
-
| WhisperEncoderInf | float |
|
| 49 |
-
| WhisperEncoderInf | float |
|
| 50 |
-
| WhisperEncoderInf | float | Samsung Galaxy
|
| 51 |
-
| WhisperEncoderInf | float |
|
| 52 |
-
| WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 71.
|
| 53 |
-
| WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE |
|
| 54 |
-
| WhisperDecoderInf | float |
|
| 55 |
-
| WhisperDecoderInf | float |
|
| 56 |
-
| WhisperDecoderInf | float |
|
| 57 |
-
| WhisperDecoderInf | float |
|
| 58 |
-
| WhisperDecoderInf | float |
|
| 59 |
-
| WhisperDecoderInf | float |
|
| 60 |
-
| WhisperDecoderInf | float |
|
| 61 |
-
| WhisperDecoderInf | float | Samsung Galaxy
|
| 62 |
-
| WhisperDecoderInf | float |
|
| 63 |
|
| 64 |
|
| 65 |
|
|
@@ -117,26 +118,7 @@ device. This script does the following:
|
|
| 117 |
```bash
|
| 118 |
python -m qai_hub_models.models.whisper_small_en.export
|
| 119 |
```
|
| 120 |
-
|
| 121 |
-
Profiling Results
|
| 122 |
-
------------------------------------------------------------
|
| 123 |
-
WhisperEncoderInf
|
| 124 |
-
Device : cs_8275 (ANDROID 14)
|
| 125 |
-
Runtime : TFLITE
|
| 126 |
-
Estimated inference time (ms) : 3230.8
|
| 127 |
-
Estimated peak memory usage (MB): [89, 122]
|
| 128 |
-
Total # Ops : 911
|
| 129 |
-
Compute Unit(s) : npu (0 ops) gpu (900 ops) cpu (11 ops)
|
| 130 |
-
|
| 131 |
-
------------------------------------------------------------
|
| 132 |
-
WhisperDecoderInf
|
| 133 |
-
Device : cs_8275 (ANDROID 14)
|
| 134 |
-
Runtime : TFLITE
|
| 135 |
-
Estimated inference time (ms) : 71.0
|
| 136 |
-
Estimated peak memory usage (MB): [16, 384]
|
| 137 |
-
Total # Ops : 2573
|
| 138 |
-
Compute Unit(s) : npu (2573 ops) gpu (0 ops) cpu (0 ops)
|
| 139 |
-
```
|
| 140 |
|
| 141 |
|
| 142 |
## How does this work?
|
|
|
|
| 24 |
[here](https://aihub.qualcomm.com/models/whisper_small_en).
|
| 25 |
|
| 26 |
|
| 27 |
+
|
| 28 |
### Model Details
|
| 29 |
|
| 30 |
- **Model Type:** Model_use_case.speech_recognition
|
|
|
|
| 39 |
|
| 40 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
| 41 |
|---|---|---|---|---|---|---|---|---|
|
| 42 |
+
| WhisperEncoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 3232.589 ms | 109 - 142 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 43 |
+
| WhisperEncoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 907.681 ms | 110 - 207 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 44 |
+
| WhisperEncoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 1270.067 ms | 93 - 125 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 45 |
+
| WhisperEncoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 3232.589 ms | 109 - 142 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 46 |
+
| WhisperEncoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 1123.233 ms | 22 - 77 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 47 |
+
| WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 656.611 ms | 109 - 142 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 48 |
+
| WhisperEncoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 685.947 ms | 97 - 119 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 49 |
+
| WhisperEncoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 1270.067 ms | 93 - 125 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 50 |
+
| WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 696.19 ms | 65 - 182 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 51 |
+
| WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 531.086 ms | 108 - 200 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 52 |
+
| WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 463.534 ms | 109 - 139 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 53 |
+
| WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 71.04 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 54 |
+
| WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 47.2 ms | 16 - 397 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 55 |
+
| WhisperDecoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 49.446 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 56 |
+
| WhisperDecoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 71.04 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 57 |
+
| WhisperDecoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 49.535 ms | 16 - 48 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 58 |
+
| WhisperDecoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 42.734 ms | 16 - 353 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 59 |
+
| WhisperDecoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 49.183 ms | 11 - 43 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 60 |
+
| WhisperDecoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 49.446 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 61 |
+
| WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 49.082 ms | 8 - 43 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 62 |
+
| WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 43.076 ms | 16 - 425 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 63 |
+
| WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 36.925 ms | 15 - 386 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 64 |
|
| 65 |
|
| 66 |
|
|
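Reviewer note: one rough way to read the updated rows is to combine the encoder and decoder timings per device. The sketch below is an illustration only and is not part of this commit. It parses four rows copied from the new table and estimates a transcription latency, assuming the encoder runs once per audio chunk and the decoder runs once per generated token, as is usual for autoregressive decoding. The helper names (`parse_rows`, `latency_by_device`, `estimate_total_ms`) and the token count of 20 are hypothetical.

```python
from collections import defaultdict

# Four rows copied verbatim from the updated table; the other rows have the same shape.
ROWS = """
| WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 531.086 ms | 108 - 200 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
| WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 463.534 ms | 109 - 139 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
| WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 43.076 ms | 16 - 425 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
| WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 36.925 ms | 15 - 386 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
"""


def parse_rows(table_text):
    """Yield (model, device, latency_ms, compute_unit) for each Markdown table row."""
    for line in table_text.strip().splitlines():
        # Drop the outer pipes, then split the row into its nine cells.
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        model, device, latency, unit = cells[0], cells[2], cells[5], cells[7]
        # Latency cells look like "531.086 ms"; keep only the number.
        yield model, device, float(latency.split()[0]), unit


def latency_by_device(table_text):
    """Group per-component latencies as {device: {model: latency_ms}}."""
    per_device = defaultdict(dict)
    for model, device, latency_ms, _unit in parse_rows(table_text):
        per_device[device][model] = latency_ms
    return dict(per_device)


def estimate_total_ms(timings, n_tokens):
    """Assumption: one encoder pass plus one decoder pass per generated token."""
    return timings["WhisperEncoderInf"] + n_tokens * timings["WhisperDecoderInf"]


for device, timings in latency_by_device(ROWS).items():
    total = estimate_total_ms(timings, n_tokens=20)
    print(f"{device}: about {total:.1f} ms for 20 tokens")
```

This estimate ignores pre- and post-processing and any host-side overhead, so treat it as a lower bound rather than a measured end-to-end figure.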
|
|
| 118 |
```bash
|
| 119 |
python -m qai_hub_models.models.whisper_small_en.export
|
| 120 |
```
|
| 121 |
+
|
| 122 |
|
| 123 |
|
| 124 |
## How does this work?
|