qaihm-bot commited on
Commit
6ea879e
·
verified ·
1 Parent(s): 32337a7

See https://github.com/quic/ai-hub-models/releases/v0.34.0 for changelog.

Files changed (1) hide show
  1. README.md +24 -42
README.md CHANGED
@@ -24,6 +24,7 @@ More details on model performance across various devices, can be found
24
  [here](https://aihub.qualcomm.com/models/whisper_small_en).
25
 
26
 
 
27
  ### Model Details
28
 
29
  - **Model Type:** Model_use_case.speech_recognition
@@ -38,28 +39,28 @@ More details on model performance across various devices, can be found
38
 
39
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
40
  |---|---|---|---|---|---|---|---|---|
41
- | WhisperEncoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 3230.789 ms | 89 - 122 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
42
- | WhisperEncoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 965.99 ms | 39 - 138 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
43
- | WhisperEncoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 675.072 ms | 45 - 119 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
44
- | WhisperEncoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 1268.791 ms | 108 - 140 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
45
- | WhisperEncoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 3230.789 ms | 89 - 122 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
46
- | WhisperEncoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 677.316 ms | 103 - 165 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
47
- | WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 655.714 ms | 109 - 142 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
48
- | WhisperEncoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 856.713 ms | 90 - 254 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
49
- | WhisperEncoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 1268.791 ms | 108 - 140 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
50
- | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 686.994 ms | 85 - 167 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
51
- | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 518.045 ms | 109 - 201 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
52
- | WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 71.015 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
53
- | WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 49.307 ms | 16 - 397 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
54
- | WhisperDecoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 49.176 ms | 5 - 43 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
55
- | WhisperDecoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 49.431 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
56
- | WhisperDecoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 71.015 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
57
- | WhisperDecoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 51.019 ms | 10 - 42 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
58
- | WhisperDecoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 42.717 ms | 11 - 349 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
59
- | WhisperDecoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 48.824 ms | 16 - 51 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
60
- | WhisperDecoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 49.431 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
61
- | WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 50.949 ms | 16 - 49 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
62
- | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 44.926 ms | 16 - 419 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
63
 
64
 
65
 
@@ -117,26 +118,7 @@ device. This script does the following:
117
  ```bash
118
  python -m qai_hub_models.models.whisper_small_en.export
119
  ```
120
- ```
121
- Profiling Results
122
- ------------------------------------------------------------
123
- WhisperEncoderInf
124
- Device : cs_8275 (ANDROID 14)
125
- Runtime : TFLITE
126
- Estimated inference time (ms) : 3230.8
127
- Estimated peak memory usage (MB): [89, 122]
128
- Total # Ops : 911
129
- Compute Unit(s) : npu (0 ops) gpu (900 ops) cpu (11 ops)
130
-
131
- ------------------------------------------------------------
132
- WhisperDecoderInf
133
- Device : cs_8275 (ANDROID 14)
134
- Runtime : TFLITE
135
- Estimated inference time (ms) : 71.0
136
- Estimated peak memory usage (MB): [16, 384]
137
- Total # Ops : 2573
138
- Compute Unit(s) : npu (2573 ops) gpu (0 ops) cpu (0 ops)
139
- ```
140
 
141
 
142
  ## How does this work?
 
24
  [here](https://aihub.qualcomm.com/models/whisper_small_en).
25
 
26
 
27
+
28
  ### Model Details
29
 
30
  - **Model Type:** Model_use_case.speech_recognition
 
39
 
40
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
41
  |---|---|---|---|---|---|---|---|---|
42
+ | WhisperEncoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 3232.589 ms | 109 - 142 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
43
+ | WhisperEncoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 907.681 ms | 110 - 207 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
44
+ | WhisperEncoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 1270.067 ms | 93 - 125 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
45
+ | WhisperEncoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 3232.589 ms | 109 - 142 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
46
+ | WhisperEncoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 1123.233 ms | 22 - 77 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
47
+ | WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 656.611 ms | 109 - 142 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
48
+ | WhisperEncoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 685.947 ms | 97 - 119 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
49
+ | WhisperEncoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 1270.067 ms | 93 - 125 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
50
+ | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 696.19 ms | 65 - 182 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
51
+ | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 531.086 ms | 108 - 200 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
52
+ | WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 463.534 ms | 109 - 139 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
53
+ | WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 71.04 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
54
+ | WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 47.2 ms | 16 - 397 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
55
+ | WhisperDecoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 49.446 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
56
+ | WhisperDecoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 71.04 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
57
+ | WhisperDecoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 49.535 ms | 16 - 48 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
58
+ | WhisperDecoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 42.734 ms | 16 - 353 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
59
+ | WhisperDecoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 49.183 ms | 11 - 43 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
60
+ | WhisperDecoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 49.446 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
61
+ | WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 49.082 ms | 8 - 43 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
62
+ | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 43.076 ms | 16 - 425 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
63
+ | WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 36.925 ms | 15 - 386 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
64
 
65
 
66
 
 
118
  ```bash
119
  python -m qai_hub_models.models.whisper_small_en.export
120
  ```
121
+
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
122
 
123
 
124
  ## How does this work?