qaihm-bot committed · Commit d6d71e0 · verified · 1 Parent(s): 55dfcf6

See https://github.com/quic/ai-hub-models/releases/v0.34.0 for changelog.

README.md CHANGED
@@ -24,6 +24,7 @@ More details on model performance across various devices, can be found
24
  [here](https://aihub.qualcomm.com/models/whisper_base_en).
25
 
26
 
 
27
  ### Model Details
28
 
29
  - **Model Type:** Model_use_case.speech_recognition
@@ -38,58 +39,58 @@ More details on model performance across various devices, can be found
38
 
39
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
40
  |---|---|---|---|---|---|---|---|---|
41
- | WhisperEncoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 839.092 ms | 37 - 60 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
42
  | WhisperEncoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 315.193 ms | 1 - 1399 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
43
- | WhisperEncoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 277.305 ms | 38 - 89 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
44
- | WhisperEncoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 200.519 ms | 0 - 84 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
45
  | WhisperEncoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 205.831 ms | 0 - 358 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
46
- | WhisperEncoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 353.451 ms | 38 - 62 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
47
  | WhisperEncoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 204.363 ms | 0 - 1398 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
48
- | WhisperEncoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 839.092 ms | 37 - 60 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
49
  | WhisperEncoderInf | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 315.193 ms | 1 - 1399 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
50
- | WhisperEncoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 221.305 ms | 0 - 69 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
51
  | WhisperEncoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 167.203 ms | 0 - 355 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
52
- | WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 189.893 ms | 38 - 68 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
53
  | WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 217.911 ms | 1 - 1392 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
54
- | WhisperEncoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 203.378 ms | 0 - 69 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
55
  | WhisperEncoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 161.769 ms | 0 - 357 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
56
- | WhisperEncoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 353.451 ms | 38 - 62 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
57
  | WhisperEncoderInf | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 204.363 ms | 0 - 1398 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
58
- | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 201.716 ms | 0 - 68 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
59
  | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN_DLC | 218.463 ms | 0 - 355 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
60
  | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 259.434 ms | 10 - 569 MB | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.onnx) |
61
- | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 176.492 ms | 37 - 83 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
62
  | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 142.136 ms | 0 - 1370 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
63
  | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 190.438 ms | 92 - 1646 MB | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.onnx) |
64
- | WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 160.081 ms | 39 - 67 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
65
  | WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_DLC | 115.265 ms | 1 - 1377 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
66
  | WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 150.849 ms | 90 - 1645 MB | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.onnx) |
67
  | WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 162.983 ms | 136 - 136 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
68
  | WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 207.702 ms | 133 - 133 MB | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.onnx) |
69
- | WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 16.579 ms | 5 - 137 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
70
  | WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 6.813 ms | 11 - 80 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
71
- | WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 10.759 ms | 6 - 136 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
72
- | WhisperDecoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 9.549 ms | 5 - 39 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
73
  | WhisperDecoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 4.071 ms | 20 - 47 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
74
- | WhisperDecoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 10.754 ms | 0 - 132 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
75
  | WhisperDecoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 4.864 ms | 19 - 83 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
76
- | WhisperDecoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 16.579 ms | 5 - 137 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
77
  | WhisperDecoderInf | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 6.813 ms | 11 - 80 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
78
- | WhisperDecoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 9.651 ms | 5 - 41 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
79
  | WhisperDecoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 4.071 ms | 20 - 46 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
80
- | WhisperDecoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 11.602 ms | 5 - 128 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
81
  | WhisperDecoderInf | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 5.223 ms | 11 - 70 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
82
- | WhisperDecoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 9.717 ms | 5 - 34 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
83
  | WhisperDecoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 4.036 ms | 20 - 45 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
84
- | WhisperDecoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 10.754 ms | 0 - 132 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
85
  | WhisperDecoderInf | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 4.864 ms | 19 - 83 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
86
- | WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 9.691 ms | 5 - 35 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
87
  | WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN_DLC | 4.062 ms | 20 - 43 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
88
  | WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 10.009 ms | 0 - 142 MB | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.onnx) |
89
- | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 7.411 ms | 5 - 148 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
90
  | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 3.149 ms | 16 - 91 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
91
  | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 8.306 ms | 56 - 183 MB | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.onnx) |
92
- | WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 7.378 ms | 5 - 134 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
93
  | WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_DLC | 2.704 ms | 19 - 90 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
94
  | WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 7.631 ms | 56 - 166 MB | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.onnx) |
95
  | WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 3.752 ms | 231 - 231 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
@@ -151,26 +152,7 @@ device. This script does the following:
151
  ```bash
152
  python -m qai_hub_models.models.whisper_base_en.export
153
  ```
154
- ```
155
- Profiling Results
156
- ------------------------------------------------------------
157
- WhisperEncoderInf
158
- Device : cs_8275 (ANDROID 14)
159
- Runtime : TFLITE
160
- Estimated inference time (ms) : 839.1
161
- Estimated peak memory usage (MB): [37, 60]
162
- Total # Ops : 419
163
- Compute Unit(s) : npu (0 ops) gpu (408 ops) cpu (11 ops)
164
-
165
- ------------------------------------------------------------
166
- WhisperDecoderInf
167
- Device : cs_8275 (ANDROID 14)
168
- Runtime : TFLITE
169
- Estimated inference time (ms) : 16.6
170
- Estimated peak memory usage (MB): [5, 137]
171
- Total # Ops : 983
172
- Compute Unit(s) : npu (983 ops) gpu (0 ops) cpu (0 ops)
173
- ```
174
 
175
 
176
  ## How does this work?
 
24
  [here](https://aihub.qualcomm.com/models/whisper_base_en).
25
 
26
 
27
+
28
  ### Model Details
29
 
30
  - **Model Type:** Model_use_case.speech_recognition
 
39
 
40
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
41
  |---|---|---|---|---|---|---|---|---|
42
+ | WhisperEncoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 853.873 ms | 36 - 61 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
43
  | WhisperEncoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 315.193 ms | 1 - 1399 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
44
+ | WhisperEncoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 275.493 ms | 38 - 87 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
45
+ | WhisperEncoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 206.249 ms | 0 - 54 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
46
  | WhisperEncoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 205.831 ms | 0 - 358 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
47
+ | WhisperEncoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 352.898 ms | 38 - 63 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
48
  | WhisperEncoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 204.363 ms | 0 - 1398 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
49
+ | WhisperEncoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 853.873 ms | 36 - 61 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
50
  | WhisperEncoderInf | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 315.193 ms | 1 - 1399 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
51
+ | WhisperEncoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 204.901 ms | 0 - 58 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
52
  | WhisperEncoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 167.203 ms | 0 - 355 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
53
+ | WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 187.316 ms | 38 - 69 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
54
  | WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 217.911 ms | 1 - 1392 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
55
+ | WhisperEncoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 202.667 ms | 0 - 75 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
56
  | WhisperEncoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 161.769 ms | 0 - 357 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
57
+ | WhisperEncoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 352.898 ms | 38 - 63 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
58
  | WhisperEncoderInf | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 204.363 ms | 0 - 1398 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
59
+ | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 219.226 ms | 0 - 60 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
60
  | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN_DLC | 218.463 ms | 0 - 355 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
61
  | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 259.434 ms | 10 - 569 MB | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.onnx) |
62
+ | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 160.892 ms | 39 - 81 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
63
  | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 142.136 ms | 0 - 1370 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
64
  | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 190.438 ms | 92 - 1646 MB | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.onnx) |
65
+ | WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 133.952 ms | 37 - 66 MB | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
66
  | WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_DLC | 115.265 ms | 1 - 1377 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
67
  | WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 150.849 ms | 90 - 1645 MB | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.onnx) |
68
  | WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 162.983 ms | 136 - 136 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
69
  | WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 207.702 ms | 133 - 133 MB | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.onnx) |
70
+ | WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 16.58 ms | 0 - 134 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
71
  | WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 6.813 ms | 11 - 80 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
72
+ | WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 10.663 ms | 5 - 136 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
73
+ | WhisperDecoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 9.673 ms | 5 - 38 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
74
  | WhisperDecoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 4.071 ms | 20 - 47 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
75
+ | WhisperDecoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 10.824 ms | 0 - 132 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
76
  | WhisperDecoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 4.864 ms | 19 - 83 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
77
+ | WhisperDecoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 16.58 ms | 0 - 134 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
78
  | WhisperDecoderInf | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 6.813 ms | 11 - 80 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
79
+ | WhisperDecoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 9.675 ms | 5 - 38 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
80
  | WhisperDecoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 4.071 ms | 20 - 46 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
81
+ | WhisperDecoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 11.666 ms | 5 - 129 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
82
  | WhisperDecoderInf | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 5.223 ms | 11 - 70 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
83
+ | WhisperDecoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 9.637 ms | 5 - 41 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
84
  | WhisperDecoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 4.036 ms | 20 - 45 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
85
+ | WhisperDecoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 10.824 ms | 0 - 132 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
86
  | WhisperDecoderInf | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 4.864 ms | 19 - 83 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
87
+ | WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 9.65 ms | 6 - 35 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
88
  | WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN_DLC | 4.062 ms | 20 - 43 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
89
  | WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 10.009 ms | 0 - 142 MB | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.onnx) |
90
+ | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 7.488 ms | 5 - 149 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
91
  | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 3.149 ms | 16 - 91 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
92
  | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 8.306 ms | 56 - 183 MB | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.onnx) |
93
+ | WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 7.381 ms | 4 - 133 MB | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.tflite) |
94
  | WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_DLC | 2.704 ms | 19 - 90 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
95
  | WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 7.631 ms | 56 - 166 MB | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.onnx) |
96
  | WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 3.752 ms | 231 - 231 MB | NPU | [Whisper-Base-En.dlc](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/Whisper-Base-En.dlc) |
 
152
  ```bash
153
  python -m qai_hub_models.models.whisper_base_en.export
154
  ```
155
+
156
 
157
 
158
  ## How does this work?
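
The effect of this commit on the TFLITE rows can be quantified by pairing each removed row with the row that replaces it. The sketch below is illustrative only: `parse_row`, `row_key`, and `latency_changes` are hypothetical helper names, the two row pairs are copied from the hunks above, and the Target Model cell is shortened to the file name for readability.

```python
# Column names follow the table header in the README diff.
HEADER = ["model", "precision", "device", "chipset", "runtime",
          "inference_ms", "memory_mb", "compute_unit", "target"]

# Two removed/added row pairs taken from the diff (Target Model cell shortened).
REMOVED = [
    "- | WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 160.081 ms | 39 - 67 MB | GPU | Whisper-Base-En.tflite |",
    "- | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 201.716 ms | 0 - 68 MB | GPU | Whisper-Base-En.tflite |",
]
ADDED = [
    "+ | WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 133.952 ms | 37 - 66 MB | GPU | Whisper-Base-En.tflite |",
    "+ | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 219.226 ms | 0 - 60 MB | GPU | Whisper-Base-En.tflite |",
]


def parse_row(line):
    """Turn one diff line holding a markdown table row into a dict."""
    body = line.strip()
    if body[:1] in "+-":          # drop the diff marker, if present
        body = body[1:]
    cells = [c.strip() for c in body.strip().strip("|").split("|")]
    row = dict(zip(HEADER, cells))
    # "160.081 ms" -> 160.081
    row["inference_ms"] = float(row["inference_ms"].split()[0])
    return row


def row_key(row):
    """Rows are matched on model, device, and target runtime."""
    return (row["model"], row["device"], row["runtime"])


def latency_changes(removed, added):
    """Map each matched key to (old_ms, new_ms, percent_change)."""
    old = {row_key(r): r["inference_ms"] for r in map(parse_row, removed)}
    changes = {}
    for row in map(parse_row, added):
        key = row_key(row)
        if key in old:
            before, after = old[key], row["inference_ms"]
            changes[key] = (before, after, 100.0 * (after - before) / before)
    return changes


changes = latency_changes(REMOVED, ADDED)
for (model, device, runtime), (before, after, pct) in changes.items():
    print(f"{model} on {device} ({runtime}): {before} ms -> {after} ms ({pct:+.1f}%)")
```

A negative percentage means the new measurement is faster. Matching on model, device, and runtime is a design choice for this sketch; it relies on each such triple appearing at most once per side of the diff, which holds for the table shown here.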
precompiled/qualcomm-snapdragon-x-elite/Whisper-Base-En_WhisperDecoderInf.onnx.zip CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:4f04e74c2d1b5f5a02787df00f7b9646ff0fc8b9aec3dd104e7090718782c8ff
- size 137760823
+ oid sha256:bbdc902fcadf9c24319aa0a1f7e300f3e926a79f18d8830ec1b46498738b49e3
+ size 137760824
precompiled/qualcomm-snapdragon-x-elite/Whisper-Base-En_WhisperEncoderInf.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:d60156a79d35b8de184e60441d7aa116d7461ecb50308e8355eaac333610f982
+ oid sha256:e15bf25a17c138e7344b2b21d54e980ab398cb1edabba2842d75ef0bac155eff
  size 93904432
precompiled/qualcomm-snapdragon-x-elite/Whisper-Base-En_WhisperEncoderInf.onnx.zip CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:f308da438c35e95250a80f4bc506138f91400dec375793af490dd3ed4440a4f5
- size 55305225
+ oid sha256:02a0a9459561ef91af9c18a177aeb843c45caa4b54f8ef64d4605dd3c54d68b7
+ size 55305196
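
The three precompiled files above are stored as Git LFS pointers, so each diff only changes the `oid` (a SHA-256 digest) and, where the payload length changed, the `size`. A downloaded artifact can be checked against its pointer. The following is a minimal sketch under that assumption: `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and the byte string used in the check is a stand-in, not the real model file.

```python
import hashlib

# New pointer for Whisper-Base-En_WhisperEncoderInf.onnx.zip, copied from the diff.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:02a0a9459561ef91af9c18a177aeb843c45caa4b54f8ef64d4605dd3c54d68b7
size 55305196
"""


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Return True if the bytes have the size and SHA-256 digest the pointer records."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (len(data) == pointer["size"]
            and hashlib.sha256(data).hexdigest() == pointer["digest"])


pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])

# Stand-in payload: build a matching pointer for it, then verify and tamper.
payload = b"stand-in bytes, not the real model"
stand_in = {"algo": "sha256",
            "digest": hashlib.sha256(payload).hexdigest(),
            "size": len(payload)}
print(matches_pointer(payload, stand_in))         # True
print(matches_pointer(payload + b"!", stand_in))  # False
```

For the real artifacts, which are tens of megabytes, hashing the file in chunks with `hashlib.sha256().update(...)` would avoid reading it into memory at once.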
precompiled/qualcomm-snapdragon-x-elite/sdk_versions.yml ADDED
@@ -0,0 +1,5 @@
+ sdk_versions:
+   qnn_context_binary:
+     qairt: 2.34.2.250528164111_119506
+   precompiled_qnn_onnx:
+     qairt: 2.33.2.250410134701_117956
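
The new `sdk_versions.yml` records a different QAIRT version for each precompiled artifact type. The sketch below reads the file and compares the two releases. It assumes the two-level nesting that the added lines imply (the indentation is not preserved in the diff view), and it uses a small indentation-based reader instead of a full YAML parser so that it runs with the standard library only. `parse_nested_mapping` and `release_tuple` are hypothetical helper names.

```python
# File content from the diff, with the nesting assumed as described above.
SDK_VERSIONS_YML = """\
sdk_versions:
  qnn_context_binary:
    qairt: 2.34.2.250528164111_119506
  precompiled_qnn_onnx:
    qairt: 2.33.2.250410134701_117956
"""


def parse_nested_mapping(text):
    """Read nested 'key: value' lines by indentation (not a full YAML parser)."""
    root = {}
    stack = [(-1, root)]  # (indent of the key that opened the mapping, mapping)
    for raw in text.splitlines():
        if not raw.strip() or raw.lstrip().startswith("#"):
            continue
        indent = len(raw) - len(raw.lstrip(" "))
        key, _, value = raw.strip().partition(":")
        value = value.strip()
        # Close mappings opened at the same or a deeper indentation level.
        while stack[-1][0] >= indent:
            stack.pop()
        parent = stack[-1][1]
        if value:
            parent[key] = value
        else:
            parent[key] = {}
            stack.append((indent, parent[key]))
    return root


def release_tuple(version):
    """Keep the first three dot-separated components as integers."""
    return tuple(int(part) for part in version.split(".")[:3])


config = parse_nested_mapping(SDK_VERSIONS_YML)
versions = {name: release_tuple(entry["qairt"])
            for name, entry in config["sdk_versions"].items()}
for name, release in versions.items():
    print(name, ".".join(map(str, release)))

if len(set(versions.values())) > 1:
    print("artifacts were built with different QAIRT releases")
```

Only the first three components are compared because the meaning of the trailing component is not stated in the source.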