qaihm-bot committed on Dec 2, 2025

Commit

bc8b065

verified

1 Parent(s): 1dd38b8

v0.42.0

See https://github.com/quic/ai-hub-models/releases/v0.42.0 for changelog.

Files changed (46)

README.md +41 -37
precompiled/qualcomm-qcs6490-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip +1 -1
precompiled/qualcomm-qcs6490-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip +1 -1
precompiled/qualcomm-qcs8275-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-qcs8275-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-qcs8275-proxy/tool-versions.yaml +1 -1
precompiled/qualcomm-qcs8550-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-qcs8550-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip +2 -2
precompiled/qualcomm-qcs8550-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-qcs8550-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip +1 -1
precompiled/qualcomm-qcs9075-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-qcs9075-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-qcs9075-proxy/tool-versions.yaml +1 -1
precompiled/qualcomm-sa7255p/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-sa7255p/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-sa7255p/tool-versions.yaml +1 -1
precompiled/qualcomm-sa8255p-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-sa8255p-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-sa8255p-proxy/tool-versions.yaml +1 -1
precompiled/qualcomm-sa8650p-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-sa8650p-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-sa8650p-proxy/tool-versions.yaml +1 -1
precompiled/qualcomm-sa8775p/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-sa8775p/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-sa8775p/tool-versions.yaml +1 -1
precompiled/qualcomm-snapdragon-7gen4/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +3 -0
precompiled/qualcomm-snapdragon-7gen4/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip +3 -0
precompiled/qualcomm-snapdragon-7gen4/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +3 -0
precompiled/qualcomm-snapdragon-7gen4/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip +3 -0
precompiled/qualcomm-snapdragon-7gen4/tool-versions.yaml +4 -0
precompiled/qualcomm-snapdragon-8-elite-for-galaxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-snapdragon-8-elite-for-galaxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip +2 -2
precompiled/qualcomm-snapdragon-8-elite-for-galaxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-snapdragon-8-elite-for-galaxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip +2 -2
precompiled/qualcomm-snapdragon-8-elite-gen5/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-snapdragon-8-elite-gen5/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip +2 -2
precompiled/qualcomm-snapdragon-8-elite-gen5/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-snapdragon-8-elite-gen5/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip +2 -2
precompiled/qualcomm-snapdragon-8gen3/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-snapdragon-8gen3/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip +2 -2
precompiled/qualcomm-snapdragon-8gen3/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-snapdragon-8gen3/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip +2 -2
precompiled/qualcomm-snapdragon-x-elite/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-snapdragon-x-elite/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip +2 -2
precompiled/qualcomm-snapdragon-x-elite/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
precompiled/qualcomm-snapdragon-x-elite/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip +1 -1

README.md CHANGED

@@ -35,40 +35,44 @@ More details on model performance across various devices, can be found
 | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
 |---|---|---|---|---|---|---|---|---|
-| WhisperSmallEncoderQuantizable | w8a16 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_CONTEXT_BINARY | 466.214 ms | 1 - 10 MB | NPU | Use Export Script |
-| WhisperSmallEncoderQuantizable | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_CONTEXT_BINARY | 318.755 ms | 0 - 7 MB | NPU | Use Export Script |
-| WhisperSmallEncoderQuantizable | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | PRECOMPILED_QNN_ONNX | 62.444 ms | 0 - 113 MB | NPU | Use Export Script |
-| WhisperSmallEncoderQuantizable | w8a16 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_CONTEXT_BINARY | 269.19 ms | 0 - 10 MB | NPU | Use Export Script |
-| WhisperSmallEncoderQuantizable | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | PRECOMPILED_QNN_ONNX | 612.628 ms | 52 - 63 MB | NPU | Use Export Script |
-| WhisperSmallEncoderQuantizable | w8a16 | SA7255P ADP | Qualcomm® SA7255P | QNN_CONTEXT_BINARY | 466.214 ms | 1 - 10 MB | NPU | Use Export Script |
-| WhisperSmallEncoderQuantizable | w8a16 | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_CONTEXT_BINARY | 323.676 ms | 1 - 3 MB | NPU | Use Export Script |
-| WhisperSmallEncoderQuantizable | w8a16 | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_CONTEXT_BINARY | 334.22 ms | 0 - 3 MB | NPU | Use Export Script |
-| WhisperSmallEncoderQuantizable | w8a16 | SA8775P ADP | Qualcomm® SA8775P | QNN_CONTEXT_BINARY | 269.19 ms | 0 - 10 MB | NPU | Use Export Script |
-| WhisperSmallEncoderQuantizable | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_CONTEXT_BINARY | 251.069 ms | 1 - 19 MB | NPU | Use Export Script |
-| WhisperSmallEncoderQuantizable | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | PRECOMPILED_QNN_ONNX | 45.312 ms | 56 - 75 MB | NPU | Use Export Script |
-| WhisperSmallEncoderQuantizable | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | QNN_CONTEXT_BINARY | 205.021 ms | 1 - 17 MB | NPU | Use Export Script |
-| WhisperSmallEncoderQuantizable | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | PRECOMPILED_QNN_ONNX | 35.236 ms | 63 - 78 MB | NPU | Use Export Script |
-| WhisperSmallEncoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | QNN_CONTEXT_BINARY | 178.557 ms | 0 - 12 MB | NPU | Use Export Script |
-| WhisperSmallEncoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | PRECOMPILED_QNN_ONNX | 30.325 ms | 61 - 72 MB | NPU | Use Export Script |
-| WhisperSmallEncoderQuantizable | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_CONTEXT_BINARY | 243.613 ms | 0 - 0 MB | NPU | Use Export Script |
-| WhisperSmallEncoderQuantizable | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | PRECOMPILED_QNN_ONNX | 61.693 ms | 107 - 107 MB | NPU | Use Export Script |
-| WhisperSmallDecoderQuantizable | w8a16 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_CONTEXT_BINARY | 13.38 ms | 26 - 35 MB | NPU | Use Export Script |
-| WhisperSmallDecoderQuantizable | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_CONTEXT_BINARY | 8.402 ms | 30 - 33 MB | NPU | Use Export Script |
-| WhisperSmallDecoderQuantizable | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | PRECOMPILED_QNN_ONNX | 8.647 ms | 0 - 192 MB | NPU | Use Export Script |
-| WhisperSmallDecoderQuantizable | w8a16 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_CONTEXT_BINARY | 9.436 ms | 29 - 39 MB | NPU | Use Export Script |
-| WhisperSmallDecoderQuantizable | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | PRECOMPILED_QNN_ONNX | 33.594 ms | 37 - 49 MB | NPU | Use Export Script |
-| WhisperSmallDecoderQuantizable | w8a16 | SA7255P ADP | Qualcomm® SA7255P | QNN_CONTEXT_BINARY | 13.38 ms | 26 - 35 MB | NPU | Use Export Script |
-| WhisperSmallDecoderQuantizable | w8a16 | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_CONTEXT_BINARY | 8.503 ms | 28 - 31 MB | NPU | Use Export Script |
-| WhisperSmallDecoderQuantizable | w8a16 | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_CONTEXT_BINARY | 8.224 ms | 28 - 30 MB | NPU | Use Export Script |
-| WhisperSmallDecoderQuantizable | w8a16 | SA8775P ADP | Qualcomm® SA8775P | QNN_CONTEXT_BINARY | 9.436 ms | 29 - 39 MB | NPU | Use Export Script |
-| WhisperSmallDecoderQuantizable | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_CONTEXT_BINARY | 6.455 ms | 30 - 48 MB | NPU | Use Export Script |
-| WhisperSmallDecoderQuantizable | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | PRECOMPILED_QNN_ONNX | 6.715 ms | 38 - 56 MB | NPU | Use Export Script |
-| WhisperSmallDecoderQuantizable | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | QNN_CONTEXT_BINARY | 4.756 ms | 18 - 35 MB | NPU | Use Export Script |
-| WhisperSmallDecoderQuantizable | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | PRECOMPILED_QNN_ONNX | 5.136 ms | 27 - 42 MB | NPU | Use Export Script |
-| WhisperSmallDecoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | QNN_CONTEXT_BINARY | 4.02 ms | 28 - 40 MB | NPU | Use Export Script |
-| WhisperSmallDecoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | PRECOMPILED_QNN_ONNX | 4.357 ms | 38 - 48 MB | NPU | Use Export Script |
-| WhisperSmallDecoderQuantizable | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_CONTEXT_BINARY | 7.731 ms | 30 - 30 MB | NPU | Use Export Script |
-| WhisperSmallDecoderQuantizable | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | PRECOMPILED_QNN_ONNX | 7.792 ms | 185 - 185 MB | NPU | Use Export Script |
@@ -83,9 +87,9 @@ pip install "qai-hub-models[whisper-small-quantized]"
 ```
-## Configure Qualcomm® AI Hub to run this model on a cloud-hosted device
-Sign-in to [Qualcomm® AI Hub](https://app.aihub.qualcomm.com/) with your
 Qualcomm® ID. Once signed in navigate to `Account -> Settings -> API Token`.
 With this API token, you can configure your client to run models on the cloud
@@ -93,7 +97,7 @@ hosted devices.
 ```bash
 qai-hub configure --api_token API_TOKEN
 ```
-Navigate to [docs](https://app.aihub.qualcomm.com/docs/) for more information.

 | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
 |---|---|---|---|---|---|---|---|---|
+| WhisperSmallEncoderQuantizable | w8a16 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_CONTEXT_BINARY | 450.472 ms | 1 - 10 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_CONTEXT_BINARY | 321.308 ms | 1 - 3 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | PRECOMPILED_QNN_ONNX | 63.07 ms | 63 - 65 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_CONTEXT_BINARY | 925.191 ms | 1 - 10 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | PRECOMPILED_QNN_ONNX | 604.083 ms | 23 - 32 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | SA7255P ADP | Qualcomm® SA7255P | QNN_CONTEXT_BINARY | 450.472 ms | 1 - 10 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_CONTEXT_BINARY | 323.282 ms | 1 - 3 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_CONTEXT_BINARY | 324.535 ms | 1 - 4 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | SA8775P ADP | Qualcomm® SA8775P | QNN_CONTEXT_BINARY | 925.191 ms | 1 - 10 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_CONTEXT_BINARY | 247.351 ms | 0 - 19 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | PRECOMPILED_QNN_ONNX | 45.579 ms | 63 - 82 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | QNN_CONTEXT_BINARY | 204.196 ms | 1 - 17 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | PRECOMPILED_QNN_ONNX | 35.311 ms | 63 - 78 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | Snapdragon 7 Gen 4 QRD | Snapdragon® 7 Gen 4 Mobile | QNN_CONTEXT_BINARY | 529.171 ms | 0 - 15 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | Snapdragon 7 Gen 4 QRD | Snapdragon® 7 Gen 4 Mobile | PRECOMPILED_QNN_ONNX | 186.282 ms | 53 - 63 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | QNN_CONTEXT_BINARY | 191.947 ms | 1 - 12 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | PRECOMPILED_QNN_ONNX | 28.294 ms | 62 - 73 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_CONTEXT_BINARY | 244.642 ms | 0 - 0 MB | NPU | Use Export Script |
+| WhisperSmallEncoderQuantizable | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | PRECOMPILED_QNN_ONNX | 61.723 ms | 108 - 108 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_CONTEXT_BINARY | 13.534 ms | 26 - 36 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_CONTEXT_BINARY | 8.366 ms | 30 - 33 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | PRECOMPILED_QNN_ONNX | 8.784 ms | 28 - 31 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_CONTEXT_BINARY | 9.462 ms | 26 - 36 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | PRECOMPILED_QNN_ONNX | 33.78 ms | 32 - 42 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | SA7255P ADP | Qualcomm® SA7255P | QNN_CONTEXT_BINARY | 13.534 ms | 26 - 36 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_CONTEXT_BINARY | 8.387 ms | 24 - 26 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_CONTEXT_BINARY | 8.439 ms | 30 - 33 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | SA8775P ADP | Qualcomm® SA8775P | QNN_CONTEXT_BINARY | 9.462 ms | 26 - 36 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_CONTEXT_BINARY | 6.423 ms | 26 - 45 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | PRECOMPILED_QNN_ONNX | 6.71 ms | 33 - 52 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | QNN_CONTEXT_BINARY | 4.805 ms | 17 - 31 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | PRECOMPILED_QNN_ONNX | 5.158 ms | 25 - 36 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | Snapdragon 7 Gen 4 QRD | Snapdragon® 7 Gen 4 Mobile | QNN_CONTEXT_BINARY | 11.23 ms | 25 - 40 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | Snapdragon 7 Gen 4 QRD | Snapdragon® 7 Gen 4 Mobile | PRECOMPILED_QNN_ONNX | 12.017 ms | 38 - 57 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | QNN_CONTEXT_BINARY | 4.0 ms | 30 - 42 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | PRECOMPILED_QNN_ONNX | 4.347 ms | 36 - 46 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_CONTEXT_BINARY | 7.89 ms | 30 - 30 MB | NPU | Use Export Script |
+| WhisperSmallDecoderQuantizable | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | PRECOMPILED_QNN_ONNX | 7.821 ms | 186 - 186 MB | NPU | Use Export Script |
 ```
+## Configure Qualcomm® AI Hub Workbench to run this model on a cloud-hosted device
+Sign-in to [Qualcomm® AI Hub Workbench](https://workbench.aihub.qualcomm.com/) with your
 Qualcomm® ID. Once signed in navigate to `Account -> Settings -> API Token`.
 With this API token, you can configure your client to run models on the cloud
 ```bash
 qai-hub configure --api_token API_TOKEN
 ```
+Navigate to [docs](https://workbench.aihub.qualcomm.com/docs/) for more information.
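
The performance rows in this README diff can be compared mechanically. The following is a minimal sketch, not part of the repository: it parses removed (`-`) and added (`+`) table rows and reports latency changes above a threshold. The helper names and the 10% threshold are assumptions chosen for illustration; the sample rows are copied from the diff above.

```python
# Sketch: compare "Inference Time (ms)" between removed and added README rows.
# parse_row, latency_changes and the 10% threshold are illustrative assumptions.

OLD_ROWS = [
    "-| WhisperSmallEncoderQuantizable | w8a16 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_CONTEXT_BINARY | 269.19 ms | 0 - 10 MB | NPU | Use Export Script |",
    "-| WhisperSmallEncoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | PRECOMPILED_QNN_ONNX | 30.325 ms | 61 - 72 MB | NPU | Use Export Script |",
]
NEW_ROWS = [
    "+| WhisperSmallEncoderQuantizable | w8a16 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_CONTEXT_BINARY | 925.191 ms | 1 - 10 MB | NPU | Use Export Script |",
    "+| WhisperSmallEncoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | PRECOMPILED_QNN_ONNX | 28.294 ms | 62 - 73 MB | NPU | Use Export Script |",
]


def parse_row(line: str) -> tuple[tuple[str, str, str], float]:
    """Return ((model, device, runtime), latency_ms) for one diff table row."""
    # Drop the leading diff marker, then the outer pipes of the Markdown row.
    cells = [c.strip() for c in line.lstrip("+- ").strip().strip("|").split("|")]
    model, _precision, device, _chipset, runtime, latency = cells[:6]
    return (model, device, runtime), float(latency.removesuffix("ms").strip())


def latency_changes(old_rows, new_rows, threshold=0.10):
    """Map each row key to its relative latency change if |change| >= threshold."""
    old = dict(parse_row(row) for row in old_rows)
    changes = {}
    for row in new_rows:
        key, new_ms = parse_row(row)
        if key in old:
            relative = (new_ms - old[key]) / old[key]
            if abs(relative) >= threshold:
                changes[key] = relative
    return changes


changes = latency_changes(OLD_ROWS, NEW_ROWS)
for (model, device, runtime), relative in changes.items():
    print(f"{model} on {device} ({runtime}): {relative:+.1%}")
```

With the two sample pairs, only the QCS9075 (Proxy) encoder row crosses the threshold; the diff itself does not say whether that jump is a real regression or a measurement artifact.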

precompiled/qualcomm-qcs6490-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:721564bbd831d19ff49da30758f26e915ce7c3c0476fd54e6499978dd418eb2c
 size 193518243

 version https://git-lfs.github.com/spec/v1
+oid sha256:a6def630635b86d43a2e476067fcdde1bf044fedbe5645b13450adab60e99252
 size 193518243
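
Each binary in this commit is stored as a Git LFS pointer: a `version` line, an `oid sha256:` digest, and a `size` in bytes. As a rough sketch (the helper names are hypothetical and not part of any Qualcomm or Hugging Face tooling), a downloaded artifact could be checked against its pointer like this:

```python
# Sketch: verify a downloaded file against a Git LFS pointer (size + SHA-256).
# parse_lfs_pointer and matches_pointer are illustrative helper names.
import hashlib
from pathlib import Path

# Pointer text copied from the new side of the diff above.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:a6def630635b86d43a2e476067fcdde1bf044fedbe5645b13450adab60e99252
size 193518243
"""


def parse_lfs_pointer(text: str) -> dict[str, str]:
    """Split each 'key value' pointer line into a dictionary entry."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path, pointer_text: str, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and digest in the pointer."""
    fields = parse_lfs_pointer(pointer_text)
    algorithm, _, expected = fields["oid"].partition(":")
    if algorithm != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algorithm!r}")
    file_path = Path(path)
    # Cheap check first: a size mismatch means the digest cannot match either.
    if file_path.stat().st_size != int(fields["size"]):
        return False
    digest = hashlib.sha256()
    with file_path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected
```

Comparing the size before hashing avoids reading a file of roughly 190 MB when it is obviously truncated. Several pointers in this commit change only the `oid` while the `size` stays the same, so the size check alone would not detect those updates.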

precompiled/qualcomm-qcs6490-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3124dd79bfc2734e25871e7c5cb65dda50e4c45b5f24a582dea0de723d22c295
 size 102104982

 version https://git-lfs.github.com/spec/v1
+oid sha256:d7b367ea6e78d5abc4bfa86bf8bd6f2cceb0f5e8383d72a15d675f27d76ca3f0
 size 102104982

precompiled/qualcomm-qcs8275-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:087ff30ea3d4ae5fc8972c73d64c1c6d92841fd40d4090955a57e49b2a3a31e5
 size 225382400

 version https://git-lfs.github.com/spec/v1
+oid sha256:24208db19866047ea5ad231d2836f469283e3f637e8b024ff42be251e6c0bdcf
 size 225382400

precompiled/qualcomm-qcs8275-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8b2983626c780cef63de140328a36a7e09d647707c789c90c88fc6ee01b9ac63
 size 130682880

 version https://git-lfs.github.com/spec/v1
+oid sha256:4794729267b57f8d32ce5ba27a3419f395db5c1f02c8d4dbdf3b6f7971004e8d
 size 130682880

precompiled/qualcomm-qcs8275-proxy/tool-versions.yaml CHANGED

@@ -1,3 +1,3 @@
 tool_versions:
   qnn_context_binary:
-    qairt: 2.39.0.250925215840_163802-auto

 tool_versions:
   qnn_context_binary:
+    qairt: 2.40.0.251030114326_189385-auto
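
The `qairt` value changes from a 2.39.0 build to a 2.40.0 build in this file. The sketch below splits such a string so two builds can be compared by release number. The reading of the fields (a three-part release, then a numeric stamp, a numeric build id, and an optional suffix such as `auto`) is an assumption inferred only from the strings in this diff, not from QAIRT documentation.

```python
# Sketch: split a qairt version string from tool-versions.yaml into parts.
# The field interpretation is an assumption based on the strings in this diff.
import re

QAIRT_PATTERN = re.compile(r"^(\d+)\.(\d+)\.(\d+)\.(\d+)_(\d+)(?:-(\w+))?$")


def parse_qairt(version: str) -> dict:
    """Return the release tuple plus the remaining raw components."""
    match = QAIRT_PATTERN.match(version.strip())
    if match is None:
        raise ValueError(f"unrecognized qairt version: {version!r}")
    major, minor, patch, stamp, build, suffix = match.groups()
    return {
        "release": (int(major), int(minor), int(patch)),
        "stamp": stamp,    # assumed to be a build timestamp
        "build": build,    # assumed to be a build identifier
        "suffix": suffix,  # e.g. "auto"; None when absent
    }


old = parse_qairt("2.39.0.250925215840_163802-auto")
new = parse_qairt("2.40.0.251030114326_189385-auto")
print(old["release"], "->", new["release"])
```

Comparing the `release` tuples rather than the raw strings keeps the ordering numeric, so a hypothetical 2.100.0 would sort after 2.40.0.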

precompiled/qualcomm-qcs8550-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a8005aaf78182cc56b538a21e08f610c8ed2629e1817075ff228340c9fecef90
 size 225378304

 version https://git-lfs.github.com/spec/v1
+oid sha256:10394e71510154ed77feae4dc48450d289685a1eee515fbd5f5655541c908ef8
 size 225378304

precompiled/qualcomm-qcs8550-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0c92fbe0d7ada5b3fc7498821d1f12803a2e043b67a01e4e52a4614a8e11b2f5
-size 193590896

 version https://git-lfs.github.com/spec/v1
+oid sha256:fa72357c6814e51f5a9e1ba9f05792c9802706d211d3ec526e68bf54279a00f3
+size 193590918

precompiled/qualcomm-qcs8550-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4a6531b2e2abcc30e755408d424e4326dacad604d34a8b9dd492db844e3f1a14
 size 130580480

 version https://git-lfs.github.com/spec/v1
+oid sha256:79df9e8e3a94e3923bf5ae03a60bf4622e28bdf60153246a9336867655b9664c
 size 130580480

precompiled/qualcomm-qcs8550-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:701e9e0c513e880ab484bea2fb8d8e513984c8421ed8e5cf7dbe82de1eaf6e13
 size 93996909

 version https://git-lfs.github.com/spec/v1
+oid sha256:01ea1bbf828aa965b3ee8848ac93c53ca78f2126dd9b6f613fec8c3ec55f9ccb
 size 93996909

precompiled/qualcomm-qcs9075-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:acd7831a94069a339b2fc1d2571b7a8a42500e0d4a0e2634760edc705b9d7d91
 size 225386496

 version https://git-lfs.github.com/spec/v1
+oid sha256:1843626282087293ceb96dbfe14e5c80689f33b4658c6887092d35b881b0be00
 size 225386496

precompiled/qualcomm-qcs9075-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:dee4f3da58574993135c8bbf72e2f35ac3c48b5fdc805cc6f047c2f44c0c8191
 size 130678784

 version https://git-lfs.github.com/spec/v1
+oid sha256:ef0733e4636f075193280ac4d3c1beabf6b5b6b4603369ce89b7cf726fd39028
 size 130678784

precompiled/qualcomm-qcs9075-proxy/tool-versions.yaml CHANGED

@@ -1,3 +1,3 @@
 tool_versions:
   qnn_context_binary:
-    qairt: 2.39.0.250925215840_163802-auto

 tool_versions:
   qnn_context_binary:
+    qairt: 2.40.0.251030114326_189385-auto

precompiled/qualcomm-sa7255p/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:087ff30ea3d4ae5fc8972c73d64c1c6d92841fd40d4090955a57e49b2a3a31e5
 size 225382400

 version https://git-lfs.github.com/spec/v1
+oid sha256:24208db19866047ea5ad231d2836f469283e3f637e8b024ff42be251e6c0bdcf
 size 225382400

precompiled/qualcomm-sa7255p/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8b2983626c780cef63de140328a36a7e09d647707c789c90c88fc6ee01b9ac63
 size 130682880

 version https://git-lfs.github.com/spec/v1
+oid sha256:4794729267b57f8d32ce5ba27a3419f395db5c1f02c8d4dbdf3b6f7971004e8d
 size 130682880

precompiled/qualcomm-sa7255p/tool-versions.yaml CHANGED

@@ -1,3 +1,3 @@
 tool_versions:
   qnn_context_binary:
-    qairt: 2.39.0.250925215840_163802-auto

 tool_versions:
   qnn_context_binary:
+    qairt: 2.40.0.251030114326_189385-auto

precompiled/qualcomm-sa8255p-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a8005aaf78182cc56b538a21e08f610c8ed2629e1817075ff228340c9fecef90
 size 225378304

 version https://git-lfs.github.com/spec/v1
+oid sha256:10394e71510154ed77feae4dc48450d289685a1eee515fbd5f5655541c908ef8
 size 225378304

precompiled/qualcomm-sa8255p-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4a6531b2e2abcc30e755408d424e4326dacad604d34a8b9dd492db844e3f1a14
 size 130580480

 version https://git-lfs.github.com/spec/v1
+oid sha256:79df9e8e3a94e3923bf5ae03a60bf4622e28bdf60153246a9336867655b9664c
 size 130580480

precompiled/qualcomm-sa8255p-proxy/tool-versions.yaml CHANGED

@@ -1,3 +1,3 @@
 tool_versions:
   qnn_context_binary:
-    qairt: 2.39.0.250925215840_163802

 tool_versions:
   qnn_context_binary:
+    qairt: 2.40.0.251030114326_189385

precompiled/qualcomm-sa8650p-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a8005aaf78182cc56b538a21e08f610c8ed2629e1817075ff228340c9fecef90
 size 225378304

 version https://git-lfs.github.com/spec/v1
+oid sha256:10394e71510154ed77feae4dc48450d289685a1eee515fbd5f5655541c908ef8
 size 225378304

precompiled/qualcomm-sa8650p-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4a6531b2e2abcc30e755408d424e4326dacad604d34a8b9dd492db844e3f1a14
 size 130580480

 version https://git-lfs.github.com/spec/v1
+oid sha256:79df9e8e3a94e3923bf5ae03a60bf4622e28bdf60153246a9336867655b9664c
 size 130580480

precompiled/qualcomm-sa8650p-proxy/tool-versions.yaml CHANGED

@@ -1,3 +1,3 @@
 tool_versions:
   qnn_context_binary:
-    qairt: 2.39.0.250925215840_163802

 tool_versions:
   qnn_context_binary:
+    qairt: 2.40.0.251030114326_189385

precompiled/qualcomm-sa8775p/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:acd7831a94069a339b2fc1d2571b7a8a42500e0d4a0e2634760edc705b9d7d91
 size 225386496

 version https://git-lfs.github.com/spec/v1
+oid sha256:1843626282087293ceb96dbfe14e5c80689f33b4658c6887092d35b881b0be00
 size 225386496

precompiled/qualcomm-sa8775p/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:dee4f3da58574993135c8bbf72e2f35ac3c48b5fdc805cc6f047c2f44c0c8191
 size 130678784

 version https://git-lfs.github.com/spec/v1
+oid sha256:ef0733e4636f075193280ac4d3c1beabf6b5b6b4603369ce89b7cf726fd39028
 size 130678784

precompiled/qualcomm-sa8775p/tool-versions.yaml CHANGED

@@ -1,3 +1,3 @@
 tool_versions:
   qnn_context_binary:
-    qairt: 2.39.0.250925215840_163802-auto

 tool_versions:
   qnn_context_binary:
+    qairt: 2.40.0.251030114326_189385-auto

precompiled/qualcomm-snapdragon-7gen4/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9c4e5b2c9d14cbd89276b447dafe01c47c469125b1c106581c4975095069d1c0
+size 225513472

precompiled/qualcomm-snapdragon-7gen4/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:df0e117dd6db6f920bcb491eb4f41da9fc257f70cc887ceffd93c766d5046c42
+size 193634989

precompiled/qualcomm-snapdragon-7gen4/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d77d64e8b1b4f3d7c9646f9bbc9131d594ab70a9b85fab3a5e9b6779e5e5023c
+size 145821696

precompiled/qualcomm-snapdragon-7gen4/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:f7d1cffbee2dcf771de5ca5d6178a2a3c802f68cfbf0b667a036fb70a4c5eefe
+size 110006845

precompiled/qualcomm-snapdragon-7gen4/tool-versions.yaml ADDED

@@ -0,0 +1,4 @@
+tool_versions:
+  precompiled_qnn_onnx:
+    qairt: 2.37.1.250807093845_124904
+    onnx_runtime: 1.23.0

precompiled/qualcomm-snapdragon-8-elite-for-galaxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:618e4089eeeddbd02b095ef303555f31300c55d76f0bf51bfdb2fccfed310c7c
 size 225320960

 version https://git-lfs.github.com/spec/v1
+oid sha256:25fe4234de1f21dc4f86e5d9e892eaf408f14353c0c1d9d26f46c7c87d2d699c
 size 225320960

precompiled/qualcomm-snapdragon-8-elite-for-galaxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7273e24b09e3c471c1e21788ccc1b38795dc37967aa864e94a44c47557b55c0f
-size 193571490

 version https://git-lfs.github.com/spec/v1
+oid sha256:723971bb5edadea6091da623908770b43e1c48af057ec472bc53a43bd79bf92a
+size 193571492

precompiled/qualcomm-snapdragon-8-elite-for-galaxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:aa14e9aeeb5095d5afbb2de195405b4fa188c19cc14c7858fe09334ee0ad5976
 size 129683456

 version https://git-lfs.github.com/spec/v1
+oid sha256:ad0e827e028aa01bc8efd4b4286ec3763f094e580903521a1f4276a3a3d1280d
 size 129683456

precompiled/qualcomm-snapdragon-8-elite-for-galaxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7a7e1a44a4f730857843a58a1fe92eff458462d6a54ae2f0c641914a6d630878
-size 93686218

 version https://git-lfs.github.com/spec/v1
+oid sha256:45137e79329b1ab5e466dffcd0947caa148384279b7c7090b854b5fb8213bfa9
+size 93686236

precompiled/qualcomm-snapdragon-8-elite-gen5/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:324a685489d7ad27bf787f7ff18f1c983bc5ec8515011e6bce3e58fae5b28f23
 size 225484800

 version https://git-lfs.github.com/spec/v1
+oid sha256:0ae8c828515e159b8c6d31acdd6df36a40a2511a48bb7a743e54e9f8fbb399ec
 size 225484800

precompiled/qualcomm-snapdragon-8-elite-gen5/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5bdc6ca5660501ab1196792d16ee2b369544776f3b63d4b6b1352e8dcc726d09
-size 193623884

 version https://git-lfs.github.com/spec/v1
+oid sha256:289fd3ff8ea3bfb4aef8f73f4855bca20a45854af4e30ff3a8de3525381dd0f7
+size 193623880

precompiled/qualcomm-snapdragon-8-elite-gen5/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cc5eea5fd9fd79e9b7bdd5160d4864a89a716b6290d2d974a476855bfaba4bae
 size 132378624

 version https://git-lfs.github.com/spec/v1
+oid sha256:68440264fd98da6dff55960c62385e3e7cf8ecf1bcf151eeab3221dba24d7ff4
 size 132378624

precompiled/qualcomm-snapdragon-8-elite-gen5/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0fd02d02ed5433f6dafcb471ff0a4383bcfeeb8a5414630af57683c5dd043c1f
-size 94065519

 version https://git-lfs.github.com/spec/v1
+oid sha256:4988fcad96481e5f3baff38017fd53c55dbd8e7e21f1fa0cede0604d81a3213f
+size 94065502

precompiled/qualcomm-snapdragon-8gen3/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:597497f0d94f70473068440d0038a32f639c46ee64a99ae7a6fbf9b7986f9c49
 size 225374208

 version https://git-lfs.github.com/spec/v1
+oid sha256:2ce55c7368a2022bbef3070e71d2fb52a112f46581b371344cba209081ad7b10
 size 225374208

precompiled/qualcomm-snapdragon-8gen3/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:930ac9bd8bc475d55a7096eba85a8101a80d31a24c37fd3de8c747ead3d0018a
-size 193588664

 version https://git-lfs.github.com/spec/v1
+oid sha256:6b0963daf7e027b2d2be8a71d8d7664e7042403f8b193f1310f99dd55934396a
+size 193588769

precompiled/qualcomm-snapdragon-8gen3/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5d4c11759f45fd3cc17df2e06b681c6414448366c887441b361eb18ff36aefd0
 size 130445312

 version https://git-lfs.github.com/spec/v1
+oid sha256:7da541c27e91456e766d6d1caeec55075a9353397d74cd0deaf36ab1db6c2183
 size 130445312

precompiled/qualcomm-snapdragon-8gen3/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:028566a7612a8a7d8194cc8512764e888405e982502dd95f39ce977da7efac32
-size 93711026

 version https://git-lfs.github.com/spec/v1
+oid sha256:13d0dc190113e6efe42b31f7a1032ddcafb8c8961f597453c96bee3f853b79f8
+size 93711016

precompiled/qualcomm-snapdragon-x-elite/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6a9c3e9d8fd2c387822b37a011314abd5737b31052f7d861fd8799a3c0dd30f2
 size 225378304

 version https://git-lfs.github.com/spec/v1
+oid sha256:2c923fdd7d2606ef14af381b451faf9424ab6084840810b4434168a35533fb5f
 size 225378304

precompiled/qualcomm-snapdragon-x-elite/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:85e6bb2b9ba765af4a2fd57d90375d753df6e8e0fd89faafb21ba1eb06c70398
-size 193589978

 version https://git-lfs.github.com/spec/v1
+oid sha256:941fad6b98fbd8c666fb5535e8cdb53128aea792b9dde85856df06dc1ab3cd83
+size 193589938

precompiled/qualcomm-snapdragon-x-elite/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:937ed38bc43b6baa9fc744327dfe088e30fcf9b55c3399e042f89b9b75ada2ab
 size 130580480

 version https://git-lfs.github.com/spec/v1
+oid sha256:e3c2fd6eaaf562d85a24b60017510a58849597abadc416f7684f0ed7692a6522
 size 130580480

precompiled/qualcomm-snapdragon-x-elite/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1e7afc3e2b35515e74b26c94a265a232436bdd5ca8b9896efd7b3091af1484bd
 size 93992314

 version https://git-lfs.github.com/spec/v1
+oid sha256:9490bb9fd417fa6f6009fe2d44462e191c453b4a3e09d3e267e6751f66e55f1b
 size 93992314