qaihm-bot commited on
Commit
bc8b065
·
verified ·
1 Parent(s): 1dd38b8

See https://github.com/quic/ai-hub-models/releases/v0.42.0 for changelog.

Files changed (46) hide show
  1. README.md +41 -37
  2. precompiled/qualcomm-qcs6490-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip +1 -1
  3. precompiled/qualcomm-qcs6490-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip +1 -1
  4. precompiled/qualcomm-qcs8275-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
  5. precompiled/qualcomm-qcs8275-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
  6. precompiled/qualcomm-qcs8275-proxy/tool-versions.yaml +1 -1
  7. precompiled/qualcomm-qcs8550-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
  8. precompiled/qualcomm-qcs8550-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip +2 -2
  9. precompiled/qualcomm-qcs8550-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
  10. precompiled/qualcomm-qcs8550-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip +1 -1
  11. precompiled/qualcomm-qcs9075-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
  12. precompiled/qualcomm-qcs9075-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
  13. precompiled/qualcomm-qcs9075-proxy/tool-versions.yaml +1 -1
  14. precompiled/qualcomm-sa7255p/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
  15. precompiled/qualcomm-sa7255p/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
  16. precompiled/qualcomm-sa7255p/tool-versions.yaml +1 -1
  17. precompiled/qualcomm-sa8255p-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
  18. precompiled/qualcomm-sa8255p-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
  19. precompiled/qualcomm-sa8255p-proxy/tool-versions.yaml +1 -1
  20. precompiled/qualcomm-sa8650p-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
  21. precompiled/qualcomm-sa8650p-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
  22. precompiled/qualcomm-sa8650p-proxy/tool-versions.yaml +1 -1
  23. precompiled/qualcomm-sa8775p/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
  24. precompiled/qualcomm-sa8775p/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
  25. precompiled/qualcomm-sa8775p/tool-versions.yaml +1 -1
  26. precompiled/qualcomm-snapdragon-7gen4/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +3 -0
  27. precompiled/qualcomm-snapdragon-7gen4/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip +3 -0
  28. precompiled/qualcomm-snapdragon-7gen4/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +3 -0
  29. precompiled/qualcomm-snapdragon-7gen4/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip +3 -0
  30. precompiled/qualcomm-snapdragon-7gen4/tool-versions.yaml +4 -0
  31. precompiled/qualcomm-snapdragon-8-elite-for-galaxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
  32. precompiled/qualcomm-snapdragon-8-elite-for-galaxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip +2 -2
  33. precompiled/qualcomm-snapdragon-8-elite-for-galaxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
  34. precompiled/qualcomm-snapdragon-8-elite-for-galaxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip +2 -2
  35. precompiled/qualcomm-snapdragon-8-elite-gen5/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
  36. precompiled/qualcomm-snapdragon-8-elite-gen5/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip +2 -2
  37. precompiled/qualcomm-snapdragon-8-elite-gen5/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
  38. precompiled/qualcomm-snapdragon-8-elite-gen5/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip +2 -2
  39. precompiled/qualcomm-snapdragon-8gen3/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
  40. precompiled/qualcomm-snapdragon-8gen3/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip +2 -2
  41. precompiled/qualcomm-snapdragon-8gen3/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
  42. precompiled/qualcomm-snapdragon-8gen3/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip +2 -2
  43. precompiled/qualcomm-snapdragon-x-elite/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin +1 -1
  44. precompiled/qualcomm-snapdragon-x-elite/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip +2 -2
  45. precompiled/qualcomm-snapdragon-x-elite/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin +1 -1
  46. precompiled/qualcomm-snapdragon-x-elite/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip +1 -1
README.md CHANGED
@@ -35,40 +35,44 @@ More details on model performance across various devices, can be found
35
 
36
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
37
  |---|---|---|---|---|---|---|---|---|
38
- | WhisperSmallEncoderQuantizable | w8a16 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_CONTEXT_BINARY | 466.214 ms | 1 - 10 MB | NPU | Use Export Script |
39
- | WhisperSmallEncoderQuantizable | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_CONTEXT_BINARY | 318.755 ms | 0 - 7 MB | NPU | Use Export Script |
40
- | WhisperSmallEncoderQuantizable | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | PRECOMPILED_QNN_ONNX | 62.444 ms | 0 - 113 MB | NPU | Use Export Script |
41
- | WhisperSmallEncoderQuantizable | w8a16 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_CONTEXT_BINARY | 269.19 ms | 0 - 10 MB | NPU | Use Export Script |
42
- | WhisperSmallEncoderQuantizable | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | PRECOMPILED_QNN_ONNX | 612.628 ms | 52 - 63 MB | NPU | Use Export Script |
43
- | WhisperSmallEncoderQuantizable | w8a16 | SA7255P ADP | Qualcomm® SA7255P | QNN_CONTEXT_BINARY | 466.214 ms | 1 - 10 MB | NPU | Use Export Script |
44
- | WhisperSmallEncoderQuantizable | w8a16 | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_CONTEXT_BINARY | 323.676 ms | 1 - 3 MB | NPU | Use Export Script |
45
- | WhisperSmallEncoderQuantizable | w8a16 | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_CONTEXT_BINARY | 334.22 ms | 0 - 3 MB | NPU | Use Export Script |
46
- | WhisperSmallEncoderQuantizable | w8a16 | SA8775P ADP | Qualcomm® SA8775P | QNN_CONTEXT_BINARY | 269.19 ms | 0 - 10 MB | NPU | Use Export Script |
47
- | WhisperSmallEncoderQuantizable | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_CONTEXT_BINARY | 251.069 ms | 1 - 19 MB | NPU | Use Export Script |
48
- | WhisperSmallEncoderQuantizable | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | PRECOMPILED_QNN_ONNX | 45.312 ms | 56 - 75 MB | NPU | Use Export Script |
49
- | WhisperSmallEncoderQuantizable | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | QNN_CONTEXT_BINARY | 205.021 ms | 1 - 17 MB | NPU | Use Export Script |
50
- | WhisperSmallEncoderQuantizable | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | PRECOMPILED_QNN_ONNX | 35.236 ms | 63 - 78 MB | NPU | Use Export Script |
51
- | WhisperSmallEncoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | QNN_CONTEXT_BINARY | 178.557 ms | 0 - 12 MB | NPU | Use Export Script |
52
- | WhisperSmallEncoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | PRECOMPILED_QNN_ONNX | 30.325 ms | 61 - 72 MB | NPU | Use Export Script |
53
- | WhisperSmallEncoderQuantizable | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_CONTEXT_BINARY | 243.613 ms | 0 - 0 MB | NPU | Use Export Script |
54
- | WhisperSmallEncoderQuantizable | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | PRECOMPILED_QNN_ONNX | 61.693 ms | 107 - 107 MB | NPU | Use Export Script |
55
- | WhisperSmallDecoderQuantizable | w8a16 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_CONTEXT_BINARY | 13.38 ms | 26 - 35 MB | NPU | Use Export Script |
56
- | WhisperSmallDecoderQuantizable | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_CONTEXT_BINARY | 8.402 ms | 30 - 33 MB | NPU | Use Export Script |
57
- | WhisperSmallDecoderQuantizable | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | PRECOMPILED_QNN_ONNX | 8.647 ms | 0 - 192 MB | NPU | Use Export Script |
58
- | WhisperSmallDecoderQuantizable | w8a16 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_CONTEXT_BINARY | 9.436 ms | 29 - 39 MB | NPU | Use Export Script |
59
- | WhisperSmallDecoderQuantizable | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | PRECOMPILED_QNN_ONNX | 33.594 ms | 37 - 49 MB | NPU | Use Export Script |
60
- | WhisperSmallDecoderQuantizable | w8a16 | SA7255P ADP | Qualcomm® SA7255P | QNN_CONTEXT_BINARY | 13.38 ms | 26 - 35 MB | NPU | Use Export Script |
61
- | WhisperSmallDecoderQuantizable | w8a16 | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_CONTEXT_BINARY | 8.503 ms | 28 - 31 MB | NPU | Use Export Script |
62
- | WhisperSmallDecoderQuantizable | w8a16 | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_CONTEXT_BINARY | 8.224 ms | 28 - 30 MB | NPU | Use Export Script |
63
- | WhisperSmallDecoderQuantizable | w8a16 | SA8775P ADP | Qualcomm® SA8775P | QNN_CONTEXT_BINARY | 9.436 ms | 29 - 39 MB | NPU | Use Export Script |
64
- | WhisperSmallDecoderQuantizable | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_CONTEXT_BINARY | 6.455 ms | 30 - 48 MB | NPU | Use Export Script |
65
- | WhisperSmallDecoderQuantizable | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | PRECOMPILED_QNN_ONNX | 6.715 ms | 38 - 56 MB | NPU | Use Export Script |
66
- | WhisperSmallDecoderQuantizable | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | QNN_CONTEXT_BINARY | 4.756 ms | 18 - 35 MB | NPU | Use Export Script |
67
- | WhisperSmallDecoderQuantizable | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | PRECOMPILED_QNN_ONNX | 5.136 ms | 27 - 42 MB | NPU | Use Export Script |
68
- | WhisperSmallDecoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | QNN_CONTEXT_BINARY | 4.02 ms | 28 - 40 MB | NPU | Use Export Script |
69
- | WhisperSmallDecoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | PRECOMPILED_QNN_ONNX | 4.357 ms | 38 - 48 MB | NPU | Use Export Script |
70
- | WhisperSmallDecoderQuantizable | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_CONTEXT_BINARY | 7.731 ms | 30 - 30 MB | NPU | Use Export Script |
71
- | WhisperSmallDecoderQuantizable | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | PRECOMPILED_QNN_ONNX | 7.792 ms | 185 - 185 MB | NPU | Use Export Script |
 
 
 
 
72
 
73
 
74
 
@@ -83,9 +87,9 @@ pip install "qai-hub-models[whisper-small-quantized]"
83
  ```
84
 
85
 
86
- ## Configure Qualcomm® AI Hub to run this model on a cloud-hosted device
87
 
88
- Sign-in to [Qualcomm® AI Hub](https://app.aihub.qualcomm.com/) with your
89
  Qualcomm® ID. Once signed in navigate to `Account -> Settings -> API Token`.
90
 
91
  With this API token, you can configure your client to run models on the cloud
@@ -93,7 +97,7 @@ hosted devices.
93
  ```bash
94
  qai-hub configure --api_token API_TOKEN
95
  ```
96
- Navigate to [docs](https://app.aihub.qualcomm.com/docs/) for more information.
97
 
98
 
99
 
 
35
 
36
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
37
  |---|---|---|---|---|---|---|---|---|
38
+ | WhisperSmallEncoderQuantizable | w8a16 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_CONTEXT_BINARY | 450.472 ms | 1 - 10 MB | NPU | Use Export Script |
39
+ | WhisperSmallEncoderQuantizable | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_CONTEXT_BINARY | 321.308 ms | 1 - 3 MB | NPU | Use Export Script |
40
+ | WhisperSmallEncoderQuantizable | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | PRECOMPILED_QNN_ONNX | 63.07 ms | 63 - 65 MB | NPU | Use Export Script |
41
+ | WhisperSmallEncoderQuantizable | w8a16 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_CONTEXT_BINARY | 925.191 ms | 1 - 10 MB | NPU | Use Export Script |
42
+ | WhisperSmallEncoderQuantizable | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | PRECOMPILED_QNN_ONNX | 604.083 ms | 23 - 32 MB | NPU | Use Export Script |
43
+ | WhisperSmallEncoderQuantizable | w8a16 | SA7255P ADP | Qualcomm® SA7255P | QNN_CONTEXT_BINARY | 450.472 ms | 1 - 10 MB | NPU | Use Export Script |
44
+ | WhisperSmallEncoderQuantizable | w8a16 | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_CONTEXT_BINARY | 323.282 ms | 1 - 3 MB | NPU | Use Export Script |
45
+ | WhisperSmallEncoderQuantizable | w8a16 | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_CONTEXT_BINARY | 324.535 ms | 1 - 4 MB | NPU | Use Export Script |
46
+ | WhisperSmallEncoderQuantizable | w8a16 | SA8775P ADP | Qualcomm® SA8775P | QNN_CONTEXT_BINARY | 925.191 ms | 1 - 10 MB | NPU | Use Export Script |
47
+ | WhisperSmallEncoderQuantizable | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_CONTEXT_BINARY | 247.351 ms | 0 - 19 MB | NPU | Use Export Script |
48
+ | WhisperSmallEncoderQuantizable | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | PRECOMPILED_QNN_ONNX | 45.579 ms | 63 - 82 MB | NPU | Use Export Script |
49
+ | WhisperSmallEncoderQuantizable | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | QNN_CONTEXT_BINARY | 204.196 ms | 1 - 17 MB | NPU | Use Export Script |
50
+ | WhisperSmallEncoderQuantizable | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | PRECOMPILED_QNN_ONNX | 35.311 ms | 63 - 78 MB | NPU | Use Export Script |
51
+ | WhisperSmallEncoderQuantizable | w8a16 | Snapdragon 7 Gen 4 QRD | Snapdragon® 7 Gen 4 Mobile | QNN_CONTEXT_BINARY | 529.171 ms | 0 - 15 MB | NPU | Use Export Script |
52
+ | WhisperSmallEncoderQuantizable | w8a16 | Snapdragon 7 Gen 4 QRD | Snapdragon® 7 Gen 4 Mobile | PRECOMPILED_QNN_ONNX | 186.282 ms | 53 - 63 MB | NPU | Use Export Script |
53
+ | WhisperSmallEncoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | QNN_CONTEXT_BINARY | 191.947 ms | 1 - 12 MB | NPU | Use Export Script |
54
+ | WhisperSmallEncoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | PRECOMPILED_QNN_ONNX | 28.294 ms | 62 - 73 MB | NPU | Use Export Script |
55
+ | WhisperSmallEncoderQuantizable | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_CONTEXT_BINARY | 244.642 ms | 0 - 0 MB | NPU | Use Export Script |
56
+ | WhisperSmallEncoderQuantizable | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | PRECOMPILED_QNN_ONNX | 61.723 ms | 108 - 108 MB | NPU | Use Export Script |
57
+ | WhisperSmallDecoderQuantizable | w8a16 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_CONTEXT_BINARY | 13.534 ms | 26 - 36 MB | NPU | Use Export Script |
58
+ | WhisperSmallDecoderQuantizable | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_CONTEXT_BINARY | 8.366 ms | 30 - 33 MB | NPU | Use Export Script |
59
+ | WhisperSmallDecoderQuantizable | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | PRECOMPILED_QNN_ONNX | 8.784 ms | 28 - 31 MB | NPU | Use Export Script |
60
+ | WhisperSmallDecoderQuantizable | w8a16 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_CONTEXT_BINARY | 9.462 ms | 26 - 36 MB | NPU | Use Export Script |
61
+ | WhisperSmallDecoderQuantizable | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | PRECOMPILED_QNN_ONNX | 33.78 ms | 32 - 42 MB | NPU | Use Export Script |
62
+ | WhisperSmallDecoderQuantizable | w8a16 | SA7255P ADP | Qualcomm® SA7255P | QNN_CONTEXT_BINARY | 13.534 ms | 26 - 36 MB | NPU | Use Export Script |
63
+ | WhisperSmallDecoderQuantizable | w8a16 | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_CONTEXT_BINARY | 8.387 ms | 24 - 26 MB | NPU | Use Export Script |
64
+ | WhisperSmallDecoderQuantizable | w8a16 | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_CONTEXT_BINARY | 8.439 ms | 30 - 33 MB | NPU | Use Export Script |
65
+ | WhisperSmallDecoderQuantizable | w8a16 | SA8775P ADP | Qualcomm® SA8775P | QNN_CONTEXT_BINARY | 9.462 ms | 26 - 36 MB | NPU | Use Export Script |
66
+ | WhisperSmallDecoderQuantizable | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_CONTEXT_BINARY | 6.423 ms | 26 - 45 MB | NPU | Use Export Script |
67
+ | WhisperSmallDecoderQuantizable | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | PRECOMPILED_QNN_ONNX | 6.71 ms | 33 - 52 MB | NPU | Use Export Script |
68
+ | WhisperSmallDecoderQuantizable | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | QNN_CONTEXT_BINARY | 4.805 ms | 17 - 31 MB | NPU | Use Export Script |
69
+ | WhisperSmallDecoderQuantizable | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | PRECOMPILED_QNN_ONNX | 5.158 ms | 25 - 36 MB | NPU | Use Export Script |
70
+ | WhisperSmallDecoderQuantizable | w8a16 | Snapdragon 7 Gen 4 QRD | Snapdragon® 7 Gen 4 Mobile | QNN_CONTEXT_BINARY | 11.23 ms | 25 - 40 MB | NPU | Use Export Script |
71
+ | WhisperSmallDecoderQuantizable | w8a16 | Snapdragon 7 Gen 4 QRD | Snapdragon® 7 Gen 4 Mobile | PRECOMPILED_QNN_ONNX | 12.017 ms | 38 - 57 MB | NPU | Use Export Script |
72
+ | WhisperSmallDecoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | QNN_CONTEXT_BINARY | 4.0 ms | 30 - 42 MB | NPU | Use Export Script |
73
+ | WhisperSmallDecoderQuantizable | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | PRECOMPILED_QNN_ONNX | 4.347 ms | 36 - 46 MB | NPU | Use Export Script |
74
+ | WhisperSmallDecoderQuantizable | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_CONTEXT_BINARY | 7.89 ms | 30 - 30 MB | NPU | Use Export Script |
75
+ | WhisperSmallDecoderQuantizable | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | PRECOMPILED_QNN_ONNX | 7.821 ms | 186 - 186 MB | NPU | Use Export Script |
76
 
77
 
78
 
 
87
  ```
88
 
89
 
90
+ ## Configure Qualcomm® AI Hub Workbench to run this model on a cloud-hosted device
91
 
92
+ Sign-in to [Qualcomm® AI Hub Workbench](https://workbench.aihub.qualcomm.com/) with your
93
  Qualcomm® ID. Once signed in navigate to `Account -> Settings -> API Token`.
94
 
95
  With this API token, you can configure your client to run models on the cloud
 
97
  ```bash
98
  qai-hub configure --api_token API_TOKEN
99
  ```
100
+ Navigate to [docs](https://workbench.aihub.qualcomm.com/docs/) for more information.
101
 
102
 
103
 
precompiled/qualcomm-qcs6490-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:721564bbd831d19ff49da30758f26e915ce7c3c0476fd54e6499978dd418eb2c
3
  size 193518243
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a6def630635b86d43a2e476067fcdde1bf044fedbe5645b13450adab60e99252
3
  size 193518243
precompiled/qualcomm-qcs6490-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3124dd79bfc2734e25871e7c5cb65dda50e4c45b5f24a582dea0de723d22c295
3
  size 102104982
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d7b367ea6e78d5abc4bfa86bf8bd6f2cceb0f5e8383d72a15d675f27d76ca3f0
3
  size 102104982
precompiled/qualcomm-qcs8275-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:087ff30ea3d4ae5fc8972c73d64c1c6d92841fd40d4090955a57e49b2a3a31e5
3
  size 225382400
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:24208db19866047ea5ad231d2836f469283e3f637e8b024ff42be251e6c0bdcf
3
  size 225382400
precompiled/qualcomm-qcs8275-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8b2983626c780cef63de140328a36a7e09d647707c789c90c88fc6ee01b9ac63
3
  size 130682880
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4794729267b57f8d32ce5ba27a3419f395db5c1f02c8d4dbdf3b6f7971004e8d
3
  size 130682880
precompiled/qualcomm-qcs8275-proxy/tool-versions.yaml CHANGED
@@ -1,3 +1,3 @@
1
  tool_versions:
2
  qnn_context_binary:
3
- qairt: 2.39.0.250925215840_163802-auto
 
1
  tool_versions:
2
  qnn_context_binary:
3
+ qairt: 2.40.0.251030114326_189385-auto
precompiled/qualcomm-qcs8550-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a8005aaf78182cc56b538a21e08f610c8ed2629e1817075ff228340c9fecef90
3
  size 225378304
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:10394e71510154ed77feae4dc48450d289685a1eee515fbd5f5655541c908ef8
3
  size 225378304
precompiled/qualcomm-qcs8550-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0c92fbe0d7ada5b3fc7498821d1f12803a2e043b67a01e4e52a4614a8e11b2f5
3
- size 193590896
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fa72357c6814e51f5a9e1ba9f05792c9802706d211d3ec526e68bf54279a00f3
3
+ size 193590918
precompiled/qualcomm-qcs8550-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4a6531b2e2abcc30e755408d424e4326dacad604d34a8b9dd492db844e3f1a14
3
  size 130580480
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:79df9e8e3a94e3923bf5ae03a60bf4622e28bdf60153246a9336867655b9664c
3
  size 130580480
precompiled/qualcomm-qcs8550-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:701e9e0c513e880ab484bea2fb8d8e513984c8421ed8e5cf7dbe82de1eaf6e13
3
  size 93996909
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:01ea1bbf828aa965b3ee8848ac93c53ca78f2126dd9b6f613fec8c3ec55f9ccb
3
  size 93996909
precompiled/qualcomm-qcs9075-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:acd7831a94069a339b2fc1d2571b7a8a42500e0d4a0e2634760edc705b9d7d91
3
  size 225386496
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1843626282087293ceb96dbfe14e5c80689f33b4658c6887092d35b881b0be00
3
  size 225386496
precompiled/qualcomm-qcs9075-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:dee4f3da58574993135c8bbf72e2f35ac3c48b5fdc805cc6f047c2f44c0c8191
3
  size 130678784
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ef0733e4636f075193280ac4d3c1beabf6b5b6b4603369ce89b7cf726fd39028
3
  size 130678784
precompiled/qualcomm-qcs9075-proxy/tool-versions.yaml CHANGED
@@ -1,3 +1,3 @@
1
  tool_versions:
2
  qnn_context_binary:
3
- qairt: 2.39.0.250925215840_163802-auto
 
1
  tool_versions:
2
  qnn_context_binary:
3
+ qairt: 2.40.0.251030114326_189385-auto
precompiled/qualcomm-sa7255p/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:087ff30ea3d4ae5fc8972c73d64c1c6d92841fd40d4090955a57e49b2a3a31e5
3
  size 225382400
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:24208db19866047ea5ad231d2836f469283e3f637e8b024ff42be251e6c0bdcf
3
  size 225382400
precompiled/qualcomm-sa7255p/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8b2983626c780cef63de140328a36a7e09d647707c789c90c88fc6ee01b9ac63
3
  size 130682880
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4794729267b57f8d32ce5ba27a3419f395db5c1f02c8d4dbdf3b6f7971004e8d
3
  size 130682880
precompiled/qualcomm-sa7255p/tool-versions.yaml CHANGED
@@ -1,3 +1,3 @@
1
  tool_versions:
2
  qnn_context_binary:
3
- qairt: 2.39.0.250925215840_163802-auto
 
1
  tool_versions:
2
  qnn_context_binary:
3
+ qairt: 2.40.0.251030114326_189385-auto
precompiled/qualcomm-sa8255p-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a8005aaf78182cc56b538a21e08f610c8ed2629e1817075ff228340c9fecef90
3
  size 225378304
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:10394e71510154ed77feae4dc48450d289685a1eee515fbd5f5655541c908ef8
3
  size 225378304
precompiled/qualcomm-sa8255p-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4a6531b2e2abcc30e755408d424e4326dacad604d34a8b9dd492db844e3f1a14
3
  size 130580480
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:79df9e8e3a94e3923bf5ae03a60bf4622e28bdf60153246a9336867655b9664c
3
  size 130580480
precompiled/qualcomm-sa8255p-proxy/tool-versions.yaml CHANGED
@@ -1,3 +1,3 @@
1
  tool_versions:
2
  qnn_context_binary:
3
- qairt: 2.39.0.250925215840_163802
 
1
  tool_versions:
2
  qnn_context_binary:
3
+ qairt: 2.40.0.251030114326_189385
precompiled/qualcomm-sa8650p-proxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a8005aaf78182cc56b538a21e08f610c8ed2629e1817075ff228340c9fecef90
3
  size 225378304
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:10394e71510154ed77feae4dc48450d289685a1eee515fbd5f5655541c908ef8
3
  size 225378304
precompiled/qualcomm-sa8650p-proxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4a6531b2e2abcc30e755408d424e4326dacad604d34a8b9dd492db844e3f1a14
3
  size 130580480
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:79df9e8e3a94e3923bf5ae03a60bf4622e28bdf60153246a9336867655b9664c
3
  size 130580480
precompiled/qualcomm-sa8650p-proxy/tool-versions.yaml CHANGED
@@ -1,3 +1,3 @@
1
  tool_versions:
2
  qnn_context_binary:
3
- qairt: 2.39.0.250925215840_163802
 
1
  tool_versions:
2
  qnn_context_binary:
3
+ qairt: 2.40.0.251030114326_189385
precompiled/qualcomm-sa8775p/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:acd7831a94069a339b2fc1d2571b7a8a42500e0d4a0e2634760edc705b9d7d91
3
  size 225386496
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1843626282087293ceb96dbfe14e5c80689f33b4658c6887092d35b881b0be00
3
  size 225386496
precompiled/qualcomm-sa8775p/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:dee4f3da58574993135c8bbf72e2f35ac3c48b5fdc805cc6f047c2f44c0c8191
3
  size 130678784
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ef0733e4636f075193280ac4d3c1beabf6b5b6b4603369ce89b7cf726fd39028
3
  size 130678784
precompiled/qualcomm-sa8775p/tool-versions.yaml CHANGED
@@ -1,3 +1,3 @@
1
  tool_versions:
2
  qnn_context_binary:
3
- qairt: 2.39.0.250925215840_163802-auto
 
1
  tool_versions:
2
  qnn_context_binary:
3
+ qairt: 2.40.0.251030114326_189385-auto
precompiled/qualcomm-snapdragon-7gen4/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9c4e5b2c9d14cbd89276b447dafe01c47c469125b1c106581c4975095069d1c0
3
+ size 225513472
precompiled/qualcomm-snapdragon-7gen4/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:df0e117dd6db6f920bcb491eb4f41da9fc257f70cc887ceffd93c766d5046c42
3
+ size 193634989
precompiled/qualcomm-snapdragon-7gen4/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d77d64e8b1b4f3d7c9646f9bbc9131d594ab70a9b85fab3a5e9b6779e5e5023c
3
+ size 145821696
precompiled/qualcomm-snapdragon-7gen4/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f7d1cffbee2dcf771de5ca5d6178a2a3c802f68cfbf0b667a036fb70a4c5eefe
3
+ size 110006845
precompiled/qualcomm-snapdragon-7gen4/tool-versions.yaml ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ tool_versions:
2
+ precompiled_qnn_onnx:
3
+ qairt: 2.37.1.250807093845_124904
4
+ onnx_runtime: 1.23.0
precompiled/qualcomm-snapdragon-8-elite-for-galaxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:618e4089eeeddbd02b095ef303555f31300c55d76f0bf51bfdb2fccfed310c7c
3
  size 225320960
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:25fe4234de1f21dc4f86e5d9e892eaf408f14353c0c1d9d26f46c7c87d2d699c
3
  size 225320960
precompiled/qualcomm-snapdragon-8-elite-for-galaxy/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7273e24b09e3c471c1e21788ccc1b38795dc37967aa864e94a44c47557b55c0f
3
- size 193571490
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:723971bb5edadea6091da623908770b43e1c48af057ec472bc53a43bd79bf92a
3
+ size 193571492
precompiled/qualcomm-snapdragon-8-elite-for-galaxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:aa14e9aeeb5095d5afbb2de195405b4fa188c19cc14c7858fe09334ee0ad5976
3
  size 129683456
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ad0e827e028aa01bc8efd4b4286ec3763f094e580903521a1f4276a3a3d1280d
3
  size 129683456
precompiled/qualcomm-snapdragon-8-elite-for-galaxy/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7a7e1a44a4f730857843a58a1fe92eff458462d6a54ae2f0c641914a6d630878
3
- size 93686218
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:45137e79329b1ab5e466dffcd0947caa148384279b7c7090b854b5fb8213bfa9
3
+ size 93686236
precompiled/qualcomm-snapdragon-8-elite-gen5/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:324a685489d7ad27bf787f7ff18f1c983bc5ec8515011e6bce3e58fae5b28f23
3
  size 225484800
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0ae8c828515e159b8c6d31acdd6df36a40a2511a48bb7a743e54e9f8fbb399ec
3
  size 225484800
precompiled/qualcomm-snapdragon-8-elite-gen5/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5bdc6ca5660501ab1196792d16ee2b369544776f3b63d4b6b1352e8dcc726d09
3
- size 193623884
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:289fd3ff8ea3bfb4aef8f73f4855bca20a45854af4e30ff3a8de3525381dd0f7
3
+ size 193623880
precompiled/qualcomm-snapdragon-8-elite-gen5/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cc5eea5fd9fd79e9b7bdd5160d4864a89a716b6290d2d974a476855bfaba4bae
3
  size 132378624
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:68440264fd98da6dff55960c62385e3e7cf8ecf1bcf151eeab3221dba24d7ff4
3
  size 132378624
precompiled/qualcomm-snapdragon-8-elite-gen5/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0fd02d02ed5433f6dafcb471ff0a4383bcfeeb8a5414630af57683c5dd043c1f
3
- size 94065519
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4988fcad96481e5f3baff38017fd53c55dbd8e7e21f1fa0cede0604d81a3213f
3
+ size 94065502
precompiled/qualcomm-snapdragon-8gen3/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:597497f0d94f70473068440d0038a32f639c46ee64a99ae7a6fbf9b7986f9c49
3
  size 225374208
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2ce55c7368a2022bbef3070e71d2fb52a112f46581b371344cba209081ad7b10
3
  size 225374208
precompiled/qualcomm-snapdragon-8gen3/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:930ac9bd8bc475d55a7096eba85a8101a80d31a24c37fd3de8c747ead3d0018a
3
- size 193588664
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6b0963daf7e027b2d2be8a71d8d7664e7042403f8b193f1310f99dd55934396a
3
+ size 193588769
precompiled/qualcomm-snapdragon-8gen3/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5d4c11759f45fd3cc17df2e06b681c6414448366c887441b361eb18ff36aefd0
3
  size 130445312
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7da541c27e91456e766d6d1caeec55075a9353397d74cd0deaf36ab1db6c2183
3
  size 130445312
precompiled/qualcomm-snapdragon-8gen3/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:028566a7612a8a7d8194cc8512764e888405e982502dd95f39ce977da7efac32
3
- size 93711026
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:13d0dc190113e6efe42b31f7a1032ddcafb8c8961f597453c96bee3f853b79f8
3
+ size 93711016
precompiled/qualcomm-snapdragon-x-elite/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:6a9c3e9d8fd2c387822b37a011314abd5737b31052f7d861fd8799a3c0dd30f2
3
  size 225378304
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2c923fdd7d2606ef14af381b451faf9424ab6084840810b4434168a35533fb5f
3
  size 225378304
precompiled/qualcomm-snapdragon-x-elite/Whisper-Small-Quantized_WhisperSmallDecoderQuantizable_w8a16.onnx.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:85e6bb2b9ba765af4a2fd57d90375d753df6e8e0fd89faafb21ba1eb06c70398
3
- size 193589978
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:941fad6b98fbd8c666fb5535e8cdb53128aea792b9dde85856df06dc1ab3cd83
3
+ size 193589938
precompiled/qualcomm-snapdragon-x-elite/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:937ed38bc43b6baa9fc744327dfe088e30fcf9b55c3399e042f89b9b75ada2ab
3
  size 130580480
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e3c2fd6eaaf562d85a24b60017510a58849597abadc416f7684f0ed7692a6522
3
  size 130580480
precompiled/qualcomm-snapdragon-x-elite/Whisper-Small-Quantized_WhisperSmallEncoderQuantizable_w8a16.onnx.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1e7afc3e2b35515e74b26c94a265a232436bdd5ca8b9896efd7b3091af1484bd
3
  size 93992314
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9490bb9fd417fa6f6009fe2d44462e191c453b4a3e09d3e267e6751f66e55f1b
3
  size 93992314