v0.33.0
Browse filesSee https://github.com/quic/ai-hub-models/releases/v0.33.0 for changelog.
- HuggingFace-WavLM-Base-Plus.dlc +2 -2
- HuggingFace-WavLM-Base-Plus.onnx → HuggingFace-WavLM-Base-Plus.onnx.zip +2 -2
- HuggingFace-WavLM-Base-Plus.tflite +2 -2
- README.md +47 -33
- precompiled/qualcomm-snapdragon-x-elite/HuggingFace-WavLM-Base-Plus.bin +3 -0
- precompiled/qualcomm-snapdragon-x-elite/HuggingFace-WavLM-Base-Plus.onnx.zip +3 -0
- precompiled/qualcomm-snapdragon-x-elite/sdk_versions.yml +5 -0
HuggingFace-WavLM-Base-Plus.dlc
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f3348dd9140ea4cfaf316c80e3e4488ee94eeb2807fd26b29c7df0751ec0de7d
|
| 3 |
+
size 390116909
|
HuggingFace-WavLM-Base-Plus.onnx → HuggingFace-WavLM-Base-Plus.onnx.zip
RENAMED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:42ba6dc424ecfb09bdd4ac1c2d8cf0eb2b8611a8fd2c404d6d3d62a8749ad4a9
|
| 3 |
+
size 347488571
|
HuggingFace-WavLM-Base-Plus.tflite
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7a7ad0d4030779843521f056efb207648b812b3dbe25dc20c08f1568e7ad9e58
|
| 3 |
+
size 389713816
|
README.md
CHANGED
|
@@ -35,35 +35,35 @@ More details on model performance across various devices, can be found
|
|
| 35 |
|
| 36 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
| 37 |
|---|---|---|---|---|---|---|---|---|
|
| 38 |
-
| HuggingFace-WavLM-Base-Plus | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE |
|
| 39 |
-
| HuggingFace-WavLM-Base-Plus | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC |
|
| 40 |
-
| HuggingFace-WavLM-Base-Plus | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE |
|
| 41 |
-
| HuggingFace-WavLM-Base-Plus | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC |
|
| 42 |
-
| HuggingFace-WavLM-Base-Plus | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE |
|
| 43 |
-
| HuggingFace-WavLM-Base-Plus | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC |
|
| 44 |
-
| HuggingFace-WavLM-Base-Plus | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE |
|
| 45 |
-
| HuggingFace-WavLM-Base-Plus | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC |
|
| 46 |
-
| HuggingFace-WavLM-Base-Plus | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE |
|
| 47 |
-
| HuggingFace-WavLM-Base-Plus | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC |
|
| 48 |
-
| HuggingFace-WavLM-Base-Plus | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE |
|
| 49 |
-
| HuggingFace-WavLM-Base-Plus | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC |
|
| 50 |
-
| HuggingFace-WavLM-Base-Plus | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE |
|
| 51 |
-
| HuggingFace-WavLM-Base-Plus | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC |
|
| 52 |
-
| HuggingFace-WavLM-Base-Plus | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE |
|
| 53 |
-
| HuggingFace-WavLM-Base-Plus | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC |
|
| 54 |
-
| HuggingFace-WavLM-Base-Plus | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE |
|
| 55 |
-
| HuggingFace-WavLM-Base-Plus | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC |
|
| 56 |
-
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE |
|
| 57 |
-
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN_DLC |
|
| 58 |
-
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX |
|
| 59 |
-
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE |
|
| 60 |
-
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC |
|
| 61 |
-
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX |
|
| 62 |
-
| HuggingFace-WavLM-Base-Plus | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE |
|
| 63 |
-
| HuggingFace-WavLM-Base-Plus | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_DLC |
|
| 64 |
-
| HuggingFace-WavLM-Base-Plus | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX |
|
| 65 |
-
| HuggingFace-WavLM-Base-Plus | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC |
|
| 66 |
-
| HuggingFace-WavLM-Base-Plus | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX |
|
| 67 |
|
| 68 |
|
| 69 |
|
|
@@ -127,10 +127,10 @@ Profiling Results
|
|
| 127 |
HuggingFace-WavLM-Base-Plus
|
| 128 |
Device : cs_8275 (ANDROID 14)
|
| 129 |
Runtime : TFLITE
|
| 130 |
-
Estimated inference time (ms) :
|
| 131 |
-
Estimated peak memory usage (MB): [
|
| 132 |
-
Total # Ops :
|
| 133 |
-
Compute Unit(s) : npu (
|
| 134 |
```
|
| 135 |
|
| 136 |
|
|
@@ -212,6 +212,20 @@ AI Hub. [Sign up for access](https://myaccount.qualcomm.com/signup).
|
|
| 212 |
|
| 213 |
|
| 214 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 215 |
|
| 216 |
## Deploying compiled model to Android
|
| 217 |
|
|
|
|
| 35 |
|
| 36 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
| 37 |
|---|---|---|---|---|---|---|---|---|
|
| 38 |
+
| HuggingFace-WavLM-Base-Plus | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 813.913 ms | 0 - 806 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
| 39 |
+
| HuggingFace-WavLM-Base-Plus | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 872.273 ms | 1 - 827 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
| 40 |
+
| HuggingFace-WavLM-Base-Plus | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 552.123 ms | 0 - 1192 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
| 41 |
+
| HuggingFace-WavLM-Base-Plus | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC | 578.052 ms | 0 - 925 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
| 42 |
+
| HuggingFace-WavLM-Base-Plus | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 259.639 ms | 0 - 55 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
| 43 |
+
| HuggingFace-WavLM-Base-Plus | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 290.655 ms | 3 - 54 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
| 44 |
+
| HuggingFace-WavLM-Base-Plus | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 306.442 ms | 0 - 805 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
| 45 |
+
| HuggingFace-WavLM-Base-Plus | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 324.338 ms | 1 - 826 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
| 46 |
+
| HuggingFace-WavLM-Base-Plus | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 813.913 ms | 0 - 806 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
| 47 |
+
| HuggingFace-WavLM-Base-Plus | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 872.273 ms | 1 - 827 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
| 48 |
+
| HuggingFace-WavLM-Base-Plus | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 259.884 ms | 0 - 57 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
| 49 |
+
| HuggingFace-WavLM-Base-Plus | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 292.19 ms | 0 - 51 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
| 50 |
+
| HuggingFace-WavLM-Base-Plus | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 580.268 ms | 0 - 1100 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
| 51 |
+
| HuggingFace-WavLM-Base-Plus | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 448.219 ms | 1 - 976 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
| 52 |
+
| HuggingFace-WavLM-Base-Plus | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 262.478 ms | 0 - 56 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
| 53 |
+
| HuggingFace-WavLM-Base-Plus | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 291.549 ms | 0 - 50 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
| 54 |
+
| HuggingFace-WavLM-Base-Plus | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 306.442 ms | 0 - 805 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
| 55 |
+
| HuggingFace-WavLM-Base-Plus | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 324.338 ms | 1 - 826 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
| 56 |
+
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 260.269 ms | 0 - 54 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
| 57 |
+
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN_DLC | 293.176 ms | 0 - 50 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
| 58 |
+
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 475.477 ms | 1 - 51 MB | NPU | [HuggingFace-WavLM-Base-Plus.onnx](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.onnx) |
|
| 59 |
+
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 181.862 ms | 0 - 856 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
| 60 |
+
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 216.247 ms | 1 - 1015 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
| 61 |
+
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 348.261 ms | 1 - 961 MB | NPU | [HuggingFace-WavLM-Base-Plus.onnx](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.onnx) |
|
| 62 |
+
| HuggingFace-WavLM-Base-Plus | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 171.769 ms | 0 - 800 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
| 63 |
+
| HuggingFace-WavLM-Base-Plus | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_DLC | 221.424 ms | 0 - 703 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
| 64 |
+
| HuggingFace-WavLM-Base-Plus | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 278.897 ms | 1 - 813 MB | NPU | [HuggingFace-WavLM-Base-Plus.onnx](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.onnx) |
|
| 65 |
+
| HuggingFace-WavLM-Base-Plus | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 313.971 ms | 285 - 285 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
| 66 |
+
| HuggingFace-WavLM-Base-Plus | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 495.288 ms | 205 - 205 MB | NPU | [HuggingFace-WavLM-Base-Plus.onnx](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.onnx) |
|
| 67 |
|
| 68 |
|
| 69 |
|
|
|
|
| 127 |
HuggingFace-WavLM-Base-Plus
|
| 128 |
Device : cs_8275 (ANDROID 14)
|
| 129 |
Runtime : TFLITE
|
| 130 |
+
Estimated inference time (ms) : 813.9
|
| 131 |
+
Estimated peak memory usage (MB): [0, 806]
|
| 132 |
+
Total # Ops : 873
|
| 133 |
+
Compute Unit(s) : npu (873 ops) gpu (0 ops) cpu (0 ops)
|
| 134 |
```
|
| 135 |
|
| 136 |
|
|
|
|
| 212 |
|
| 213 |
|
| 214 |
|
| 215 |
+
## Run demo on a cloud-hosted device
|
| 216 |
+
|
| 217 |
+
You can also run the demo on-device.
|
| 218 |
+
|
| 219 |
+
```bash
|
| 220 |
+
python -m qai_hub_models.models.huggingface_wavlm_base_plus.demo --eval-mode on-device
|
| 221 |
+
```
|
| 222 |
+
|
| 223 |
+
**NOTE**: If you want running in a Jupyter Notebook or Google Colab like
|
| 224 |
+
environment, please add the following to your cell (instead of the above).
|
| 225 |
+
```
|
| 226 |
+
%run -m qai_hub_models.models.huggingface_wavlm_base_plus.demo -- --eval-mode on-device
|
| 227 |
+
```
|
| 228 |
+
|
| 229 |
|
| 230 |
## Deploying compiled model to Android
|
| 231 |
|
precompiled/qualcomm-snapdragon-x-elite/HuggingFace-WavLM-Base-Plus.bin
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:087d6ce3f5a49a4bdad273f4d6ec6b987848802c2bb9a23cf2a6d4c59ee1c47f
|
| 3 |
+
size 213848952
|
precompiled/qualcomm-snapdragon-x-elite/HuggingFace-WavLM-Base-Plus.onnx.zip
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a163065d9ab5a3d372d49f85a4126c367313457874bae16ae38698e093422635
|
| 3 |
+
size 180613788
|
precompiled/qualcomm-snapdragon-x-elite/sdk_versions.yml
ADDED
|
@@ -0,0 +1,5 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
sdk_versions:
|
| 2 |
+
qnn_context_binary:
|
| 3 |
+
qairt: 2.34.2.250528164111_119506
|
| 4 |
+
precompiled_qnn_onnx:
|
| 5 |
+
qairt: 2.33.2.250410134701_117956
|