v0.42.0
Browse filesSee https://github.com/quic/ai-hub-models/releases/v0.42.0 for changelog.
- ConvNext-Tiny_float.dlc +2 -2
- ConvNext-Tiny_float.onnx.zip +1 -1
- ConvNext-Tiny_w8a16.dlc +2 -2
- ConvNext-Tiny_w8a16.onnx.zip +2 -2
- README.md +57 -54
ConvNext-Tiny_float.dlc
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f1019765d73fa4a58b6749048b01f9e673832790585ec5ef4f689bfdd01c8621
|
| 3 |
+
size 114559492
|
ConvNext-Tiny_float.onnx.zip
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 106407688
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6c59381c6cefccef3bb78f8014a4364c42c978462a136c986ae17c42cbb8f9bd
|
| 3 |
size 106407688
|
ConvNext-Tiny_w8a16.dlc
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:457d26c0b09357502f9b3f921fad8078dccb998dd392a12384f183cbe6bfcaf2
|
| 3 |
+
size 30343892
|
ConvNext-Tiny_w8a16.onnx.zip
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d08dc2dc6975dd1ea3b99b02aa18088129effd8df519e99421a0b7fe408a10fa
|
| 3 |
+
size 95086441
|
README.md
CHANGED
|
@@ -36,56 +36,59 @@ More details on model performance across various devices, can be found
|
|
| 36 |
|
| 37 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
| 38 |
|---|---|---|---|---|---|---|---|---|
|
| 39 |
-
| ConvNext-Tiny | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 14.
|
| 40 |
-
| ConvNext-Tiny | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 15.
|
| 41 |
-
| ConvNext-Tiny | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 7.
|
| 42 |
-
| ConvNext-Tiny | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC | 9.
|
| 43 |
-
| ConvNext-Tiny | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 2.
|
| 44 |
-
| ConvNext-Tiny | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 3.
|
| 45 |
-
| ConvNext-Tiny | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | ONNX | 2.
|
| 46 |
-
| ConvNext-Tiny | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE |
|
| 47 |
-
| ConvNext-Tiny | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC |
|
| 48 |
-
| ConvNext-Tiny | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 14.
|
| 49 |
-
| ConvNext-Tiny | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 15.
|
| 50 |
-
| ConvNext-Tiny | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 2.
|
| 51 |
-
| ConvNext-Tiny | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 3.
|
| 52 |
-
| ConvNext-Tiny | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 8.
|
| 53 |
-
| ConvNext-Tiny | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 9.
|
| 54 |
-
| ConvNext-Tiny | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 2.938 ms | 0 -
|
| 55 |
-
| ConvNext-Tiny | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 3.
|
| 56 |
-
| ConvNext-Tiny | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE |
|
| 57 |
-
| ConvNext-Tiny | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC |
|
| 58 |
-
| ConvNext-Tiny | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 2.
|
| 59 |
-
| ConvNext-Tiny | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 2.
|
| 60 |
-
| ConvNext-Tiny | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 2.
|
| 61 |
-
| ConvNext-Tiny | float | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | TFLITE | 1.678 ms | 0 -
|
| 62 |
-
| ConvNext-Tiny | float | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | QNN_DLC | 1.
|
| 63 |
-
| ConvNext-Tiny | float | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | ONNX | 1.
|
| 64 |
-
| ConvNext-Tiny | float | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | TFLITE | 1.
|
| 65 |
-
| ConvNext-Tiny | float | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | QNN_DLC | 1.
|
| 66 |
-
| ConvNext-Tiny | float | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | ONNX | 1.
|
| 67 |
-
| ConvNext-Tiny | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 3.
|
| 68 |
-
| ConvNext-Tiny | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 2.
|
| 69 |
-
| ConvNext-Tiny | w8a16 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 6.
|
| 70 |
-
| ConvNext-Tiny | w8a16 | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC | 4.
|
| 71 |
-
| ConvNext-Tiny | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 3.
|
| 72 |
-
| ConvNext-Tiny | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | ONNX | 102.
|
| 73 |
-
| ConvNext-Tiny | w8a16 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 3.
|
| 74 |
-
| ConvNext-Tiny | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) |
|
| 75 |
-
| ConvNext-Tiny | w8a16 |
|
| 76 |
-
| ConvNext-Tiny | w8a16 |
|
| 77 |
-
| ConvNext-Tiny | w8a16 |
|
| 78 |
-
| ConvNext-Tiny | w8a16 |
|
| 79 |
-
| ConvNext-Tiny | w8a16 |
|
| 80 |
-
| ConvNext-Tiny | w8a16 |
|
| 81 |
-
| ConvNext-Tiny | w8a16 |
|
| 82 |
-
| ConvNext-Tiny | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile |
|
| 83 |
-
| ConvNext-Tiny | w8a16 | Samsung Galaxy
|
| 84 |
-
| ConvNext-Tiny | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile |
|
| 85 |
-
| ConvNext-Tiny | w8a16 |
|
| 86 |
-
| ConvNext-Tiny | w8a16 | Snapdragon
|
| 87 |
-
| ConvNext-Tiny | w8a16 | Snapdragon
|
| 88 |
-
| ConvNext-Tiny | w8a16 | Snapdragon
|
|
|
|
|
|
|
|
|
|
| 89 |
|
| 90 |
|
| 91 |
|
|
@@ -99,9 +102,9 @@ pip install qai-hub-models
|
|
| 99 |
```
|
| 100 |
|
| 101 |
|
| 102 |
-
## Configure Qualcomm® AI Hub to run this model on a cloud-hosted device
|
| 103 |
|
| 104 |
-
Sign-in to [Qualcomm® AI Hub](https://
|
| 105 |
Qualcomm® ID. Once signed in navigate to `Account -> Settings -> API Token`.
|
| 106 |
|
| 107 |
With this API token, you can configure your client to run models on the cloud
|
|
@@ -109,7 +112,7 @@ hosted devices.
|
|
| 109 |
```bash
|
| 110 |
qai-hub configure --api_token API_TOKEN
|
| 111 |
```
|
| 112 |
-
Navigate to [docs](https://
|
| 113 |
|
| 114 |
|
| 115 |
|
|
@@ -220,7 +223,7 @@ With the output of the model, you can compute like PSNR, relative errors or
|
|
| 220 |
spot check the output with expected output.
|
| 221 |
|
| 222 |
**Note**: This on-device profiling and inference requires access to Qualcomm®
|
| 223 |
-
AI Hub. [Sign up for access](https://myaccount.qualcomm.com/signup).
|
| 224 |
|
| 225 |
|
| 226 |
|
|
|
|
| 36 |
|
| 37 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
| 38 |
|---|---|---|---|---|---|---|---|---|
|
| 39 |
+
| ConvNext-Tiny | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 14.239 ms | 0 - 100 MB | NPU | [ConvNext-Tiny.tflite](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.tflite) |
|
| 40 |
+
| ConvNext-Tiny | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 15.173 ms | 1 - 102 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.dlc) |
|
| 41 |
+
| ConvNext-Tiny | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 7.811 ms | 0 - 112 MB | NPU | [ConvNext-Tiny.tflite](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.tflite) |
|
| 42 |
+
| ConvNext-Tiny | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC | 9.553 ms | 1 - 117 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.dlc) |
|
| 43 |
+
| ConvNext-Tiny | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 2.94 ms | 0 - 442 MB | NPU | [ConvNext-Tiny.tflite](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.tflite) |
|
| 44 |
+
| ConvNext-Tiny | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 3.598 ms | 0 - 47 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.dlc) |
|
| 45 |
+
| ConvNext-Tiny | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | ONNX | 2.92 ms | 0 - 13 MB | NPU | [ConvNext-Tiny.onnx.zip](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.onnx.zip) |
|
| 46 |
+
| ConvNext-Tiny | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 18.973 ms | 0 - 100 MB | NPU | [ConvNext-Tiny.tflite](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.tflite) |
|
| 47 |
+
| ConvNext-Tiny | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 4.98 ms | 1 - 103 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.dlc) |
|
| 48 |
+
| ConvNext-Tiny | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 14.239 ms | 0 - 100 MB | NPU | [ConvNext-Tiny.tflite](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.tflite) |
|
| 49 |
+
| ConvNext-Tiny | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 15.173 ms | 1 - 102 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.dlc) |
|
| 50 |
+
| ConvNext-Tiny | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 2.941 ms | 0 - 441 MB | NPU | [ConvNext-Tiny.tflite](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.tflite) |
|
| 51 |
+
| ConvNext-Tiny | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 3.607 ms | 0 - 16 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.dlc) |
|
| 52 |
+
| ConvNext-Tiny | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 8.925 ms | 0 - 102 MB | NPU | [ConvNext-Tiny.tflite](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.tflite) |
|
| 53 |
+
| ConvNext-Tiny | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 9.278 ms | 1 - 107 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.dlc) |
|
| 54 |
+
| ConvNext-Tiny | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 2.938 ms | 0 - 436 MB | NPU | [ConvNext-Tiny.tflite](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.tflite) |
|
| 55 |
+
| ConvNext-Tiny | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 3.603 ms | 0 - 49 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.dlc) |
|
| 56 |
+
| ConvNext-Tiny | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 18.973 ms | 0 - 100 MB | NPU | [ConvNext-Tiny.tflite](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.tflite) |
|
| 57 |
+
| ConvNext-Tiny | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 4.98 ms | 1 - 103 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.dlc) |
|
| 58 |
+
| ConvNext-Tiny | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 2.221 ms | 0 - 107 MB | NPU | [ConvNext-Tiny.tflite](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.tflite) |
|
| 59 |
+
| ConvNext-Tiny | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 2.613 ms | 1 - 109 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.dlc) |
|
| 60 |
+
| ConvNext-Tiny | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 2.17 ms | 0 - 109 MB | NPU | [ConvNext-Tiny.onnx.zip](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.onnx.zip) |
|
| 61 |
+
| ConvNext-Tiny | float | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | TFLITE | 1.678 ms | 0 - 106 MB | NPU | [ConvNext-Tiny.tflite](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.tflite) |
|
| 62 |
+
| ConvNext-Tiny | float | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | QNN_DLC | 1.982 ms | 1 - 109 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.dlc) |
|
| 63 |
+
| ConvNext-Tiny | float | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | ONNX | 1.808 ms | 0 - 104 MB | NPU | [ConvNext-Tiny.onnx.zip](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.onnx.zip) |
|
| 64 |
+
| ConvNext-Tiny | float | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | TFLITE | 1.395 ms | 0 - 105 MB | NPU | [ConvNext-Tiny.tflite](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.tflite) |
|
| 65 |
+
| ConvNext-Tiny | float | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | QNN_DLC | 1.605 ms | 0 - 108 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.dlc) |
|
| 66 |
+
| ConvNext-Tiny | float | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | ONNX | 1.462 ms | 0 - 103 MB | NPU | [ConvNext-Tiny.onnx.zip](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.onnx.zip) |
|
| 67 |
+
| ConvNext-Tiny | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 3.856 ms | 485 - 485 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.dlc) |
|
| 68 |
+
| ConvNext-Tiny | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 2.898 ms | 57 - 57 MB | NPU | [ConvNext-Tiny.onnx.zip](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny.onnx.zip) |
|
| 69 |
+
| ConvNext-Tiny | w8a16 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 6.782 ms | 0 - 62 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.dlc) |
|
| 70 |
+
| ConvNext-Tiny | w8a16 | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC | 4.251 ms | 0 - 70 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.dlc) |
|
| 71 |
+
| ConvNext-Tiny | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 3.083 ms | 0 - 13 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.dlc) |
|
| 72 |
+
| ConvNext-Tiny | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | ONNX | 102.645 ms | 33 - 81 MB | NPU | [ConvNext-Tiny.onnx.zip](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.onnx.zip) |
|
| 73 |
+
| ConvNext-Tiny | w8a16 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 3.455 ms | 0 - 62 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.dlc) |
|
| 74 |
+
| ConvNext-Tiny | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | QNN_DLC | 13.014 ms | 0 - 88 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.dlc) |
|
| 75 |
+
| ConvNext-Tiny | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | ONNX | 236.81 ms | 57 - 73 MB | CPU | [ConvNext-Tiny.onnx.zip](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.onnx.zip) |
|
| 76 |
+
| ConvNext-Tiny | w8a16 | RB5 (Proxy) | Qualcomm® QCS8250 (Proxy) | ONNX | 194.224 ms | 51 - 63 MB | CPU | [ConvNext-Tiny.onnx.zip](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.onnx.zip) |
|
| 77 |
+
| ConvNext-Tiny | w8a16 | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 6.782 ms | 0 - 62 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.dlc) |
|
| 78 |
+
| ConvNext-Tiny | w8a16 | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 3.143 ms | 0 - 16 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.dlc) |
|
| 79 |
+
| ConvNext-Tiny | w8a16 | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 4.638 ms | 0 - 68 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.dlc) |
|
| 80 |
+
| ConvNext-Tiny | w8a16 | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 3.07 ms | 0 - 16 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.dlc) |
|
| 81 |
+
| ConvNext-Tiny | w8a16 | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 3.455 ms | 0 - 62 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.dlc) |
|
| 82 |
+
| ConvNext-Tiny | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 2.163 ms | 0 - 77 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.dlc) |
|
| 83 |
+
| ConvNext-Tiny | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 109.487 ms | 48 - 80 MB | NPU | [ConvNext-Tiny.onnx.zip](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.onnx.zip) |
|
| 84 |
+
| ConvNext-Tiny | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | QNN_DLC | 1.564 ms | 0 - 67 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.dlc) |
|
| 85 |
+
| ConvNext-Tiny | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | ONNX | 77.661 ms | 51 - 76 MB | NPU | [ConvNext-Tiny.onnx.zip](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.onnx.zip) |
|
| 86 |
+
| ConvNext-Tiny | w8a16 | Snapdragon 7 Gen 4 QRD | Snapdragon® 7 Gen 4 Mobile | QNN_DLC | 3.473 ms | 0 - 77 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.dlc) |
|
| 87 |
+
| ConvNext-Tiny | w8a16 | Snapdragon 7 Gen 4 QRD | Snapdragon® 7 Gen 4 Mobile | ONNX | 231.915 ms | 60 - 77 MB | CPU | [ConvNext-Tiny.onnx.zip](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.onnx.zip) |
|
| 88 |
+
| ConvNext-Tiny | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | QNN_DLC | 1.282 ms | 0 - 68 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.dlc) |
|
| 89 |
+
| ConvNext-Tiny | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | ONNX | 71.555 ms | 47 - 71 MB | NPU | [ConvNext-Tiny.onnx.zip](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.onnx.zip) |
|
| 90 |
+
| ConvNext-Tiny | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 3.369 ms | 165 - 165 MB | NPU | [ConvNext-Tiny.dlc](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.dlc) |
|
| 91 |
+
| ConvNext-Tiny | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 64.124 ms | 62 - 62 MB | NPU | [ConvNext-Tiny.onnx.zip](https://huggingface.co/qualcomm/ConvNext-Tiny/blob/main/ConvNext-Tiny_w8a16.onnx.zip) |
|
| 92 |
|
| 93 |
|
| 94 |
|
|
|
|
| 102 |
```
|
| 103 |
|
| 104 |
|
| 105 |
+
## Configure Qualcomm® AI Hub Workbench to run this model on a cloud-hosted device
|
| 106 |
|
| 107 |
+
Sign-in to [Qualcomm® AI Hub Workbench](https://workbench.aihub.qualcomm.com/) with your
|
| 108 |
Qualcomm® ID. Once signed in navigate to `Account -> Settings -> API Token`.
|
| 109 |
|
| 110 |
With this API token, you can configure your client to run models on the cloud
|
|
|
|
| 112 |
```bash
|
| 113 |
qai-hub configure --api_token API_TOKEN
|
| 114 |
```
|
| 115 |
+
Navigate to [docs](https://workbench.aihub.qualcomm.com/docs/) for more information.
|
| 116 |
|
| 117 |
|
| 118 |
|
|
|
|
| 223 |
spot check the output with expected output.
|
| 224 |
|
| 225 |
**Note**: This on-device profiling and inference requires access to Qualcomm®
|
| 226 |
+
AI Hub Workbench. [Sign up for access](https://myaccount.qualcomm.com/signup).
|
| 227 |
|
| 228 |
|
| 229 |
|