v0.34.0
See https://github.com/quic/ai-hub-models/releases/v0.34.0 for changelog.
README.md
CHANGED
@@ -25,6 +25,7 @@ More details on model performance across various devices, can be found
 [here](https://aihub.qualcomm.com/models/efficientvit_b2_cls).
 
 
+
 ### Model Details
 
 - **Model Type:** Model_use_case.image_classification
@@ -36,21 +37,21 @@ More details on model performance across various devices, can be found
 
 | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
 |---|---|---|---|---|---|---|---|---|
-| EfficientViT-b2-cls | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE |
+| EfficientViT-b2-cls | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 12.076 ms | 0 - 113 MB | NPU | [EfficientViT-b2-cls.tflite](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.tflite) |
 | EfficientViT-b2-cls | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 12.47 ms | 1 - 61 MB | NPU | [EfficientViT-b2-cls.dlc](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.dlc) |
-| EfficientViT-b2-cls | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 5.
+| EfficientViT-b2-cls | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 5.842 ms | 0 - 117 MB | NPU | [EfficientViT-b2-cls.tflite](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.tflite) |
 | EfficientViT-b2-cls | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC | 6.897 ms | 1 - 69 MB | NPU | [EfficientViT-b2-cls.dlc](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.dlc) |
-| EfficientViT-b2-cls | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE |
+| EfficientViT-b2-cls | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 4.933 ms | 0 - 341 MB | NPU | [EfficientViT-b2-cls.tflite](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.tflite) |
 | EfficientViT-b2-cls | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 5.151 ms | 0 - 16 MB | NPU | [EfficientViT-b2-cls.dlc](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.dlc) |
-| EfficientViT-b2-cls | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 6.
+| EfficientViT-b2-cls | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 6.046 ms | 0 - 112 MB | NPU | [EfficientViT-b2-cls.tflite](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.tflite) |
 | EfficientViT-b2-cls | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 6.616 ms | 1 - 61 MB | NPU | [EfficientViT-b2-cls.dlc](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.dlc) |
-| EfficientViT-b2-cls | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE |
+| EfficientViT-b2-cls | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 4.957 ms | 0 - 365 MB | NPU | [EfficientViT-b2-cls.tflite](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.tflite) |
 | EfficientViT-b2-cls | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN_DLC | 5.255 ms | 0 - 16 MB | NPU | [EfficientViT-b2-cls.dlc](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.dlc) |
 | EfficientViT-b2-cls | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 5.409 ms | 0 - 124 MB | NPU | [EfficientViT-b2-cls.onnx](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.onnx) |
-| EfficientViT-b2-cls | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 3.
+| EfficientViT-b2-cls | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 3.444 ms | 0 - 129 MB | NPU | [EfficientViT-b2-cls.tflite](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.tflite) |
 | EfficientViT-b2-cls | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 3.647 ms | 1 - 76 MB | NPU | [EfficientViT-b2-cls.dlc](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.dlc) |
 | EfficientViT-b2-cls | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 3.71 ms | 0 - 79 MB | NPU | [EfficientViT-b2-cls.onnx](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.onnx) |
-| EfficientViT-b2-cls | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE |
+| EfficientViT-b2-cls | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 2.835 ms | 0 - 116 MB | NPU | [EfficientViT-b2-cls.tflite](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.tflite) |
 | EfficientViT-b2-cls | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_DLC | 3.111 ms | 1 - 65 MB | NPU | [EfficientViT-b2-cls.dlc](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.dlc) |
 | EfficientViT-b2-cls | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 3.252 ms | 1 - 64 MB | NPU | [EfficientViT-b2-cls.onnx](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.onnx) |
 | EfficientViT-b2-cls | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 6.803 ms | 300 - 300 MB | NPU | [EfficientViT-b2-cls.dlc](https://huggingface.co/qualcomm/EfficientViT-b2-cls/blob/main/EfficientViT-b2-cls.dlc) |
@@ -112,17 +113,7 @@ device. This script does the following:
 ```bash
 python -m qai_hub_models.models.efficientvit_b2_cls.export
 ```
-
-Profiling Results
-------------------------------------------------------------
-EfficientViT-b2-cls
-Device : cs_8275 (ANDROID 14)
-Runtime : TFLITE
-Estimated inference time (ms) : 50.9
-Estimated peak memory usage (MB): [0, 112]
-Total # Ops : 379
-Compute Unit(s) : npu (379 ops) gpu (0 ops) cpu (0 ops)
-```
+
 
 
 ## How does this work?
precompiled/qualcomm-snapdragon-x-elite/EfficientViT-b2-cls.onnx.zip
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:45e01226f16de4312da89421c77af355e2124cd2d612f48720af7e32ad367763
+size 45623040
precompiled/qualcomm-snapdragon-x-elite/sdk_versions.yml
ADDED
@@ -0,0 +1,5 @@
+sdk_versions:
+  qnn_context_binary:
+    qairt: 2.34.2.250528164111_119506
+  precompiled_qnn_onnx:
+    qairt: 2.33.2.250410134701_117956