v0.47.0
Browse filesSee https://github.com/quic/ai-hub-models/releases/v0.47.0 for changelog.
README.md
CHANGED
|
@@ -30,10 +30,11 @@ Below are pre-exported model assets ready for deployment.
|
|
| 30 |
|
| 31 |
| Runtime | Precision | Chipset | SDK Versions | Download |
|
| 32 |
|---|---|---|---|---|
|
| 33 |
-
| ONNX | float | Universal | QAIRT 2.
|
| 34 |
-
| ONNX | w8a16 | Universal | QAIRT 2.
|
| 35 |
-
|
|
| 36 |
-
|
|
|
|
|
| 37 |
|
| 38 |
For more device-specific assets and performance metrics, visit **[EfficientViT-b2-cls on Qualcomm® AI Hub](https://aihub.qualcomm.com/models/efficientvit_b2_cls)**.
|
| 39 |
|
|
@@ -62,27 +63,29 @@ See our repository for [EfficientViT-b2-cls on GitHub](https://github.com/quic/a
|
|
| 62 |
## Performance Summary
|
| 63 |
| Model | Runtime | Precision | Chipset | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit
|
| 64 |
|---|---|---|---|---|---|---
|
| 65 |
-
| EfficientViT-b2-cls | ONNX | float | Snapdragon® X Elite | 5.
|
| 66 |
-
| EfficientViT-b2-cls | ONNX | float | Snapdragon® 8 Gen 3 Mobile | 3.
|
| 67 |
-
| EfficientViT-b2-cls | ONNX | float | Qualcomm® QCS8550 (Proxy) | 5.
|
| 68 |
-
| EfficientViT-b2-cls | ONNX | float | Qualcomm® QCS9075 | 5.
|
| 69 |
-
| EfficientViT-b2-cls | ONNX | float | Snapdragon® 8 Elite For Galaxy Mobile | 2.
|
| 70 |
-
| EfficientViT-b2-cls | ONNX | float | Snapdragon® 8 Elite Gen 5 Mobile | 2.
|
| 71 |
-
| EfficientViT-b2-cls |
|
| 72 |
-
| EfficientViT-b2-cls | QNN_DLC | float | Snapdragon®
|
| 73 |
-
| EfficientViT-b2-cls | QNN_DLC | float |
|
| 74 |
-
| EfficientViT-b2-cls | QNN_DLC | float | Qualcomm®
|
| 75 |
-
| EfficientViT-b2-cls | QNN_DLC | float | Qualcomm®
|
| 76 |
-
| EfficientViT-b2-cls | QNN_DLC | float | Qualcomm®
|
| 77 |
-
| EfficientViT-b2-cls | QNN_DLC | float |
|
| 78 |
-
| EfficientViT-b2-cls | QNN_DLC | float | Snapdragon® 8 Elite
|
| 79 |
-
| EfficientViT-b2-cls |
|
| 80 |
-
| EfficientViT-b2-cls |
|
| 81 |
-
| EfficientViT-b2-cls | TFLITE | float |
|
| 82 |
-
| EfficientViT-b2-cls | TFLITE | float | Qualcomm®
|
| 83 |
-
| EfficientViT-b2-cls | TFLITE | float | Qualcomm®
|
| 84 |
-
| EfficientViT-b2-cls | TFLITE | float |
|
| 85 |
-
| EfficientViT-b2-cls | TFLITE | float |
|
|
|
|
|
|
|
| 86 |
|
| 87 |
## License
|
| 88 |
* The license for the original implementation of EfficientViT-b2-cls can be found
|
|
|
|
| 30 |
|
| 31 |
| Runtime | Precision | Chipset | SDK Versions | Download |
|
| 32 |
|---|---|---|---|---|
|
| 33 |
+
| ONNX | float | Universal | QAIRT 2.42, ONNX Runtime 1.24.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientvit_b2_cls/releases/v0.47.0/efficientvit_b2_cls-onnx-float.zip)
|
| 34 |
+
| ONNX | w8a16 | Universal | QAIRT 2.42, ONNX Runtime 1.24.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientvit_b2_cls/releases/v0.47.0/efficientvit_b2_cls-onnx-w8a16.zip)
|
| 35 |
+
| ONNX | w8a16_mixed_fp16 | Universal | QAIRT 2.42, ONNX Runtime 1.24.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientvit_b2_cls/releases/v0.47.0/efficientvit_b2_cls-onnx-w8a16_mixed_fp16.zip)
|
| 36 |
+
| QNN_DLC | float | Universal | QAIRT 2.43 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientvit_b2_cls/releases/v0.47.0/efficientvit_b2_cls-qnn_dlc-float.zip)
|
| 37 |
+
| TFLITE | float | Universal | QAIRT 2.43, TFLite 2.17.0 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientvit_b2_cls/releases/v0.47.0/efficientvit_b2_cls-tflite-float.zip)
|
| 38 |
|
| 39 |
For more device-specific assets and performance metrics, visit **[EfficientViT-b2-cls on Qualcomm® AI Hub](https://aihub.qualcomm.com/models/efficientvit_b2_cls)**.
|
| 40 |
|
|
|
|
| 63 |
## Performance Summary
|
| 64 |
| Model | Runtime | Precision | Chipset | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit
|
| 65 |
|---|---|---|---|---|---|---
|
| 66 |
+
| EfficientViT-b2-cls | ONNX | float | Snapdragon® X Elite | 5.903 ms | 49 - 49 MB | NPU
|
| 67 |
+
| EfficientViT-b2-cls | ONNX | float | Snapdragon® 8 Gen 3 Mobile | 3.625 ms | 0 - 181 MB | NPU
|
| 68 |
+
| EfficientViT-b2-cls | ONNX | float | Qualcomm® QCS8550 (Proxy) | 5.163 ms | 0 - 58 MB | NPU
|
| 69 |
+
| EfficientViT-b2-cls | ONNX | float | Qualcomm® QCS9075 | 5.828 ms | 1 - 4 MB | NPU
|
| 70 |
+
| EfficientViT-b2-cls | ONNX | float | Snapdragon® 8 Elite For Galaxy Mobile | 2.693 ms | 0 - 89 MB | NPU
|
| 71 |
+
| EfficientViT-b2-cls | ONNX | float | Snapdragon® 8 Elite Gen 5 Mobile | 2.272 ms | 0 - 115 MB | NPU
|
| 72 |
+
| EfficientViT-b2-cls | ONNX | float | Snapdragon® X2 Elite | 2.54 ms | 49 - 49 MB | NPU
|
| 73 |
+
| EfficientViT-b2-cls | QNN_DLC | float | Snapdragon® X Elite | 5.981 ms | 1 - 1 MB | NPU
|
| 74 |
+
| EfficientViT-b2-cls | QNN_DLC | float | Snapdragon® 8 Gen 3 Mobile | 3.777 ms | 0 - 164 MB | NPU
|
| 75 |
+
| EfficientViT-b2-cls | QNN_DLC | float | Qualcomm® QCS8275 (Proxy) | 13.008 ms | 1 - 90 MB | NPU
|
| 76 |
+
| EfficientViT-b2-cls | QNN_DLC | float | Qualcomm® QCS8550 (Proxy) | 5.352 ms | 1 - 215 MB | NPU
|
| 77 |
+
| EfficientViT-b2-cls | QNN_DLC | float | Qualcomm® QCS9075 | 6.201 ms | 3 - 5 MB | NPU
|
| 78 |
+
| EfficientViT-b2-cls | QNN_DLC | float | Qualcomm® QCS8450 (Proxy) | 7.193 ms | 0 - 164 MB | NPU
|
| 79 |
+
| EfficientViT-b2-cls | QNN_DLC | float | Snapdragon® 8 Elite For Galaxy Mobile | 2.79 ms | 0 - 91 MB | NPU
|
| 80 |
+
| EfficientViT-b2-cls | QNN_DLC | float | Snapdragon® 8 Elite Gen 5 Mobile | 2.334 ms | 1 - 95 MB | NPU
|
| 81 |
+
| EfficientViT-b2-cls | QNN_DLC | float | Snapdragon® X2 Elite | 2.961 ms | 1 - 1 MB | NPU
|
| 82 |
+
| EfficientViT-b2-cls | TFLITE | float | Snapdragon® 8 Gen 3 Mobile | 3.794 ms | 0 - 217 MB | NPU
|
| 83 |
+
| EfficientViT-b2-cls | TFLITE | float | Qualcomm® QCS8275 (Proxy) | 13.061 ms | 0 - 148 MB | NPU
|
| 84 |
+
| EfficientViT-b2-cls | TFLITE | float | Qualcomm® QCS8550 (Proxy) | 5.352 ms | 0 - 3 MB | NPU
|
| 85 |
+
| EfficientViT-b2-cls | TFLITE | float | Qualcomm® QCS9075 | 6.232 ms | 0 - 52 MB | NPU
|
| 86 |
+
| EfficientViT-b2-cls | TFLITE | float | Qualcomm® QCS8450 (Proxy) | 7.167 ms | 0 - 223 MB | NPU
|
| 87 |
+
| EfficientViT-b2-cls | TFLITE | float | Snapdragon® 8 Elite For Galaxy Mobile | 2.781 ms | 0 - 153 MB | NPU
|
| 88 |
+
| EfficientViT-b2-cls | TFLITE | float | Snapdragon® 8 Elite Gen 5 Mobile | 2.337 ms | 0 - 155 MB | NPU
|
| 89 |
|
| 90 |
## License
|
| 91 |
* The license for the original implementation of EfficientViT-b2-cls can be found
|