Commit 21ee65f by qaihm-bot (verified)
1 Parent(s): e2f5068

See https://github.com/quic/ai-hub-models/releases/v0.47.0 for changelog.

Files changed (1)
  1. README.md +28 -25
README.md CHANGED
@@ -30,10 +30,11 @@ Below are pre-exported model assets ready for deployment.
 
  | Runtime | Precision | Chipset | SDK Versions | Download |
  |---|---|---|---|---|
- | ONNX | float | Universal | QAIRT 2.37, ONNX Runtime 1.23.0 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientvit_b2_cls/releases/v0.46.0/efficientvit_b2_cls-onnx-float.zip)
- | ONNX | w8a16 | Universal | QAIRT 2.37, ONNX Runtime 1.23.0 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientvit_b2_cls/releases/v0.46.0/efficientvit_b2_cls-onnx-w8a16.zip)
- | QNN_DLC | float | Universal | QAIRT 2.42 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientvit_b2_cls/releases/v0.46.0/efficientvit_b2_cls-qnn_dlc-float.zip)
- | TFLITE | float | Universal | QAIRT 2.42, TFLite 2.17.0 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientvit_b2_cls/releases/v0.46.0/efficientvit_b2_cls-tflite-float.zip)
+ | ONNX | float | Universal | QAIRT 2.42, ONNX Runtime 1.24.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientvit_b2_cls/releases/v0.47.0/efficientvit_b2_cls-onnx-float.zip)
+ | ONNX | w8a16 | Universal | QAIRT 2.42, ONNX Runtime 1.24.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientvit_b2_cls/releases/v0.47.0/efficientvit_b2_cls-onnx-w8a16.zip)
+ | ONNX | w8a16_mixed_fp16 | Universal | QAIRT 2.42, ONNX Runtime 1.24.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientvit_b2_cls/releases/v0.47.0/efficientvit_b2_cls-onnx-w8a16_mixed_fp16.zip)
+ | QNN_DLC | float | Universal | QAIRT 2.43 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientvit_b2_cls/releases/v0.47.0/efficientvit_b2_cls-qnn_dlc-float.zip)
+ | TFLITE | float | Universal | QAIRT 2.43, TFLite 2.17.0 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientvit_b2_cls/releases/v0.47.0/efficientvit_b2_cls-tflite-float.zip)
 
  For more device-specific assets and performance metrics, visit **[EfficientViT-b2-cls on Qualcomm® AI Hub](https://aihub.qualcomm.com/models/efficientvit_b2_cls)**.
 
@@ -62,27 +63,29 @@ See our repository for [EfficientViT-b2-cls on GitHub](https://github.com/quic/a
  ## Performance Summary
  | Model | Runtime | Precision | Chipset | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit
  |---|---|---|---|---|---|---
- | EfficientViT-b2-cls | ONNX | float | Snapdragon® X Elite | 5.878 ms | 49 - 49 MB | NPU
- | EfficientViT-b2-cls | ONNX | float | Snapdragon® 8 Gen 3 Mobile | 3.748 ms | 0 - 222 MB | NPU
- | EfficientViT-b2-cls | ONNX | float | Qualcomm® QCS8550 (Proxy) | 5.425 ms | 0 - 58 MB | NPU
- | EfficientViT-b2-cls | ONNX | float | Qualcomm® QCS9075 | 5.902 ms | 1 - 4 MB | NPU
- | EfficientViT-b2-cls | ONNX | float | Snapdragon® 8 Elite For Galaxy Mobile | 2.858 ms | 0 - 151 MB | NPU
- | EfficientViT-b2-cls | ONNX | float | Snapdragon® 8 Elite Gen 5 Mobile | 2.466 ms | 0 - 151 MB | NPU
- | EfficientViT-b2-cls | QNN_DLC | float | Snapdragon® X Elite | 6.192 ms | 1 - 1 MB | NPU
- | EfficientViT-b2-cls | QNN_DLC | float | Snapdragon® 8 Gen 3 Mobile | 3.789 ms | 0 - 164 MB | NPU
- | EfficientViT-b2-cls | QNN_DLC | float | Qualcomm® QCS8275 (Proxy) | 12.91 ms | 1 - 90 MB | NPU
- | EfficientViT-b2-cls | QNN_DLC | float | Qualcomm® QCS8550 (Proxy) | 5.425 ms | 1 - 166 MB | NPU
- | EfficientViT-b2-cls | QNN_DLC | float | Qualcomm® QCS9075 | 6.198 ms | 1 - 3 MB | NPU
- | EfficientViT-b2-cls | QNN_DLC | float | Qualcomm® QCS8450 (Proxy) | 7.172 ms | 0 - 165 MB | NPU
- | EfficientViT-b2-cls | QNN_DLC | float | Snapdragon® 8 Elite For Galaxy Mobile | 2.797 ms | 1 - 92 MB | NPU
- | EfficientViT-b2-cls | QNN_DLC | float | Snapdragon® 8 Elite Gen 5 Mobile | 2.335 ms | 1 - 95 MB | NPU
- | EfficientViT-b2-cls | TFLITE | float | Snapdragon® 8 Gen 3 Mobile | 3.795 ms | 0 - 224 MB | NPU
- | EfficientViT-b2-cls | TFLITE | float | Qualcomm® QCS8275 (Proxy) | 12.956 ms | 0 - 148 MB | NPU
- | EfficientViT-b2-cls | TFLITE | float | Qualcomm® QCS8550 (Proxy) | 5.468 ms | 0 - 3 MB | NPU
- | EfficientViT-b2-cls | TFLITE | float | Qualcomm® QCS9075 | 6.169 ms | 0 - 52 MB | NPU
- | EfficientViT-b2-cls | TFLITE | float | Qualcomm® QCS8450 (Proxy) | 7.164 ms | 0 - 226 MB | NPU
- | EfficientViT-b2-cls | TFLITE | float | Snapdragon® 8 Elite For Galaxy Mobile | 2.803 ms | 0 - 140 MB | NPU
- | EfficientViT-b2-cls | TFLITE | float | Snapdragon® 8 Elite Gen 5 Mobile | 2.331 ms | 0 - 142 MB | NPU
+ | EfficientViT-b2-cls | ONNX | float | Snapdragon® X Elite | 5.903 ms | 49 - 49 MB | NPU
+ | EfficientViT-b2-cls | ONNX | float | Snapdragon® 8 Gen 3 Mobile | 3.625 ms | 0 - 181 MB | NPU
+ | EfficientViT-b2-cls | ONNX | float | Qualcomm® QCS8550 (Proxy) | 5.163 ms | 0 - 58 MB | NPU
+ | EfficientViT-b2-cls | ONNX | float | Qualcomm® QCS9075 | 5.828 ms | 1 - 4 MB | NPU
+ | EfficientViT-b2-cls | ONNX | float | Snapdragon® 8 Elite For Galaxy Mobile | 2.693 ms | 0 - 89 MB | NPU
+ | EfficientViT-b2-cls | ONNX | float | Snapdragon® 8 Elite Gen 5 Mobile | 2.272 ms | 0 - 115 MB | NPU
+ | EfficientViT-b2-cls | ONNX | float | Snapdragon® X2 Elite | 2.54 ms | 49 - 49 MB | NPU
+ | EfficientViT-b2-cls | QNN_DLC | float | Snapdragon® X Elite | 5.981 ms | 1 - 1 MB | NPU
+ | EfficientViT-b2-cls | QNN_DLC | float | Snapdragon® 8 Gen 3 Mobile | 3.777 ms | 0 - 164 MB | NPU
+ | EfficientViT-b2-cls | QNN_DLC | float | Qualcomm® QCS8275 (Proxy) | 13.008 ms | 1 - 90 MB | NPU
+ | EfficientViT-b2-cls | QNN_DLC | float | Qualcomm® QCS8550 (Proxy) | 5.352 ms | 1 - 215 MB | NPU
+ | EfficientViT-b2-cls | QNN_DLC | float | Qualcomm® QCS9075 | 6.201 ms | 3 - 5 MB | NPU
+ | EfficientViT-b2-cls | QNN_DLC | float | Qualcomm® QCS8450 (Proxy) | 7.193 ms | 0 - 164 MB | NPU
+ | EfficientViT-b2-cls | QNN_DLC | float | Snapdragon® 8 Elite For Galaxy Mobile | 2.79 ms | 0 - 91 MB | NPU
+ | EfficientViT-b2-cls | QNN_DLC | float | Snapdragon® 8 Elite Gen 5 Mobile | 2.334 ms | 1 - 95 MB | NPU
+ | EfficientViT-b2-cls | QNN_DLC | float | Snapdragon® X2 Elite | 2.961 ms | 1 - 1 MB | NPU
+ | EfficientViT-b2-cls | TFLITE | float | Snapdragon® 8 Gen 3 Mobile | 3.794 ms | 0 - 217 MB | NPU
+ | EfficientViT-b2-cls | TFLITE | float | Qualcomm® QCS8275 (Proxy) | 13.061 ms | 0 - 148 MB | NPU
+ | EfficientViT-b2-cls | TFLITE | float | Qualcomm® QCS8550 (Proxy) | 5.352 ms | 0 - 3 MB | NPU
+ | EfficientViT-b2-cls | TFLITE | float | Qualcomm® QCS9075 | 6.232 ms | 0 - 52 MB | NPU
+ | EfficientViT-b2-cls | TFLITE | float | Qualcomm® QCS8450 (Proxy) | 7.167 ms | 0 - 223 MB | NPU
+ | EfficientViT-b2-cls | TFLITE | float | Snapdragon® 8 Elite For Galaxy Mobile | 2.781 ms | 0 - 153 MB | NPU
+ | EfficientViT-b2-cls | TFLITE | float | Snapdragon® 8 Elite Gen 5 Mobile | 2.337 ms | 0 - 155 MB | NPU
 
  ## License
  * The license for the original implementation of EfficientViT-b2-cls can be found