qaihm-bot committed on
Commit 5f48a93 · verified · 1 Parent(s): b7165e1

See https://github.com/quic/ai-hub-models/releases/v0.47.0 for changelog.

Files changed (1)
  1. README.md +50 -46
README.md CHANGED
@@ -28,13 +28,13 @@ Below are pre-exported model assets ready for deployment.
 
  | Runtime | Precision | Chipset | SDK Versions | Download |
  |---|---|---|---|---|
- | ONNX | float | Universal | QAIRT 2.37, ONNX Runtime 1.23.0 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/levit/releases/v0.46.0/levit-onnx-float.zip)
- | ONNX | w8a16 | Universal | QAIRT 2.37, ONNX Runtime 1.23.0 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/levit/releases/v0.46.0/levit-onnx-w8a16.zip)
- | ONNX | w8a16_mixed_int16 | Universal | QAIRT 2.37, ONNX Runtime 1.23.0 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/levit/releases/v0.46.0/levit-onnx-w8a16_mixed_int16.zip)
- | QNN_DLC | float | Universal | QAIRT 2.42 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/levit/releases/v0.46.0/levit-qnn_dlc-float.zip)
- | QNN_DLC | w8a16 | Universal | QAIRT 2.42 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/levit/releases/v0.46.0/levit-qnn_dlc-w8a16.zip)
- | QNN_DLC | w8a16_mixed_int16 | Universal | QAIRT 2.42 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/levit/releases/v0.46.0/levit-qnn_dlc-w8a16_mixed_int16.zip)
- | TFLITE | float | Universal | QAIRT 2.42, TFLite 2.17.0 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/levit/releases/v0.46.0/levit-tflite-float.zip)
 
  For more device-specific assets and performance metrics, visit **[LeViT on Qualcomm® AI Hub](https://aihub.qualcomm.com/models/levit)**.
 
@@ -64,45 +64,49 @@ See our repository for [LeViT on GitHub](https://github.com/quic/ai-hub-models/b
  ## Performance Summary
  | Model | Runtime | Precision | Chipset | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit
  |---|---|---|---|---|---|---
- | LeViT | ONNX | float | Snapdragon® X Elite | 1.467 ms | 16 - 16 MB | NPU
- | LeViT | ONNX | float | Snapdragon® 8 Gen 3 Mobile | 1.094 ms | 0 - 148 MB | NPU
- | LeViT | ONNX | float | Qualcomm® QCS8550 (Proxy) | 1.51 ms | 0 - 22 MB | NPU
- | LeViT | ONNX | float | Qualcomm® QCS9075 | 1.841 ms | 1 - 3 MB | NPU
- | LeViT | ONNX | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.899 ms | 0 - 124 MB | NPU
- | LeViT | ONNX | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.818 ms | 0 - 123 MB | NPU
- | LeViT | QNN_DLC | float | Snapdragon® X Elite | 1.796 ms | 1 - 1 MB | NPU
- | LeViT | QNN_DLC | float | Snapdragon® 8 Gen 3 Mobile | 1.073 ms | 0 - 82 MB | NPU
- | LeViT | QNN_DLC | float | Qualcomm® QCS8275 (Proxy) | 3.802 ms | 1 - 59 MB | NPU
- | LeViT | QNN_DLC | float | Qualcomm® QCS8550 (Proxy) | 1.556 ms | 1 - 2 MB | NPU
- | LeViT | QNN_DLC | float | Qualcomm® QCS9075 | 1.872 ms | 3 - 5 MB | NPU
- | LeViT | QNN_DLC | float | Qualcomm® QCS8450 (Proxy) | 2.373 ms | 0 - 81 MB | NPU
- | LeViT | QNN_DLC | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.828 ms | 0 - 58 MB | NPU
- | LeViT | QNN_DLC | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.733 ms | 1 - 62 MB | NPU
- | LeViT | QNN_DLC | w8a16 | Snapdragon® X Elite | 1.633 ms | 0 - 0 MB | NPU
- | LeViT | QNN_DLC | w8a16 | Snapdragon® 8 Gen 3 Mobile | 1.005 ms | 0 - 61 MB | NPU
- | LeViT | QNN_DLC | w8a16 | Qualcomm® QCS8275 (Proxy) | 2.968 ms | 0 - 41 MB | NPU
- | LeViT | QNN_DLC | w8a16 | Qualcomm® QCS8550 (Proxy) | 1.426 ms | 0 - 14 MB | NPU
- | LeViT | QNN_DLC | w8a16 | Qualcomm® QCS9075 | 1.725 ms | 0 - 2 MB | NPU
- | LeViT | QNN_DLC | w8a16 | Qualcomm® QCM6690 | 5.551 ms | 0 - 165 MB | NPU
- | LeViT | QNN_DLC | w8a16 | Snapdragon® 8 Elite For Galaxy Mobile | 0.738 ms | 0 - 42 MB | NPU
- | LeViT | QNN_DLC | w8a16 | Snapdragon® 7 Gen 4 Mobile | 1.461 ms | 0 - 39 MB | NPU
- | LeViT | QNN_DLC | w8a16 | Snapdragon® 8 Elite Gen 5 Mobile | 0.634 ms | 0 - 42 MB | NPU
- | LeViT | QNN_DLC | w8a16_mixed_int16 | Snapdragon® X Elite | 1.692 ms | 0 - 0 MB | NPU
- | LeViT | QNN_DLC | w8a16_mixed_int16 | Snapdragon® 8 Gen 3 Mobile | 1.014 ms | 0 - 62 MB | NPU
- | LeViT | QNN_DLC | w8a16_mixed_int16 | Qualcomm® QCS8275 (Proxy) | 3.041 ms | 0 - 40 MB | NPU
- | LeViT | QNN_DLC | w8a16_mixed_int16 | Qualcomm® QCS8550 (Proxy) | 1.465 ms | 0 - 2 MB | NPU
- | LeViT | QNN_DLC | w8a16_mixed_int16 | Qualcomm® QCS9075 | 1.726 ms | 0 - 2 MB | NPU
- | LeViT | QNN_DLC | w8a16_mixed_int16 | Qualcomm® QCM6690 | 5.877 ms | 0 - 166 MB | NPU
- | LeViT | QNN_DLC | w8a16_mixed_int16 | Snapdragon® 8 Elite For Galaxy Mobile | 0.753 ms | 0 - 41 MB | NPU
- | LeViT | QNN_DLC | w8a16_mixed_int16 | Snapdragon® 7 Gen 4 Mobile | 1.496 ms | 0 - 39 MB | NPU
- | LeViT | QNN_DLC | w8a16_mixed_int16 | Snapdragon® 8 Elite Gen 5 Mobile | 0.647 ms | 0 - 41 MB | NPU
- | LeViT | TFLITE | float | Snapdragon® 8 Gen 3 Mobile | 1.035 ms | 0 - 97 MB | NPU
- | LeViT | TFLITE | float | Qualcomm® QCS8275 (Proxy) | 4.035 ms | 0 - 70 MB | NPU
- | LeViT | TFLITE | float | Qualcomm® QCS8550 (Proxy) | 1.514 ms | 0 - 3 MB | NPU
- | LeViT | TFLITE | float | Qualcomm® QCS9075 | 1.883 ms | 0 - 19 MB | NPU
- | LeViT | TFLITE | float | Qualcomm® QCS8450 (Proxy) | 2.348 ms | 0 - 93 MB | NPU
- | LeViT | TFLITE | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.787 ms | 0 - 77 MB | NPU
- | LeViT | TFLITE | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.679 ms | 0 - 78 MB | NPU
 
 
 
 
 
  ## License
  * The license for the original implementation of LeViT can be found
 
 
  | Runtime | Precision | Chipset | SDK Versions | Download |
  |---|---|---|---|---|
+ | ONNX | float | Universal | QAIRT 2.42, ONNX Runtime 1.24.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/levit/releases/v0.47.0/levit-onnx-float.zip)
+ | ONNX | w8a16 | Universal | QAIRT 2.42, ONNX Runtime 1.24.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/levit/releases/v0.47.0/levit-onnx-w8a16.zip)
+ | ONNX | w8a16_mixed_int16 | Universal | QAIRT 2.42, ONNX Runtime 1.24.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/levit/releases/v0.47.0/levit-onnx-w8a16_mixed_int16.zip)
+ | QNN_DLC | float | Universal | QAIRT 2.43 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/levit/releases/v0.47.0/levit-qnn_dlc-float.zip)
+ | QNN_DLC | w8a16 | Universal | QAIRT 2.43 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/levit/releases/v0.47.0/levit-qnn_dlc-w8a16.zip)
+ | QNN_DLC | w8a16_mixed_int16 | Universal | QAIRT 2.43 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/levit/releases/v0.47.0/levit-qnn_dlc-w8a16_mixed_int16.zip)
+ | TFLITE | float | Universal | QAIRT 2.43, TFLite 2.17.0 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/levit/releases/v0.47.0/levit-tflite-float.zip)
 
  For more device-specific assets and performance metrics, visit **[LeViT on Qualcomm® AI Hub](https://aihub.qualcomm.com/models/levit)**.
 
 
  ## Performance Summary
  | Model | Runtime | Precision | Chipset | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit
  |---|---|---|---|---|---|---
+ | LeViT | ONNX | float | Snapdragon® X Elite | 1.462 ms | 16 - 16 MB | NPU
+ | LeViT | ONNX | float | Snapdragon® 8 Gen 3 Mobile | 0.877 ms | 0 - 101 MB | NPU
+ | LeViT | ONNX | float | Qualcomm® QCS8550 (Proxy) | 1.252 ms | 0 - 22 MB | NPU
+ | LeViT | ONNX | float | Qualcomm® QCS9075 | 1.65 ms | 1 - 3 MB | NPU
+ | LeViT | ONNX | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.704 ms | 0 - 67 MB | NPU
+ | LeViT | ONNX | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.653 ms | 1 - 76 MB | NPU
+ | LeViT | ONNX | float | Snapdragon® X2 Elite | 0.69 ms | 16 - 16 MB | NPU
+ | LeViT | QNN_DLC | float | Snapdragon® X Elite | 1.824 ms | 1 - 1 MB | NPU
+ | LeViT | QNN_DLC | float | Snapdragon® 8 Gen 3 Mobile | 1.084 ms | 0 - 83 MB | NPU
+ | LeViT | QNN_DLC | float | Qualcomm® QCS8275 (Proxy) | 3.833 ms | 1 - 58 MB | NPU
+ | LeViT | QNN_DLC | float | Qualcomm® QCS8550 (Proxy) | 1.595 ms | 1 - 2 MB | NPU
+ | LeViT | QNN_DLC | float | Qualcomm® QCS9075 | 1.882 ms | 3 - 5 MB | NPU
+ | LeViT | QNN_DLC | float | Qualcomm® QCS8450 (Proxy) | 2.374 ms | 0 - 80 MB | NPU
+ | LeViT | QNN_DLC | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.847 ms | 0 - 62 MB | NPU
+ | LeViT | QNN_DLC | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.75 ms | 1 - 62 MB | NPU
+ | LeViT | QNN_DLC | float | Snapdragon® X2 Elite | 1.009 ms | 1 - 1 MB | NPU
+ | LeViT | QNN_DLC | w8a16 | Snapdragon® X Elite | 1.673 ms | 0 - 0 MB | NPU
+ | LeViT | QNN_DLC | w8a16 | Snapdragon® 8 Gen 3 Mobile | 1.016 ms | 0 - 61 MB | NPU
+ | LeViT | QNN_DLC | w8a16 | Qualcomm® QCS8275 (Proxy) | 2.98 ms | 0 - 40 MB | NPU
+ | LeViT | QNN_DLC | w8a16 | Qualcomm® QCS8550 (Proxy) | 1.452 ms | 0 - 26 MB | NPU
+ | LeViT | QNN_DLC | w8a16 | Qualcomm® QCS9075 | 1.727 ms | 0 - 2 MB | NPU
+ | LeViT | QNN_DLC | w8a16 | Qualcomm® QCM6690 | 5.836 ms | 0 - 164 MB | NPU
+ | LeViT | QNN_DLC | w8a16 | Snapdragon® 8 Elite For Galaxy Mobile | 0.747 ms | 0 - 41 MB | NPU
+ | LeViT | QNN_DLC | w8a16 | Snapdragon® 7 Gen 4 Mobile | 1.47 ms | 0 - 39 MB | NPU
+ | LeViT | QNN_DLC | w8a16 | Snapdragon® 8 Elite Gen 5 Mobile | 0.639 ms | 0 - 42 MB | NPU
+ | LeViT | QNN_DLC | w8a16 | Snapdragon® X2 Elite | 0.865 ms | 0 - 0 MB | NPU
+ | LeViT | QNN_DLC | w8a16_mixed_int16 | Snapdragon® X Elite | 1.7 ms | 0 - 0 MB | NPU
+ | LeViT | QNN_DLC | w8a16_mixed_int16 | Snapdragon® 8 Gen 3 Mobile | 1.046 ms | 0 - 63 MB | NPU
+ | LeViT | QNN_DLC | w8a16_mixed_int16 | Qualcomm® QCS8275 (Proxy) | 3.027 ms | 0 - 40 MB | NPU
+ | LeViT | QNN_DLC | w8a16_mixed_int16 | Qualcomm® QCS8550 (Proxy) | 1.485 ms | 0 - 2 MB | NPU
+ | LeViT | QNN_DLC | w8a16_mixed_int16 | Qualcomm® QCS9075 | 1.738 ms | 0 - 2 MB | NPU
+ | LeViT | QNN_DLC | w8a16_mixed_int16 | Qualcomm® QCM6690 | 6.101 ms | 0 - 166 MB | NPU
+ | LeViT | QNN_DLC | w8a16_mixed_int16 | Snapdragon® 8 Elite For Galaxy Mobile | 0.753 ms | 0 - 42 MB | NPU
+ | LeViT | QNN_DLC | w8a16_mixed_int16 | Snapdragon® 7 Gen 4 Mobile | 1.507 ms | 0 - 39 MB | NPU
+ | LeViT | QNN_DLC | w8a16_mixed_int16 | Snapdragon® 8 Elite Gen 5 Mobile | 0.649 ms | 0 - 41 MB | NPU
+ | LeViT | QNN_DLC | w8a16_mixed_int16 | Snapdragon® X2 Elite | 0.903 ms | 0 - 0 MB | NPU
+ | LeViT | TFLITE | float | Snapdragon® 8 Gen 3 Mobile | 1.058 ms | 0 - 96 MB | NPU
+ | LeViT | TFLITE | float | Qualcomm® QCS8275 (Proxy) | 4.059 ms | 0 - 66 MB | NPU
+ | LeViT | TFLITE | float | Qualcomm® QCS8550 (Proxy) | 1.556 ms | 0 - 2 MB | NPU
+ | LeViT | TFLITE | float | Qualcomm® QCS9075 | 1.869 ms | 0 - 19 MB | NPU
+ | LeViT | TFLITE | float | Qualcomm® QCS8450 (Proxy) | 2.356 ms | 0 - 83 MB | NPU
+ | LeViT | TFLITE | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.807 ms | 0 - 66 MB | NPU
+ | LeViT | TFLITE | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.679 ms | 0 - 72 MB | NPU
 
  ## License
  * The license for the original implementation of LeViT can be found