qaihm-bot commited on
Commit
b6ca9dc
·
verified ·
1 Parent(s): 0362fe4

See https://github.com/qualcomm/ai-hub-models/releases/v0.50.2 for changelog.

Files changed (2) hide show
  1. README.md +100 -100
  2. release_assets.json +1 -1
README.md CHANGED
@@ -29,14 +29,14 @@ Below are pre-exported model assets ready for deployment.
29
 
30
  | Runtime | Precision | Chipset | SDK Versions | Download |
31
  |---|---|---|---|---|
32
- | ONNX | float | Universal | QAIRT 2.42, ONNX Runtime 1.24.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.1/gpunet-onnx-float.zip)
33
- | ONNX | w8a16 | Universal | QAIRT 2.42, ONNX Runtime 1.24.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.1/gpunet-onnx-w8a16.zip)
34
- | ONNX | w8a8 | Universal | QAIRT 2.42, ONNX Runtime 1.24.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.1/gpunet-onnx-w8a8.zip)
35
- | QNN_DLC | float | Universal | QAIRT 2.43 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.1/gpunet-qnn_dlc-float.zip)
36
- | QNN_DLC | w8a16 | Universal | QAIRT 2.43 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.1/gpunet-qnn_dlc-w8a16.zip)
37
- | QNN_DLC | w8a8 | Universal | QAIRT 2.43 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.1/gpunet-qnn_dlc-w8a8.zip)
38
- | TFLITE | float | Universal | QAIRT 2.43, TFLite 2.17.0 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.1/gpunet-tflite-float.zip)
39
- | TFLITE | w8a8 | Universal | QAIRT 2.43, TFLite 2.17.0 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.1/gpunet-tflite-w8a8.zip)
40
 
41
  For more device-specific assets and performance metrics, visit **[GPUNet on Qualcomm® AI Hub](https://aihub.qualcomm.com/models/gpunet)**.
42
 
@@ -66,98 +66,98 @@ See our repository for [GPUNet on GitHub](https://github.com/qualcomm/ai-hub-mod
66
  ## Performance Summary
67
  | Model | Runtime | Precision | Chipset | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit
68
  |---|---|---|---|---|---|---
69
- | GPUNet | ONNX | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.507 ms | 0 - 36 MB | NPU
70
- | GPUNet | ONNX | float | Snapdragon® X2 Elite | 0.477 ms | 24 - 24 MB | NPU
71
- | GPUNet | ONNX | float | Snapdragon® X Elite | 1.138 ms | 24 - 24 MB | NPU
72
- | GPUNet | ONNX | float | Snapdragon® 8 Gen 3 Mobile | 0.78 ms | 0 - 59 MB | NPU
73
- | GPUNet | ONNX | float | Qualcomm® QCS8550 (Proxy) | 1.045 ms | 1 - 3 MB | NPU
74
- | GPUNet | ONNX | float | Qualcomm® QCS9075 | 1.349 ms | 1 - 3 MB | NPU
75
- | GPUNet | ONNX | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.601 ms | 0 - 30 MB | NPU
76
- | GPUNet | ONNX | w8a16 | Snapdragon® 8 Elite Gen 5 Mobile | 0.351 ms | 0 - 49 MB | NPU
77
- | GPUNet | ONNX | w8a16 | Snapdragon® X2 Elite | 0.388 ms | 12 - 12 MB | NPU
78
- | GPUNet | ONNX | w8a16 | Snapdragon® X Elite | 0.974 ms | 12 - 12 MB | NPU
79
- | GPUNet | ONNX | w8a16 | Snapdragon® 8 Gen 3 Mobile | 0.589 ms | 0 - 61 MB | NPU
80
- | GPUNet | ONNX | w8a16 | Qualcomm® QCS6490 | 100.92 ms | 22 - 32 MB | CPU
81
- | GPUNet | ONNX | w8a16 | Qualcomm® QCS8550 (Proxy) | 0.821 ms | 0 - 3 MB | NPU
82
- | GPUNet | ONNX | w8a16 | Qualcomm® QCS9075 | 0.977 ms | 0 - 3 MB | NPU
83
- | GPUNet | ONNX | w8a16 | Qualcomm® QCM6690 | 49.458 ms | 29 - 37 MB | CPU
84
- | GPUNet | ONNX | w8a16 | Snapdragon® 8 Elite For Galaxy Mobile | 0.43 ms | 0 - 42 MB | NPU
85
- | GPUNet | ONNX | w8a16 | Snapdragon® 7 Gen 4 Mobile | 38.092 ms | 29 - 38 MB | CPU
86
- | GPUNet | ONNX | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 0.354 ms | 0 - 48 MB | NPU
87
- | GPUNet | ONNX | w8a8 | Snapdragon® X2 Elite | 0.312 ms | 12 - 12 MB | NPU
88
- | GPUNet | ONNX | w8a8 | Snapdragon® X Elite | 0.722 ms | 12 - 12 MB | NPU
89
- | GPUNet | ONNX | w8a8 | Snapdragon® 8 Gen 3 Mobile | 0.473 ms | 0 - 61 MB | NPU
90
- | GPUNet | ONNX | w8a8 | Qualcomm® QCS6490 | 16.265 ms | 1 - 14 MB | CPU
91
- | GPUNet | ONNX | w8a8 | Qualcomm® QCS8550 (Proxy) | 0.616 ms | 0 - 2 MB | NPU
92
- | GPUNet | ONNX | w8a8 | Qualcomm® QCS9075 | 0.718 ms | 0 - 3 MB | NPU
93
- | GPUNet | ONNX | w8a8 | Qualcomm® QCM6690 | 10.304 ms | 6 - 15 MB | CPU
94
- | GPUNet | ONNX | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 0.393 ms | 0 - 39 MB | NPU
95
- | GPUNet | ONNX | w8a8 | Snapdragon® 7 Gen 4 Mobile | 7.748 ms | 7 - 16 MB | CPU
96
- | GPUNet | QNN_DLC | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.569 ms | 1 - 37 MB | NPU
97
- | GPUNet | QNN_DLC | float | Snapdragon® X2 Elite | 0.651 ms | 1 - 1 MB | NPU
98
- | GPUNet | QNN_DLC | float | Snapdragon® X Elite | 1.437 ms | 1 - 1 MB | NPU
99
- | GPUNet | QNN_DLC | float | Snapdragon® 8 Gen 3 Mobile | 0.922 ms | 0 - 63 MB | NPU
100
- | GPUNet | QNN_DLC | float | Qualcomm® QCS8275 (Proxy) | 4.701 ms | 1 - 33 MB | NPU
101
- | GPUNet | QNN_DLC | float | Qualcomm® QCS8550 (Proxy) | 1.29 ms | 1 - 2 MB | NPU
102
- | GPUNet | QNN_DLC | float | Qualcomm® SA8775P | 1.757 ms | 1 - 36 MB | NPU
103
- | GPUNet | QNN_DLC | float | Qualcomm® QCS9075 | 1.574 ms | 1 - 3 MB | NPU
104
- | GPUNet | QNN_DLC | float | Qualcomm® QCS8450 (Proxy) | 2.412 ms | 0 - 61 MB | NPU
105
- | GPUNet | QNN_DLC | float | Qualcomm® SA7255P | 4.701 ms | 1 - 33 MB | NPU
106
- | GPUNet | QNN_DLC | float | Qualcomm® SA8295P | 2.299 ms | 1 - 34 MB | NPU
107
- | GPUNet | QNN_DLC | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.713 ms | 0 - 36 MB | NPU
108
- | GPUNet | QNN_DLC | w8a16 | Snapdragon® 8 Elite Gen 5 Mobile | 0.44 ms | 0 - 44 MB | NPU
109
- | GPUNet | QNN_DLC | w8a16 | Snapdragon® X2 Elite | 0.569 ms | 0 - 0 MB | NPU
110
- | GPUNet | QNN_DLC | w8a16 | Snapdragon® X Elite | 1.235 ms | 0 - 0 MB | NPU
111
- | GPUNet | QNN_DLC | w8a16 | Snapdragon® 8 Gen 3 Mobile | 0.762 ms | 0 - 58 MB | NPU
112
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS6490 | 3.226 ms | 0 - 2 MB | NPU
113
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS8275 (Proxy) | 2.462 ms | 0 - 42 MB | NPU
114
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS8550 (Proxy) | 1.062 ms | 0 - 2 MB | NPU
115
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® SA8775P | 1.295 ms | 0 - 44 MB | NPU
116
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS9075 | 1.226 ms | 0 - 2 MB | NPU
117
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCM6690 | 6.537 ms | 0 - 159 MB | NPU
118
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS8450 (Proxy) | 1.435 ms | 0 - 60 MB | NPU
119
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® SA7255P | 2.462 ms | 0 - 42 MB | NPU
120
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® SA8295P | 1.715 ms | 0 - 38 MB | NPU
121
- | GPUNet | QNN_DLC | w8a16 | Snapdragon® 8 Elite For Galaxy Mobile | 0.524 ms | 0 - 40 MB | NPU
122
- | GPUNet | QNN_DLC | w8a16 | Snapdragon® 7 Gen 4 Mobile | 1.274 ms | 0 - 42 MB | NPU
123
- | GPUNet | QNN_DLC | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 0.288 ms | 0 - 44 MB | NPU
124
- | GPUNet | QNN_DLC | w8a8 | Snapdragon® X2 Elite | 0.363 ms | 0 - 0 MB | NPU
125
- | GPUNet | QNN_DLC | w8a8 | Snapdragon® X Elite | 0.73 ms | 0 - 0 MB | NPU
126
- | GPUNet | QNN_DLC | w8a8 | Snapdragon® 8 Gen 3 Mobile | 0.458 ms | 0 - 55 MB | NPU
127
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS6490 | 2.001 ms | 2 - 4 MB | NPU
128
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS8275 (Proxy) | 1.407 ms | 0 - 38 MB | NPU
129
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS8550 (Proxy) | 0.612 ms | 0 - 2 MB | NPU
130
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® SA8775P | 0.814 ms | 0 - 42 MB | NPU
131
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS9075 | 0.701 ms | 2 - 4 MB | NPU
132
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCM6690 | 3.53 ms | 0 - 41 MB | NPU
133
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS8450 (Proxy) | 0.844 ms | 0 - 58 MB | NPU
134
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® SA7255P | 1.407 ms | 0 - 38 MB | NPU
135
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® SA8295P | 1.121 ms | 0 - 36 MB | NPU
136
- | GPUNet | QNN_DLC | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 0.346 ms | 0 - 40 MB | NPU
137
- | GPUNet | QNN_DLC | w8a8 | Snapdragon® 7 Gen 4 Mobile | 0.796 ms | 0 - 38 MB | NPU
138
- | GPUNet | TFLITE | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.566 ms | 0 - 67 MB | NPU
139
- | GPUNet | TFLITE | float | Snapdragon® 8 Gen 3 Mobile | 0.899 ms | 0 - 94 MB | NPU
140
- | GPUNet | TFLITE | float | Qualcomm® QCS8275 (Proxy) | 4.632 ms | 0 - 62 MB | NPU
141
- | GPUNet | TFLITE | float | Qualcomm® QCS8550 (Proxy) | 1.279 ms | 0 - 2 MB | NPU
142
- | GPUNet | TFLITE | float | Qualcomm® SA8775P | 1.747 ms | 0 - 65 MB | NPU
143
- | GPUNet | TFLITE | float | Qualcomm® QCS9075 | 1.58 ms | 0 - 27 MB | NPU
144
- | GPUNet | TFLITE | float | Qualcomm® QCS8450 (Proxy) | 2.402 ms | 0 - 94 MB | NPU
145
- | GPUNet | TFLITE | float | Qualcomm® SA7255P | 4.632 ms | 0 - 62 MB | NPU
146
- | GPUNet | TFLITE | float | Qualcomm® SA8295P | 2.26 ms | 0 - 61 MB | NPU
147
- | GPUNet | TFLITE | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.704 ms | 0 - 62 MB | NPU
148
- | GPUNet | TFLITE | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 0.233 ms | 0 - 43 MB | NPU
149
- | GPUNet | TFLITE | w8a8 | Snapdragon® 8 Gen 3 Mobile | 0.343 ms | 0 - 54 MB | NPU
150
- | GPUNet | TFLITE | w8a8 | Qualcomm® QCS6490 | 1.639 ms | 0 - 15 MB | NPU
151
- | GPUNet | TFLITE | w8a8 | Qualcomm® QCS8275 (Proxy) | 1.112 ms | 0 - 37 MB | NPU
152
- | GPUNet | TFLITE | w8a8 | Qualcomm® QCS8550 (Proxy) | 0.424 ms | 0 - 79 MB | NPU
153
- | GPUNet | TFLITE | w8a8 | Qualcomm® SA8775P | 0.641 ms | 0 - 40 MB | NPU
154
- | GPUNet | TFLITE | w8a8 | Qualcomm® QCS9075 | 0.529 ms | 0 - 14 MB | NPU
155
- | GPUNet | TFLITE | w8a8 | Qualcomm® QCM6690 | 3.047 ms | 0 - 38 MB | NPU
156
- | GPUNet | TFLITE | w8a8 | Qualcomm® QCS8450 (Proxy) | 0.686 ms | 0 - 56 MB | NPU
157
- | GPUNet | TFLITE | w8a8 | Qualcomm® SA7255P | 1.112 ms | 0 - 37 MB | NPU
158
- | GPUNet | TFLITE | w8a8 | Qualcomm® SA8295P | 0.93 ms | 0 - 35 MB | NPU
159
- | GPUNet | TFLITE | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 0.273 ms | 0 - 40 MB | NPU
160
- | GPUNet | TFLITE | w8a8 | Snapdragon® 7 Gen 4 Mobile | 0.615 ms | 0 - 37 MB | NPU
161
 
162
  ## License
163
  * The license for the original implementation of GPUNet can be found
 
29
 
30
  | Runtime | Precision | Chipset | SDK Versions | Download |
31
  |---|---|---|---|---|
32
+ | ONNX | float | Universal | QAIRT 2.42, ONNX Runtime 1.24.3 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-onnx-float.zip)
33
+ | ONNX | w8a16 | Universal | QAIRT 2.42, ONNX Runtime 1.24.3 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-onnx-w8a16.zip)
34
+ | ONNX | w8a8 | Universal | QAIRT 2.42, ONNX Runtime 1.24.3 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-onnx-w8a8.zip)
35
+ | QNN_DLC | float | Universal | QAIRT 2.43 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-qnn_dlc-float.zip)
36
+ | QNN_DLC | w8a16 | Universal | QAIRT 2.43 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-qnn_dlc-w8a16.zip)
37
+ | QNN_DLC | w8a8 | Universal | QAIRT 2.43 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-qnn_dlc-w8a8.zip)
38
+ | TFLITE | float | Universal | QAIRT 2.43, TFLite 2.19.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-tflite-float.zip)
39
+ | TFLITE | w8a8 | Universal | QAIRT 2.43, TFLite 2.19.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-tflite-w8a8.zip)
40
 
41
  For more device-specific assets and performance metrics, visit **[GPUNet on Qualcomm® AI Hub](https://aihub.qualcomm.com/models/gpunet)**.
42
 
 
66
  ## Performance Summary
67
  | Model | Runtime | Precision | Chipset | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit
68
  |---|---|---|---|---|---|---
69
+ | GPUNet | ONNX | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.506 ms | 1 - 36 MB | NPU
70
+ | GPUNet | ONNX | float | Snapdragon® X2 Elite | 0.469 ms | 24 - 24 MB | NPU
71
+ | GPUNet | ONNX | float | Snapdragon® X Elite | 1.131 ms | 24 - 24 MB | NPU
72
+ | GPUNet | ONNX | float | Snapdragon® 8 Gen 3 Mobile | 0.792 ms | 0 - 58 MB | NPU
73
+ | GPUNet | ONNX | float | Qualcomm® QCS8550 (Proxy) | 1.045 ms | 1 - 2 MB | NPU
74
+ | GPUNet | ONNX | float | Qualcomm® QCS9075 | 1.354 ms | 1 - 3 MB | NPU
75
+ | GPUNet | ONNX | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.622 ms | 0 - 36 MB | NPU
76
+ | GPUNet | ONNX | w8a16 | Snapdragon® 8 Elite Gen 5 Mobile | 0.358 ms | 0 - 48 MB | NPU
77
+ | GPUNet | ONNX | w8a16 | Snapdragon® X2 Elite | 0.386 ms | 12 - 12 MB | NPU
78
+ | GPUNet | ONNX | w8a16 | Snapdragon® X Elite | 0.986 ms | 12 - 12 MB | NPU
79
+ | GPUNet | ONNX | w8a16 | Snapdragon® 8 Gen 3 Mobile | 0.597 ms | 0 - 62 MB | NPU
80
+ | GPUNet | ONNX | w8a16 | Qualcomm® QCS6490 | 98.939 ms | 23 - 32 MB | CPU
81
+ | GPUNet | ONNX | w8a16 | Qualcomm® QCS8550 (Proxy) | 0.828 ms | 0 - 16 MB | NPU
82
+ | GPUNet | ONNX | w8a16 | Qualcomm® QCS9075 | 0.978 ms | 0 - 3 MB | NPU
83
+ | GPUNet | ONNX | w8a16 | Qualcomm® QCM6690 | 49.574 ms | 30 - 37 MB | CPU
84
+ | GPUNet | ONNX | w8a16 | Snapdragon® 8 Elite For Galaxy Mobile | 0.433 ms | 0 - 50 MB | NPU
85
+ | GPUNet | ONNX | w8a16 | Snapdragon® 7 Gen 4 Mobile | 38.247 ms | 29 - 37 MB | CPU
86
+ | GPUNet | ONNX | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 0.358 ms | 0 - 47 MB | NPU
87
+ | GPUNet | ONNX | w8a8 | Snapdragon® X2 Elite | 0.308 ms | 12 - 12 MB | NPU
88
+ | GPUNet | ONNX | w8a8 | Snapdragon® X Elite | 0.737 ms | 12 - 12 MB | NPU
89
+ | GPUNet | ONNX | w8a8 | Snapdragon® 8 Gen 3 Mobile | 0.473 ms | 0 - 67 MB | NPU
90
+ | GPUNet | ONNX | w8a8 | Qualcomm® QCS6490 | 16.535 ms | 1 - 16 MB | CPU
91
+ | GPUNet | ONNX | w8a8 | Qualcomm® QCS8550 (Proxy) | 0.616 ms | 0 - 15 MB | NPU
92
+ | GPUNet | ONNX | w8a8 | Qualcomm® QCS9075 | 0.729 ms | 0 - 3 MB | NPU
93
+ | GPUNet | ONNX | w8a8 | Qualcomm® QCM6690 | 10.103 ms | 7 - 14 MB | CPU
94
+ | GPUNet | ONNX | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 0.391 ms | 0 - 46 MB | NPU
95
+ | GPUNet | ONNX | w8a8 | Snapdragon® 7 Gen 4 Mobile | 7.718 ms | 7 - 15 MB | CPU
96
+ | GPUNet | QNN_DLC | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.567 ms | 1 - 37 MB | NPU
97
+ | GPUNet | QNN_DLC | float | Snapdragon® X2 Elite | 0.661 ms | 1 - 1 MB | NPU
98
+ | GPUNet | QNN_DLC | float | Snapdragon® X Elite | 1.43 ms | 1 - 1 MB | NPU
99
+ | GPUNet | QNN_DLC | float | Snapdragon® 8 Gen 3 Mobile | 0.926 ms | 1 - 64 MB | NPU
100
+ | GPUNet | QNN_DLC | float | Qualcomm® QCS8275 (Proxy) | 4.647 ms | 1 - 33 MB | NPU
101
+ | GPUNet | QNN_DLC | float | Qualcomm® QCS8550 (Proxy) | 1.29 ms | 1 - 110 MB | NPU
102
+ | GPUNet | QNN_DLC | float | Qualcomm® SA8775P | 1.767 ms | 1 - 36 MB | NPU
103
+ | GPUNet | QNN_DLC | float | Qualcomm® QCS9075 | 1.577 ms | 3 - 5 MB | NPU
104
+ | GPUNet | QNN_DLC | float | Qualcomm® QCS8450 (Proxy) | 2.412 ms | 0 - 64 MB | NPU
105
+ | GPUNet | QNN_DLC | float | Qualcomm® SA7255P | 4.647 ms | 1 - 33 MB | NPU
106
+ | GPUNet | QNN_DLC | float | Qualcomm® SA8295P | 2.3 ms | 1 - 34 MB | NPU
107
+ | GPUNet | QNN_DLC | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.707 ms | 1 - 33 MB | NPU
108
+ | GPUNet | QNN_DLC | w8a16 | Snapdragon® 8 Elite Gen 5 Mobile | 0.442 ms | 0 - 45 MB | NPU
109
+ | GPUNet | QNN_DLC | w8a16 | Snapdragon® X2 Elite | 0.571 ms | 0 - 0 MB | NPU
110
+ | GPUNet | QNN_DLC | w8a16 | Snapdragon® X Elite | 1.252 ms | 0 - 0 MB | NPU
111
+ | GPUNet | QNN_DLC | w8a16 | Snapdragon® 8 Gen 3 Mobile | 0.754 ms | 0 - 58 MB | NPU
112
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS6490 | 3.232 ms | 2 - 4 MB | NPU
113
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS8275 (Proxy) | 2.451 ms | 0 - 42 MB | NPU
114
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS8550 (Proxy) | 1.073 ms | 0 - 2 MB | NPU
115
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® SA8775P | 1.292 ms | 0 - 44 MB | NPU
116
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS9075 | 1.228 ms | 2 - 4 MB | NPU
117
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCM6690 | 6.556 ms | 0 - 159 MB | NPU
118
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS8450 (Proxy) | 1.44 ms | 0 - 61 MB | NPU
119
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® SA7255P | 2.451 ms | 0 - 42 MB | NPU
120
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® SA8295P | 1.715 ms | 0 - 39 MB | NPU
121
+ | GPUNet | QNN_DLC | w8a16 | Snapdragon® 8 Elite For Galaxy Mobile | 0.521 ms | 0 - 45 MB | NPU
122
+ | GPUNet | QNN_DLC | w8a16 | Snapdragon® 7 Gen 4 Mobile | 1.281 ms | 0 - 42 MB | NPU
123
+ | GPUNet | QNN_DLC | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 0.297 ms | 0 - 43 MB | NPU
124
+ | GPUNet | QNN_DLC | w8a8 | Snapdragon® X2 Elite | 0.356 ms | 0 - 0 MB | NPU
125
+ | GPUNet | QNN_DLC | w8a8 | Snapdragon® X Elite | 0.741 ms | 0 - 0 MB | NPU
126
+ | GPUNet | QNN_DLC | w8a8 | Snapdragon® 8 Gen 3 Mobile | 0.461 ms | 0 - 56 MB | NPU
127
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS6490 | 2.01 ms | 2 - 4 MB | NPU
128
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS8275 (Proxy) | 1.398 ms | 0 - 38 MB | NPU
129
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS8550 (Proxy) | 0.603 ms | 0 - 2 MB | NPU
130
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® SA8775P | 0.819 ms | 0 - 42 MB | NPU
131
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS9075 | 0.699 ms | 2 - 4 MB | NPU
132
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCM6690 | 3.444 ms | 0 - 41 MB | NPU
133
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS8450 (Proxy) | 0.857 ms | 0 - 58 MB | NPU
134
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® SA7255P | 1.398 ms | 0 - 38 MB | NPU
135
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® SA8295P | 1.112 ms | 0 - 36 MB | NPU
136
+ | GPUNet | QNN_DLC | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 0.338 ms | 0 - 37 MB | NPU
137
+ | GPUNet | QNN_DLC | w8a8 | Snapdragon® 7 Gen 4 Mobile | 0.791 ms | 0 - 38 MB | NPU
138
+ | GPUNet | TFLITE | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.566 ms | 0 - 66 MB | NPU
139
+ | GPUNet | TFLITE | float | Snapdragon® 8 Gen 3 Mobile | 0.893 ms | 0 - 93 MB | NPU
140
+ | GPUNet | TFLITE | float | Qualcomm® QCS8275 (Proxy) | 4.616 ms | 0 - 62 MB | NPU
141
+ | GPUNet | TFLITE | float | Qualcomm® QCS8550 (Proxy) | 1.287 ms | 0 - 3 MB | NPU
142
+ | GPUNet | TFLITE | float | Qualcomm® SA8775P | 1.763 ms | 0 - 65 MB | NPU
143
+ | GPUNet | TFLITE | float | Qualcomm® QCS9075 | 1.582 ms | 0 - 27 MB | NPU
144
+ | GPUNet | TFLITE | float | Qualcomm® QCS8450 (Proxy) | 2.415 ms | 0 - 94 MB | NPU
145
+ | GPUNet | TFLITE | float | Qualcomm® SA7255P | 4.616 ms | 0 - 62 MB | NPU
146
+ | GPUNet | TFLITE | float | Qualcomm® SA8295P | 2.281 ms | 0 - 61 MB | NPU
147
+ | GPUNet | TFLITE | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.703 ms | 0 - 61 MB | NPU
148
+ | GPUNet | TFLITE | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 0.232 ms | 0 - 45 MB | NPU
149
+ | GPUNet | TFLITE | w8a8 | Snapdragon® 8 Gen 3 Mobile | 0.345 ms | 0 - 56 MB | NPU
150
+ | GPUNet | TFLITE | w8a8 | Qualcomm® QCS6490 | 1.669 ms | 0 - 16 MB | NPU
151
+ | GPUNet | TFLITE | w8a8 | Qualcomm® QCS8275 (Proxy) | 1.108 ms | 0 - 39 MB | NPU
152
+ | GPUNet | TFLITE | w8a8 | Qualcomm® QCS8550 (Proxy) | 0.429 ms | 0 - 7 MB | NPU
153
+ | GPUNet | TFLITE | w8a8 | Qualcomm® SA8775P | 0.633 ms | 0 - 42 MB | NPU
154
+ | GPUNet | TFLITE | w8a8 | Qualcomm® QCS9075 | 0.52 ms | 0 - 14 MB | NPU
155
+ | GPUNet | TFLITE | w8a8 | Qualcomm® QCM6690 | 3.026 ms | 0 - 40 MB | NPU
156
+ | GPUNet | TFLITE | w8a8 | Qualcomm® QCS8450 (Proxy) | 0.698 ms | 0 - 57 MB | NPU
157
+ | GPUNet | TFLITE | w8a8 | Qualcomm® SA7255P | 1.108 ms | 0 - 39 MB | NPU
158
+ | GPUNet | TFLITE | w8a8 | Qualcomm® SA8295P | 0.894 ms | 0 - 37 MB | NPU
159
+ | GPUNet | TFLITE | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 0.266 ms | 0 - 37 MB | NPU
160
+ | GPUNet | TFLITE | w8a8 | Snapdragon® 7 Gen 4 Mobile | 0.619 ms | 0 - 38 MB | NPU
161
 
162
  ## License
163
  * The license for the original implementation of GPUNet can be found
release_assets.json CHANGED
@@ -1 +1 @@
1
- {"version":"0.50.1","precisions":{"float":{"universal_assets":{"tflite":{"tool_versions":{"qairt":"2.43.0.260127150333_193827","tflite":"2.17.0"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.1/gpunet-tflite-float.zip"},"qnn_dlc":{"tool_versions":{"qairt":"2.43.0.260127150333_193827"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.1/gpunet-qnn_dlc-float.zip"},"onnx":{"tool_versions":{"qairt":"2.42.0.251225135753_193295","onnx_runtime":"1.24.1"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.1/gpunet-onnx-float.zip"}}},"w8a16":{"universal_assets":{"qnn_dlc":{"tool_versions":{"qairt":"2.43.0.260127150333_193827"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.1/gpunet-qnn_dlc-w8a16.zip"},"onnx":{"tool_versions":{"qairt":"2.42.0.251225135753_193295","onnx_runtime":"1.24.1"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.1/gpunet-onnx-w8a16.zip"}}},"w8a8":{"universal_assets":{"tflite":{"tool_versions":{"qairt":"2.43.0.260127150333_193827","tflite":"2.17.0"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.1/gpunet-tflite-w8a8.zip"},"qnn_dlc":{"tool_versions":{"qairt":"2.43.0.260127150333_193827"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.1/gpunet-qnn_dlc-w8a8.zip"},"onnx":{"tool_versions":{"qairt":"2.42.0.251225135753_193295","onnx_runtime":"1.24.1"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.1/gpunet-onnx-w8a8.zip"}}}}}
 
1
+ {"version":"0.50.2","precisions":{"float":{"universal_assets":{"onnx":{"tool_versions":{"qairt":"2.42.0.251225135753_193295","onnx_runtime":"1.24.3"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-onnx-float.zip"},"qnn_dlc":{"tool_versions":{"qairt":"2.43.0.260127150333_193827"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-qnn_dlc-float.zip"},"tflite":{"tool_versions":{"qairt":"2.43.0.260127150333_193827","tflite":"2.19.1"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-tflite-float.zip"}}},"w8a16":{"universal_assets":{"onnx":{"tool_versions":{"qairt":"2.42.0.251225135753_193295","onnx_runtime":"1.24.3"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-onnx-w8a16.zip"},"qnn_dlc":{"tool_versions":{"qairt":"2.43.0.260127150333_193827"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-qnn_dlc-w8a16.zip"}}},"w8a8":{"universal_assets":{"onnx":{"tool_versions":{"qairt":"2.42.0.251225135753_193295","onnx_runtime":"1.24.3"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-onnx-w8a8.zip"},"qnn_dlc":{"tool_versions":{"qairt":"2.43.0.260127150333_193827"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-qnn_dlc-w8a8.zip"},"tflite":{"tool_versions":{"qairt":"2.43.0.260127150333_193827","tflite":"2.19.1"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-tflite-w8a8.zip"}}}}}