qaihm-bot commited on
Commit
bcfe575
·
verified ·
1 Parent(s): 675db58

See https://github.com/qualcomm/ai-hub-models/releases/v0.51.0 for changelog.

Files changed (2) hide show
  1. README.md +100 -100
  2. release_assets.json +70 -1
README.md CHANGED
@@ -29,14 +29,14 @@ Below are pre-exported model assets ready for deployment.
29
 
30
  | Runtime | Precision | Chipset | SDK Versions | Download |
31
  |---|---|---|---|---|
32
- | ONNX | float | Universal | QAIRT 2.42, ONNX Runtime 1.24.3 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-onnx-float.zip)
33
- | ONNX | w8a16 | Universal | QAIRT 2.42, ONNX Runtime 1.24.3 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-onnx-w8a16.zip)
34
- | ONNX | w8a8 | Universal | QAIRT 2.42, ONNX Runtime 1.24.3 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-onnx-w8a8.zip)
35
- | QNN_DLC | float | Universal | QAIRT 2.43 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-qnn_dlc-float.zip)
36
- | QNN_DLC | w8a16 | Universal | QAIRT 2.43 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-qnn_dlc-w8a16.zip)
37
- | QNN_DLC | w8a8 | Universal | QAIRT 2.43 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-qnn_dlc-w8a8.zip)
38
- | TFLITE | float | Universal | QAIRT 2.43, TFLite 2.19.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-tflite-float.zip)
39
- | TFLITE | w8a8 | Universal | QAIRT 2.43, TFLite 2.19.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-tflite-w8a8.zip)
40
 
41
  For more device-specific assets and performance metrics, visit **[GPUNet on Qualcomm® AI Hub](https://aihub.qualcomm.com/models/gpunet)**.
42
 
@@ -66,98 +66,98 @@ See our repository for [GPUNet on GitHub](https://github.com/qualcomm/ai-hub-mod
66
  ## Performance Summary
67
  | Model | Runtime | Precision | Chipset | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit
68
  |---|---|---|---|---|---|---
69
- | GPUNet | ONNX | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.506 ms | 1 - 36 MB | NPU
70
- | GPUNet | ONNX | float | Snapdragon® X2 Elite | 0.469 ms | 24 - 24 MB | NPU
71
- | GPUNet | ONNX | float | Snapdragon® X Elite | 1.131 ms | 24 - 24 MB | NPU
72
- | GPUNet | ONNX | float | Snapdragon® 8 Gen 3 Mobile | 0.792 ms | 0 - 58 MB | NPU
73
- | GPUNet | ONNX | float | Qualcomm® QCS8550 (Proxy) | 1.045 ms | 1 - 2 MB | NPU
74
- | GPUNet | ONNX | float | Qualcomm® QCS9075 | 1.354 ms | 1 - 3 MB | NPU
75
- | GPUNet | ONNX | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.622 ms | 0 - 36 MB | NPU
76
- | GPUNet | ONNX | w8a16 | Snapdragon® 8 Elite Gen 5 Mobile | 0.358 ms | 0 - 48 MB | NPU
77
- | GPUNet | ONNX | w8a16 | Snapdragon® X2 Elite | 0.386 ms | 12 - 12 MB | NPU
78
- | GPUNet | ONNX | w8a16 | Snapdragon® X Elite | 0.986 ms | 12 - 12 MB | NPU
79
- | GPUNet | ONNX | w8a16 | Snapdragon® 8 Gen 3 Mobile | 0.597 ms | 0 - 62 MB | NPU
80
- | GPUNet | ONNX | w8a16 | Qualcomm® QCS6490 | 98.939 ms | 23 - 32 MB | CPU
81
- | GPUNet | ONNX | w8a16 | Qualcomm® QCS8550 (Proxy) | 0.828 ms | 0 - 16 MB | NPU
82
- | GPUNet | ONNX | w8a16 | Qualcomm® QCS9075 | 0.978 ms | 0 - 3 MB | NPU
83
- | GPUNet | ONNX | w8a16 | Qualcomm® QCM6690 | 49.574 ms | 30 - 37 MB | CPU
84
- | GPUNet | ONNX | w8a16 | Snapdragon® 8 Elite For Galaxy Mobile | 0.433 ms | 0 - 50 MB | NPU
85
- | GPUNet | ONNX | w8a16 | Snapdragon® 7 Gen 4 Mobile | 38.247 ms | 29 - 37 MB | CPU
86
- | GPUNet | ONNX | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 0.358 ms | 0 - 47 MB | NPU
87
- | GPUNet | ONNX | w8a8 | Snapdragon® X2 Elite | 0.308 ms | 12 - 12 MB | NPU
88
- | GPUNet | ONNX | w8a8 | Snapdragon® X Elite | 0.737 ms | 12 - 12 MB | NPU
89
- | GPUNet | ONNX | w8a8 | Snapdragon® 8 Gen 3 Mobile | 0.473 ms | 0 - 67 MB | NPU
90
- | GPUNet | ONNX | w8a8 | Qualcomm® QCS6490 | 16.535 ms | 1 - 16 MB | CPU
91
- | GPUNet | ONNX | w8a8 | Qualcomm® QCS8550 (Proxy) | 0.616 ms | 0 - 15 MB | NPU
92
- | GPUNet | ONNX | w8a8 | Qualcomm® QCS9075 | 0.729 ms | 0 - 3 MB | NPU
93
- | GPUNet | ONNX | w8a8 | Qualcomm® QCM6690 | 10.103 ms | 7 - 14 MB | CPU
94
- | GPUNet | ONNX | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 0.391 ms | 0 - 46 MB | NPU
95
- | GPUNet | ONNX | w8a8 | Snapdragon® 7 Gen 4 Mobile | 7.718 ms | 7 - 15 MB | CPU
96
- | GPUNet | QNN_DLC | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.567 ms | 1 - 37 MB | NPU
97
- | GPUNet | QNN_DLC | float | Snapdragon® X2 Elite | 0.661 ms | 1 - 1 MB | NPU
98
- | GPUNet | QNN_DLC | float | Snapdragon® X Elite | 1.43 ms | 1 - 1 MB | NPU
99
- | GPUNet | QNN_DLC | float | Snapdragon® 8 Gen 3 Mobile | 0.926 ms | 1 - 64 MB | NPU
100
- | GPUNet | QNN_DLC | float | Qualcomm® QCS8275 (Proxy) | 4.647 ms | 1 - 33 MB | NPU
101
- | GPUNet | QNN_DLC | float | Qualcomm® QCS8550 (Proxy) | 1.29 ms | 1 - 110 MB | NPU
102
- | GPUNet | QNN_DLC | float | Qualcomm® SA8775P | 1.767 ms | 1 - 36 MB | NPU
103
- | GPUNet | QNN_DLC | float | Qualcomm® QCS9075 | 1.577 ms | 3 - 5 MB | NPU
104
- | GPUNet | QNN_DLC | float | Qualcomm® QCS8450 (Proxy) | 2.412 ms | 0 - 64 MB | NPU
105
- | GPUNet | QNN_DLC | float | Qualcomm® SA7255P | 4.647 ms | 1 - 33 MB | NPU
106
- | GPUNet | QNN_DLC | float | Qualcomm® SA8295P | 2.3 ms | 1 - 34 MB | NPU
107
- | GPUNet | QNN_DLC | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.707 ms | 1 - 33 MB | NPU
108
- | GPUNet | QNN_DLC | w8a16 | Snapdragon® 8 Elite Gen 5 Mobile | 0.442 ms | 0 - 45 MB | NPU
109
- | GPUNet | QNN_DLC | w8a16 | Snapdragon® X2 Elite | 0.571 ms | 0 - 0 MB | NPU
110
- | GPUNet | QNN_DLC | w8a16 | Snapdragon® X Elite | 1.252 ms | 0 - 0 MB | NPU
111
- | GPUNet | QNN_DLC | w8a16 | Snapdragon® 8 Gen 3 Mobile | 0.754 ms | 0 - 58 MB | NPU
112
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS6490 | 3.232 ms | 2 - 4 MB | NPU
113
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS8275 (Proxy) | 2.451 ms | 0 - 42 MB | NPU
114
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS8550 (Proxy) | 1.073 ms | 0 - 2 MB | NPU
115
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® SA8775P | 1.292 ms | 0 - 44 MB | NPU
116
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS9075 | 1.228 ms | 2 - 4 MB | NPU
117
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCM6690 | 6.556 ms | 0 - 159 MB | NPU
118
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS8450 (Proxy) | 1.44 ms | 0 - 61 MB | NPU
119
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® SA7255P | 2.451 ms | 0 - 42 MB | NPU
120
- | GPUNet | QNN_DLC | w8a16 | Qualcomm® SA8295P | 1.715 ms | 0 - 39 MB | NPU
121
- | GPUNet | QNN_DLC | w8a16 | Snapdragon® 8 Elite For Galaxy Mobile | 0.521 ms | 0 - 45 MB | NPU
122
- | GPUNet | QNN_DLC | w8a16 | Snapdragon® 7 Gen 4 Mobile | 1.281 ms | 0 - 42 MB | NPU
123
- | GPUNet | QNN_DLC | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 0.297 ms | 0 - 43 MB | NPU
124
- | GPUNet | QNN_DLC | w8a8 | Snapdragon® X2 Elite | 0.356 ms | 0 - 0 MB | NPU
125
- | GPUNet | QNN_DLC | w8a8 | Snapdragon® X Elite | 0.741 ms | 0 - 0 MB | NPU
126
- | GPUNet | QNN_DLC | w8a8 | Snapdragon® 8 Gen 3 Mobile | 0.461 ms | 0 - 56 MB | NPU
127
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS6490 | 2.01 ms | 2 - 4 MB | NPU
128
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS8275 (Proxy) | 1.398 ms | 0 - 38 MB | NPU
129
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS8550 (Proxy) | 0.603 ms | 0 - 2 MB | NPU
130
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® SA8775P | 0.819 ms | 0 - 42 MB | NPU
131
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS9075 | 0.699 ms | 2 - 4 MB | NPU
132
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCM6690 | 3.444 ms | 0 - 41 MB | NPU
133
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS8450 (Proxy) | 0.857 ms | 0 - 58 MB | NPU
134
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® SA7255P | 1.398 ms | 0 - 38 MB | NPU
135
- | GPUNet | QNN_DLC | w8a8 | Qualcomm® SA8295P | 1.112 ms | 0 - 36 MB | NPU
136
- | GPUNet | QNN_DLC | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 0.338 ms | 0 - 37 MB | NPU
137
- | GPUNet | QNN_DLC | w8a8 | Snapdragon® 7 Gen 4 Mobile | 0.791 ms | 0 - 38 MB | NPU
138
- | GPUNet | TFLITE | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.566 ms | 0 - 66 MB | NPU
139
- | GPUNet | TFLITE | float | Snapdragon® 8 Gen 3 Mobile | 0.893 ms | 0 - 93 MB | NPU
140
- | GPUNet | TFLITE | float | Qualcomm® QCS8275 (Proxy) | 4.616 ms | 0 - 62 MB | NPU
141
- | GPUNet | TFLITE | float | Qualcomm® QCS8550 (Proxy) | 1.287 ms | 0 - 3 MB | NPU
142
- | GPUNet | TFLITE | float | Qualcomm® SA8775P | 1.763 ms | 0 - 65 MB | NPU
143
- | GPUNet | TFLITE | float | Qualcomm® QCS9075 | 1.582 ms | 0 - 27 MB | NPU
144
- | GPUNet | TFLITE | float | Qualcomm® QCS8450 (Proxy) | 2.415 ms | 0 - 94 MB | NPU
145
- | GPUNet | TFLITE | float | Qualcomm® SA7255P | 4.616 ms | 0 - 62 MB | NPU
146
- | GPUNet | TFLITE | float | Qualcomm® SA8295P | 2.281 ms | 0 - 61 MB | NPU
147
- | GPUNet | TFLITE | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.703 ms | 0 - 61 MB | NPU
148
- | GPUNet | TFLITE | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 0.232 ms | 0 - 45 MB | NPU
149
- | GPUNet | TFLITE | w8a8 | Snapdragon® 8 Gen 3 Mobile | 0.345 ms | 0 - 56 MB | NPU
150
- | GPUNet | TFLITE | w8a8 | Qualcomm® QCS6490 | 1.669 ms | 0 - 16 MB | NPU
151
- | GPUNet | TFLITE | w8a8 | Qualcomm® QCS8275 (Proxy) | 1.108 ms | 0 - 39 MB | NPU
152
- | GPUNet | TFLITE | w8a8 | Qualcomm® QCS8550 (Proxy) | 0.429 ms | 0 - 7 MB | NPU
153
- | GPUNet | TFLITE | w8a8 | Qualcomm® SA8775P | 0.633 ms | 0 - 42 MB | NPU
154
- | GPUNet | TFLITE | w8a8 | Qualcomm® QCS9075 | 0.52 ms | 0 - 14 MB | NPU
155
- | GPUNet | TFLITE | w8a8 | Qualcomm® QCM6690 | 3.026 ms | 0 - 40 MB | NPU
156
- | GPUNet | TFLITE | w8a8 | Qualcomm® QCS8450 (Proxy) | 0.698 ms | 0 - 57 MB | NPU
157
- | GPUNet | TFLITE | w8a8 | Qualcomm® SA7255P | 1.108 ms | 0 - 39 MB | NPU
158
- | GPUNet | TFLITE | w8a8 | Qualcomm® SA8295P | 0.894 ms | 0 - 37 MB | NPU
159
- | GPUNet | TFLITE | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 0.266 ms | 0 - 37 MB | NPU
160
- | GPUNet | TFLITE | w8a8 | Snapdragon® 7 Gen 4 Mobile | 0.619 ms | 0 - 38 MB | NPU
161
 
162
  ## License
163
  * The license for the original implementation of GPUNet can be found
 
29
 
30
  | Runtime | Precision | Chipset | SDK Versions | Download |
31
  |---|---|---|---|---|
32
+ | ONNX | float | Universal | QAIRT 2.42, ONNX Runtime 1.24.3 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.51.0/gpunet-onnx-float.zip)
33
+ | ONNX | w8a16 | Universal | QAIRT 2.42, ONNX Runtime 1.24.3 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.51.0/gpunet-onnx-w8a16.zip)
34
+ | ONNX | w8a8 | Universal | QAIRT 2.42, ONNX Runtime 1.24.3 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.51.0/gpunet-onnx-w8a8.zip)
35
+ | QNN_DLC | float | Universal | QAIRT 2.45 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.51.0/gpunet-qnn_dlc-float.zip)
36
+ | QNN_DLC | w8a16 | Universal | QAIRT 2.45 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.51.0/gpunet-qnn_dlc-w8a16.zip)
37
+ | QNN_DLC | w8a8 | Universal | QAIRT 2.45 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.51.0/gpunet-qnn_dlc-w8a8.zip)
38
+ | TFLITE | float | Universal | QAIRT 2.45 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.51.0/gpunet-tflite-float.zip)
39
+ | TFLITE | w8a8 | Universal | QAIRT 2.45 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.51.0/gpunet-tflite-w8a8.zip)
40
 
41
  For more device-specific assets and performance metrics, visit **[GPUNet on Qualcomm® AI Hub](https://aihub.qualcomm.com/models/gpunet)**.
42
 
 
66
  ## Performance Summary
67
  | Model | Runtime | Precision | Chipset | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit
68
  |---|---|---|---|---|---|---
69
+ | GPUNet | ONNX | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.507 ms | 0 - 36 MB | NPU
70
+ | GPUNet | ONNX | float | Snapdragon® X2 Elite | 0.463 ms | 24 - 24 MB | NPU
71
+ | GPUNet | ONNX | float | Snapdragon® X Elite | 1.121 ms | 24 - 24 MB | NPU
72
+ | GPUNet | ONNX | float | Snapdragon® 8 Gen 3 Mobile | 0.782 ms | 0 - 59 MB | NPU
73
+ | GPUNet | ONNX | float | Qualcomm® QCS8550 (Proxy) | 1.043 ms | 0 - 2 MB | NPU
74
+ | GPUNet | ONNX | float | Qualcomm® QCS9075 | 1.357 ms | 1 - 3 MB | NPU
75
+ | GPUNet | ONNX | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.6 ms | 0 - 30 MB | NPU
76
+ | GPUNet | ONNX | w8a16 | Snapdragon® 8 Elite Gen 5 Mobile | 0.359 ms | 0 - 49 MB | NPU
77
+ | GPUNet | ONNX | w8a16 | Snapdragon® X2 Elite | 0.385 ms | 12 - 12 MB | NPU
78
+ | GPUNet | ONNX | w8a16 | Snapdragon® X Elite | 0.988 ms | 12 - 12 MB | NPU
79
+ | GPUNet | ONNX | w8a16 | Snapdragon® 8 Gen 3 Mobile | 0.58 ms | 0 - 64 MB | NPU
80
+ | GPUNet | ONNX | w8a16 | Qualcomm® QCS6490 | 98.781 ms | 22 - 31 MB | CPU
81
+ | GPUNet | ONNX | w8a16 | Qualcomm® QCS8550 (Proxy) | 0.826 ms | 0 - 2 MB | NPU
82
+ | GPUNet | ONNX | w8a16 | Qualcomm® QCS9075 | 0.985 ms | 0 - 3 MB | NPU
83
+ | GPUNet | ONNX | w8a16 | Qualcomm® QCM6690 | 49.699 ms | 30 - 38 MB | CPU
84
+ | GPUNet | ONNX | w8a16 | Snapdragon® 8 Elite For Galaxy Mobile | 0.431 ms | 0 - 44 MB | NPU
85
+ | GPUNet | ONNX | w8a16 | Snapdragon® 7 Gen 4 Mobile | 38.163 ms | 30 - 38 MB | CPU
86
+ | GPUNet | ONNX | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 0.351 ms | 0 - 48 MB | NPU
87
+ | GPUNet | ONNX | w8a8 | Snapdragon® X2 Elite | 0.311 ms | 12 - 12 MB | NPU
88
+ | GPUNet | ONNX | w8a8 | Snapdragon® X Elite | 0.744 ms | 12 - 12 MB | NPU
89
+ | GPUNet | ONNX | w8a8 | Snapdragon® 8 Gen 3 Mobile | 0.47 ms | 0 - 62 MB | NPU
90
+ | GPUNet | ONNX | w8a8 | Qualcomm® QCS6490 | 16.703 ms | 1 - 13 MB | CPU
91
+ | GPUNet | ONNX | w8a8 | Qualcomm® QCS8550 (Proxy) | 0.618 ms | 0 - 14 MB | NPU
92
+ | GPUNet | ONNX | w8a8 | Qualcomm® QCS9075 | 0.726 ms | 0 - 3 MB | NPU
93
+ | GPUNet | ONNX | w8a8 | Qualcomm® QCM6690 | 10.166 ms | 6 - 14 MB | CPU
94
+ | GPUNet | ONNX | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 0.394 ms | 0 - 38 MB | NPU
95
+ | GPUNet | ONNX | w8a8 | Snapdragon® 7 Gen 4 Mobile | 7.657 ms | 6 - 15 MB | CPU
96
+ | GPUNet | QNN_DLC | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.563 ms | 1 - 35 MB | NPU
97
+ | GPUNet | QNN_DLC | float | Snapdragon® X2 Elite | 0.654 ms | 1 - 1 MB | NPU
98
+ | GPUNet | QNN_DLC | float | Snapdragon® X Elite | 1.392 ms | 1 - 1 MB | NPU
99
+ | GPUNet | QNN_DLC | float | Snapdragon® 8 Gen 3 Mobile | 0.918 ms | 0 - 57 MB | NPU
100
+ | GPUNet | QNN_DLC | float | Qualcomm® QCS8275 (Proxy) | 4.704 ms | 1 - 31 MB | NPU
101
+ | GPUNet | QNN_DLC | float | Qualcomm® QCS8550 (Proxy) | 1.271 ms | 1 - 2 MB | NPU
102
+ | GPUNet | QNN_DLC | float | Qualcomm® SA8775P | 1.7 ms | 1 - 35 MB | NPU
103
+ | GPUNet | QNN_DLC | float | Qualcomm® QCS9075 | 1.589 ms | 3 - 5 MB | NPU
104
+ | GPUNet | QNN_DLC | float | Qualcomm® QCS8450 (Proxy) | 2.415 ms | 0 - 62 MB | NPU
105
+ | GPUNet | QNN_DLC | float | Qualcomm® SA7255P | 4.704 ms | 1 - 31 MB | NPU
106
+ | GPUNet | QNN_DLC | float | Qualcomm® SA8295P | 2.267 ms | 0 - 31 MB | NPU
107
+ | GPUNet | QNN_DLC | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.704 ms | 0 - 30 MB | NPU
108
+ | GPUNet | QNN_DLC | w8a16 | Snapdragon® 8 Elite Gen 5 Mobile | 0.434 ms | 0 - 44 MB | NPU
109
+ | GPUNet | QNN_DLC | w8a16 | Snapdragon® X2 Elite | 0.575 ms | 0 - 0 MB | NPU
110
+ | GPUNet | QNN_DLC | w8a16 | Snapdragon® X Elite | 1.245 ms | 0 - 0 MB | NPU
111
+ | GPUNet | QNN_DLC | w8a16 | Snapdragon® 8 Gen 3 Mobile | 0.77 ms | 0 - 58 MB | NPU
112
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS6490 | 3.221 ms | 0 - 2 MB | NPU
113
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS8275 (Proxy) | 2.473 ms | 0 - 41 MB | NPU
114
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS8550 (Proxy) | 1.067 ms | 0 - 2 MB | NPU
115
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® SA8775P | 1.269 ms | 0 - 44 MB | NPU
116
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS9075 | 1.231 ms | 2 - 4 MB | NPU
117
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCM6690 | 6.525 ms | 0 - 163 MB | NPU
118
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® QCS8450 (Proxy) | 1.451 ms | 0 - 61 MB | NPU
119
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® SA7255P | 2.473 ms | 0 - 41 MB | NPU
120
+ | GPUNet | QNN_DLC | w8a16 | Qualcomm® SA8295P | 1.688 ms | 0 - 39 MB | NPU
121
+ | GPUNet | QNN_DLC | w8a16 | Snapdragon® 8 Elite For Galaxy Mobile | 0.527 ms | 0 - 41 MB | NPU
122
+ | GPUNet | QNN_DLC | w8a16 | Snapdragon® 7 Gen 4 Mobile | 1.289 ms | 0 - 42 MB | NPU
123
+ | GPUNet | QNN_DLC | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 0.288 ms | 0 - 44 MB | NPU
124
+ | GPUNet | QNN_DLC | w8a8 | Snapdragon® X2 Elite | 0.352 ms | 0 - 0 MB | NPU
125
+ | GPUNet | QNN_DLC | w8a8 | Snapdragon® X Elite | 0.721 ms | 0 - 0 MB | NPU
126
+ | GPUNet | QNN_DLC | w8a8 | Snapdragon® 8 Gen 3 Mobile | 0.461 ms | 0 - 55 MB | NPU
127
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS6490 | 2.004 ms | 0 - 2 MB | NPU
128
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS8275 (Proxy) | 1.41 ms | 0 - 39 MB | NPU
129
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS8550 (Proxy) | 0.612 ms | 0 - 1 MB | NPU
130
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® SA8775P | 0.799 ms | 0 - 40 MB | NPU
131
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS9075 | 0.688 ms | 2 - 4 MB | NPU
132
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCM6690 | 3.454 ms | 0 - 42 MB | NPU
133
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® QCS8450 (Proxy) | 0.854 ms | 0 - 57 MB | NPU
134
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® SA7255P | 1.41 ms | 0 - 39 MB | NPU
135
+ | GPUNet | QNN_DLC | w8a8 | Qualcomm® SA8295P | 1.126 ms | 0 - 37 MB | NPU
136
+ | GPUNet | QNN_DLC | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 0.346 ms | 0 - 37 MB | NPU
137
+ | GPUNet | QNN_DLC | w8a8 | Snapdragon® 7 Gen 4 Mobile | 0.784 ms | 0 - 40 MB | NPU
138
+ | GPUNet | TFLITE | float | Snapdragon® 8 Elite Gen 5 Mobile | 0.558 ms | 0 - 47 MB | NPU
139
+ | GPUNet | TFLITE | float | Snapdragon® 8 Gen 3 Mobile | 0.904 ms | 0 - 74 MB | NPU
140
+ | GPUNet | TFLITE | float | Qualcomm® QCS8275 (Proxy) | 4.708 ms | 0 - 44 MB | NPU
141
+ | GPUNet | TFLITE | float | Qualcomm® QCS8550 (Proxy) | 1.266 ms | 0 - 2 MB | NPU
142
+ | GPUNet | TFLITE | float | Qualcomm® SA8775P | 1.757 ms | 0 - 45 MB | NPU
143
+ | GPUNet | TFLITE | float | Qualcomm® QCS9075 | 1.579 ms | 0 - 27 MB | NPU
144
+ | GPUNet | TFLITE | float | Qualcomm® QCS8450 (Proxy) | 2.402 ms | 0 - 79 MB | NPU
145
+ | GPUNet | TFLITE | float | Qualcomm® SA7255P | 4.708 ms | 0 - 44 MB | NPU
146
+ | GPUNet | TFLITE | float | Qualcomm® SA8295P | 2.278 ms | 0 - 44 MB | NPU
147
+ | GPUNet | TFLITE | float | Snapdragon® 8 Elite For Galaxy Mobile | 0.71 ms | 0 - 47 MB | NPU
148
+ | GPUNet | TFLITE | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 0.239 ms | 0 - 45 MB | NPU
149
+ | GPUNet | TFLITE | w8a8 | Snapdragon® 8 Gen 3 Mobile | 0.33 ms | 0 - 56 MB | NPU
150
+ | GPUNet | TFLITE | w8a8 | Qualcomm® QCS6490 | 1.599 ms | 0 - 15 MB | NPU
151
+ | GPUNet | TFLITE | w8a8 | Qualcomm® QCS8275 (Proxy) | 1.121 ms | 0 - 39 MB | NPU
152
+ | GPUNet | TFLITE | w8a8 | Qualcomm® QCS8550 (Proxy) | 0.429 ms | 0 - 10 MB | NPU
153
+ | GPUNet | TFLITE | w8a8 | Qualcomm® SA8775P | 0.637 ms | 0 - 42 MB | NPU
154
+ | GPUNet | TFLITE | w8a8 | Qualcomm® QCS9075 | 0.525 ms | 0 - 14 MB | NPU
155
+ | GPUNet | TFLITE | w8a8 | Qualcomm® QCM6690 | 3.033 ms | 0 - 40 MB | NPU
156
+ | GPUNet | TFLITE | w8a8 | Qualcomm® QCS8450 (Proxy) | 0.684 ms | 0 - 58 MB | NPU
157
+ | GPUNet | TFLITE | w8a8 | Qualcomm® SA7255P | 1.121 ms | 0 - 39 MB | NPU
158
+ | GPUNet | TFLITE | w8a8 | Qualcomm® SA8295P | 0.903 ms | 0 - 37 MB | NPU
159
+ | GPUNet | TFLITE | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 0.274 ms | 0 - 38 MB | NPU
160
+ | GPUNet | TFLITE | w8a8 | Snapdragon® 7 Gen 4 Mobile | 0.617 ms | 0 - 38 MB | NPU
161
 
162
  ## License
163
  * The license for the original implementation of GPUNet can be found
release_assets.json CHANGED
@@ -1 +1,70 @@
1
- {"version":"0.50.2","precisions":{"float":{"universal_assets":{"onnx":{"tool_versions":{"qairt":"2.42.0.251225135753_193295","onnx_runtime":"1.24.3"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-onnx-float.zip"},"qnn_dlc":{"tool_versions":{"qairt":"2.43.0.260127150333_193827"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-qnn_dlc-float.zip"},"tflite":{"tool_versions":{"qairt":"2.43.0.260127150333_193827","tflite":"2.19.1"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-tflite-float.zip"}}},"w8a16":{"universal_assets":{"onnx":{"tool_versions":{"qairt":"2.42.0.251225135753_193295","onnx_runtime":"1.24.3"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-onnx-w8a16.zip"},"qnn_dlc":{"tool_versions":{"qairt":"2.43.0.260127150333_193827"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-qnn_dlc-w8a16.zip"}}},"w8a8":{"universal_assets":{"onnx":{"tool_versions":{"qairt":"2.42.0.251225135753_193295","onnx_runtime":"1.24.3"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-onnx-w8a8.zip"},"qnn_dlc":{"tool_versions":{"qairt":"2.43.0.260127150333_193827"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-qnn_dlc-w8a8.zip"},"tflite":{"tool_versions":{"qairt":"2.43.0.260127150333_193827","tflite":"2.19.1"},"download_url":"https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.50.2/gpunet-tflite-w8a8.zip"}}}}}
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "version": "0.51.0",
3
+ "precisions": {
4
+ "w8a8": {
5
+ "universal_assets": {
6
+ "tflite": {
7
+ "tool_versions": {
8
+ "qairt": "2.45.0.260326154327",
9
+ "litert": "1.4.2"
10
+ },
11
+ "download_url": "https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.51.0/gpunet-tflite-w8a8.zip"
12
+ },
13
+ "qnn_dlc": {
14
+ "tool_versions": {
15
+ "qairt": "2.45.0.260326154327"
16
+ },
17
+ "download_url": "https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.51.0/gpunet-qnn_dlc-w8a8.zip"
18
+ },
19
+ "onnx": {
20
+ "tool_versions": {
21
+ "qairt": "2.42.0.251225135753_193295",
22
+ "onnx_runtime": "1.24.3"
23
+ },
24
+ "download_url": "https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.51.0/gpunet-onnx-w8a8.zip"
25
+ }
26
+ }
27
+ },
28
+ "float": {
29
+ "universal_assets": {
30
+ "tflite": {
31
+ "tool_versions": {
32
+ "qairt": "2.45.0.260326154327",
33
+ "litert": "1.4.2"
34
+ },
35
+ "download_url": "https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.51.0/gpunet-tflite-float.zip"
36
+ },
37
+ "qnn_dlc": {
38
+ "tool_versions": {
39
+ "qairt": "2.45.0.260326154327"
40
+ },
41
+ "download_url": "https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.51.0/gpunet-qnn_dlc-float.zip"
42
+ },
43
+ "onnx": {
44
+ "tool_versions": {
45
+ "qairt": "2.42.0.251225135753_193295",
46
+ "onnx_runtime": "1.24.3"
47
+ },
48
+ "download_url": "https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.51.0/gpunet-onnx-float.zip"
49
+ }
50
+ }
51
+ },
52
+ "w8a16": {
53
+ "universal_assets": {
54
+ "qnn_dlc": {
55
+ "tool_versions": {
56
+ "qairt": "2.45.0.260326154327"
57
+ },
58
+ "download_url": "https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.51.0/gpunet-qnn_dlc-w8a16.zip"
59
+ },
60
+ "onnx": {
61
+ "tool_versions": {
62
+ "qairt": "2.42.0.251225135753_193295",
63
+ "onnx_runtime": "1.24.3"
64
+ },
65
+ "download_url": "https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/gpunet/releases/v0.51.0/gpunet-onnx-w8a16.zip"
66
+ }
67
+ }
68
+ }
69
+ }
70
+ }