qaihm-bot commited on
Commit
cfd075c
·
verified ·
1 Parent(s): 528baab

See https://github.com/qualcomm/ai-hub-models/releases/v0.53.0 for changelog.

Files changed (2) hide show
  1. README.md +47 -39
  2. release_assets.json +15 -15
README.md CHANGED
@@ -29,10 +29,10 @@ Below are pre-exported model assets ready for deployment.
29
 
30
  | Runtime | Precision | Chipset | SDK Versions | Download |
31
  |---|---|---|---|---|
32
- | ONNX | float | Universal | QAIRT 2.42, ONNX Runtime 1.24.3 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientnet_b4/releases/v0.52.0/efficientnet_b4-onnx-float.zip)
33
- | QNN_DLC | float | Universal | QAIRT 2.45 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientnet_b4/releases/v0.52.0/efficientnet_b4-qnn_dlc-float.zip)
34
- | QNN_DLC | w8a16 | Universal | QAIRT 2.45 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientnet_b4/releases/v0.52.0/efficientnet_b4-qnn_dlc-w8a16.zip)
35
- | TFLITE | float | Universal | QAIRT 2.45 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientnet_b4/releases/v0.52.0/efficientnet_b4-tflite-float.zip)
36
 
37
  For more device-specific assets and performance metrics, visit **[EfficientNet-B4 on Qualcomm® AI Hub](https://aihub.qualcomm.com/models/efficientnet_b4)**.
38
 
@@ -62,41 +62,49 @@ See our repository for [EfficientNet-B4 on GitHub](https://github.com/qualcomm/a
62
  ## Performance Summary
63
  | Model | Runtime | Precision | Chipset | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit
64
  |---|---|---|---|---|---|---
65
- | EfficientNet-B4 | ONNX | float | Snapdragon® 8 Elite Gen 5 Mobile | 1.466 ms | 0 - 77 MB | NPU
66
- | EfficientNet-B4 | ONNX | float | Snapdragon® X2 Elite | 1.631 ms | 45 - 45 MB | NPU
67
- | EfficientNet-B4 | ONNX | float | Snapdragon® X Elite | 3.34 ms | 45 - 45 MB | NPU
68
- | EfficientNet-B4 | ONNX | float | Snapdragon® 8 Gen 3 Mobile | 2.255 ms | 0 - 128 MB | NPU
69
- | EfficientNet-B4 | ONNX | float | Qualcomm® QCS8550 (Proxy) | 3.092 ms | 0 - 50 MB | NPU
70
- | EfficientNet-B4 | ONNX | float | Qualcomm® QCS9075 | 4.022 ms | 0 - 4 MB | NPU
71
- | EfficientNet-B4 | ONNX | float | Snapdragon® 8 Elite For Galaxy Mobile | 1.769 ms | 0 - 77 MB | NPU
72
- | EfficientNet-B4 | QNN_DLC | float | Snapdragon® 8 Elite Gen 5 Mobile | 1.498 ms | 1 - 70 MB | NPU
73
- | EfficientNet-B4 | QNN_DLC | float | Snapdragon® X2 Elite | 1.934 ms | 1 - 1 MB | NPU
74
- | EfficientNet-B4 | QNN_DLC | float | Snapdragon® X Elite | 3.618 ms | 1 - 1 MB | NPU
75
- | EfficientNet-B4 | QNN_DLC | float | Snapdragon® 8 Gen 3 Mobile | 2.392 ms | 0 - 116 MB | NPU
76
- | EfficientNet-B4 | QNN_DLC | float | Qualcomm® QCS8275 (Proxy) | 12.025 ms | 1 - 65 MB | NPU
77
- | EfficientNet-B4 | QNN_DLC | float | Qualcomm® QCS8550 (Proxy) | 3.355 ms | 1 - 169 MB | NPU
78
- | EfficientNet-B4 | QNN_DLC | float | Qualcomm® QCS9075 | 4.13 ms | 3 - 5 MB | NPU
79
- | EfficientNet-B4 | QNN_DLC | float | Qualcomm® QCS8450 (Proxy) | 7.877 ms | 0 - 137 MB | NPU
80
- | EfficientNet-B4 | QNN_DLC | float | Snapdragon® 8 Elite For Galaxy Mobile | 1.857 ms | 1 - 70 MB | NPU
81
- | EfficientNet-B4 | QNN_DLC | w8a16 | Snapdragon® 8 Elite Gen 5 Mobile | 1.309 ms | 0 - 109 MB | NPU
82
- | EfficientNet-B4 | QNN_DLC | w8a16 | Snapdragon® X2 Elite | 1.712 ms | 0 - 0 MB | NPU
83
- | EfficientNet-B4 | QNN_DLC | w8a16 | Snapdragon® X Elite | 3.782 ms | 0 - 0 MB | NPU
84
- | EfficientNet-B4 | QNN_DLC | w8a16 | Snapdragon® 8 Gen 3 Mobile | 2.292 ms | 0 - 149 MB | NPU
85
- | EfficientNet-B4 | QNN_DLC | w8a16 | Qualcomm® QCS6490 | 8.74 ms | 0 - 2 MB | NPU
86
- | EfficientNet-B4 | QNN_DLC | w8a16 | Qualcomm® QCS8275 (Proxy) | 6.596 ms | 0 - 101 MB | NPU
87
- | EfficientNet-B4 | QNN_DLC | w8a16 | Qualcomm® QCS8550 (Proxy) | 3.434 ms | 0 - 8 MB | NPU
88
- | EfficientNet-B4 | QNN_DLC | w8a16 | Qualcomm® QCS9075 | 3.786 ms | 0 - 2 MB | NPU
89
- | EfficientNet-B4 | QNN_DLC | w8a16 | Qualcomm® QCM6690 | 16.24 ms | 0 - 231 MB | NPU
90
- | EfficientNet-B4 | QNN_DLC | w8a16 | Qualcomm® QCS8450 (Proxy) | 4.193 ms | 0 - 151 MB | NPU
91
- | EfficientNet-B4 | QNN_DLC | w8a16 | Snapdragon® 8 Elite For Galaxy Mobile | 1.593 ms | 0 - 104 MB | NPU
92
- | EfficientNet-B4 | QNN_DLC | w8a16 | Snapdragon® 7 Gen 4 Mobile | 3.594 ms | 0 - 106 MB | NPU
93
- | EfficientNet-B4 | TFLITE | float | Snapdragon® 8 Elite Gen 5 Mobile | 1.507 ms | 0 - 86 MB | NPU
94
- | EfficientNet-B4 | TFLITE | float | Snapdragon® 8 Gen 3 Mobile | 2.397 ms | 0 - 146 MB | NPU
95
- | EfficientNet-B4 | TFLITE | float | Qualcomm® QCS8275 (Proxy) | 12.067 ms | 0 - 82 MB | NPU
96
- | EfficientNet-B4 | TFLITE | float | Qualcomm® QCS8550 (Proxy) | 3.309 ms | 0 - 2 MB | NPU
97
- | EfficientNet-B4 | TFLITE | float | Qualcomm® QCS9075 | 4.156 ms | 0 - 48 MB | NPU
98
- | EfficientNet-B4 | TFLITE | float | Qualcomm® QCS8450 (Proxy) | 7.858 ms | 0 - 157 MB | NPU
99
- | EfficientNet-B4 | TFLITE | float | Snapdragon® 8 Elite For Galaxy Mobile | 1.839 ms | 0 - 82 MB | NPU
 
 
 
 
 
 
 
 
100
 
101
  ## License
102
  * The license for the original implementation of EfficientNet-B4 can be found
 
29
 
30
  | Runtime | Precision | Chipset | SDK Versions | Download |
31
  |---|---|---|---|---|
32
+ | ONNX | float | Universal | QAIRT 2.42, ONNX Runtime 1.24.3 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientnet_b4/releases/v0.53.0/efficientnet_b4-onnx-float.zip)
33
+ | QNN_DLC | float | Universal | QAIRT 2.45 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientnet_b4/releases/v0.53.0/efficientnet_b4-qnn_dlc-float.zip)
34
+ | QNN_DLC | w8a16 | Universal | QAIRT 2.45 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientnet_b4/releases/v0.53.0/efficientnet_b4-qnn_dlc-w8a16.zip)
35
+ | TFLITE | float | Universal | QAIRT 2.45 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientnet_b4/releases/v0.53.0/efficientnet_b4-tflite-float.zip)
36
 
37
  For more device-specific assets and performance metrics, visit **[EfficientNet-B4 on Qualcomm® AI Hub](https://aihub.qualcomm.com/models/efficientnet_b4)**.
38
 
 
62
  ## Performance Summary
63
  | Model | Runtime | Precision | Chipset | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit
64
  |---|---|---|---|---|---|---
65
+ | EfficientNet-B4 | ONNX | float | Snapdragon® 8 Elite Gen 5 Mobile | 1.467 ms | 1 - 78 MB | NPU
66
+ | EfficientNet-B4 | ONNX | float | Snapdragon® 8 Elite Mobile | 1.759 ms | 0 - 71 MB | NPU
67
+ | EfficientNet-B4 | ONNX | float | Snapdragon® X2 Elite | 1.628 ms | 45 - 45 MB | NPU
68
+ | EfficientNet-B4 | ONNX | float | Snapdragon® X Elite | 3.346 ms | 45 - 45 MB | NPU
69
+ | EfficientNet-B4 | ONNX | float | Snapdragon® X Elite | 3.346 ms | 45 - 45 MB | NPU
70
+ | EfficientNet-B4 | ONNX | float | Snapdragon® 8 Gen 3 Mobile | 2.271 ms | 0 - 129 MB | NPU
71
+ | EfficientNet-B4 | ONNX | float | Qualcomm® QCS8550 (Proxy) | 3.055 ms | 0 - 51 MB | NPU
72
+ | EfficientNet-B4 | ONNX | float | Qualcomm® QCS9075 | 4.023 ms | 0 - 4 MB | NPU
73
+ | EfficientNet-B4 | ONNX | float | Snapdragon® 8 Elite For Galaxy Mobile | 1.759 ms | 0 - 71 MB | NPU
74
+ | EfficientNet-B4 | QNN_DLC | float | Snapdragon® 8 Elite Gen 5 Mobile | 1.507 ms | 0 - 68 MB | NPU
75
+ | EfficientNet-B4 | QNN_DLC | float | Snapdragon® 8 Elite Mobile | 1.842 ms | 0 - 69 MB | NPU
76
+ | EfficientNet-B4 | QNN_DLC | float | Snapdragon® X2 Elite | 1.941 ms | 1 - 1 MB | NPU
77
+ | EfficientNet-B4 | QNN_DLC | float | Snapdragon® X Elite | 3.599 ms | 1 - 1 MB | NPU
78
+ | EfficientNet-B4 | QNN_DLC | float | Snapdragon® X Elite | 3.599 ms | 1 - 1 MB | NPU
79
+ | EfficientNet-B4 | QNN_DLC | float | Snapdragon® 8 Gen 3 Mobile | 2.385 ms | 0 - 117 MB | NPU
80
+ | EfficientNet-B4 | QNN_DLC | float | Qualcomm® QCS8275 (Proxy) | 12.006 ms | 1 - 65 MB | NPU
81
+ | EfficientNet-B4 | QNN_DLC | float | Qualcomm® QCS8550 (Proxy) | 3.347 ms | 0 - 30 MB | NPU
82
+ | EfficientNet-B4 | QNN_DLC | float | Qualcomm® QCS9075 | 4.132 ms | 1 - 3 MB | NPU
83
+ | EfficientNet-B4 | QNN_DLC | float | Qualcomm® QCS8450 (Proxy) | 7.865 ms | 0 - 136 MB | NPU
84
+ | EfficientNet-B4 | QNN_DLC | float | Snapdragon® 8 Elite For Galaxy Mobile | 1.842 ms | 0 - 69 MB | NPU
85
+ | EfficientNet-B4 | QNN_DLC | w8a16 | Snapdragon® 8 Elite Gen 5 Mobile | 1.317 ms | 0 - 109 MB | NPU
86
+ | EfficientNet-B4 | QNN_DLC | w8a16 | Snapdragon® 8 Elite Mobile | 1.595 ms | 0 - 104 MB | NPU
87
+ | EfficientNet-B4 | QNN_DLC | w8a16 | Snapdragon® X2 Elite | 1.701 ms | 0 - 0 MB | NPU
88
+ | EfficientNet-B4 | QNN_DLC | w8a16 | Snapdragon® X Elite | 3.763 ms | 0 - 0 MB | NPU
89
+ | EfficientNet-B4 | QNN_DLC | w8a16 | Snapdragon® X Elite | 3.763 ms | 0 - 0 MB | NPU
90
+ | EfficientNet-B4 | QNN_DLC | w8a16 | Snapdragon® 8 Gen 3 Mobile | 2.292 ms | 0 - 147 MB | NPU
91
+ | EfficientNet-B4 | QNN_DLC | w8a16 | Qualcomm® QCS6490 | 8.757 ms | 2 - 4 MB | NPU
92
+ | EfficientNet-B4 | QNN_DLC | w8a16 | Qualcomm® QCS8275 (Proxy) | 6.565 ms | 0 - 101 MB | NPU
93
+ | EfficientNet-B4 | QNN_DLC | w8a16 | Qualcomm® QCS8550 (Proxy) | 3.447 ms | 0 - 2 MB | NPU
94
+ | EfficientNet-B4 | QNN_DLC | w8a16 | Qualcomm® QCS9075 | 3.78 ms | 0 - 2 MB | NPU
95
+ | EfficientNet-B4 | QNN_DLC | w8a16 | Qualcomm® QCM6690 | 16.121 ms | 0 - 232 MB | NPU
96
+ | EfficientNet-B4 | QNN_DLC | w8a16 | Qualcomm® QCS8450 (Proxy) | 4.191 ms | 0 - 151 MB | NPU
97
+ | EfficientNet-B4 | QNN_DLC | w8a16 | Snapdragon® 8 Elite For Galaxy Mobile | 1.595 ms | 0 - 104 MB | NPU
98
+ | EfficientNet-B4 | QNN_DLC | w8a16 | Snapdragon® 7 Gen 4 Mobile | 3.565 ms | 0 - 107 MB | NPU
99
+ | EfficientNet-B4 | QNN_DLC | w8a16 | Snapdragon® 7 Gen 4 Mobile | 3.565 ms | 0 - 107 MB | NPU
100
+ | EfficientNet-B4 | TFLITE | float | Snapdragon® 8 Elite Gen 5 Mobile | 1.509 ms | 0 - 85 MB | NPU
101
+ | EfficientNet-B4 | TFLITE | float | Snapdragon® 8 Elite Mobile | 1.842 ms | 0 - 87 MB | NPU
102
+ | EfficientNet-B4 | TFLITE | float | Snapdragon® 8 Gen 3 Mobile | 2.397 ms | 0 - 145 MB | NPU
103
+ | EfficientNet-B4 | TFLITE | float | Qualcomm® QCS8275 (Proxy) | 12.043 ms | 0 - 82 MB | NPU
104
+ | EfficientNet-B4 | TFLITE | float | Qualcomm® QCS8550 (Proxy) | 3.307 ms | 0 - 2 MB | NPU
105
+ | EfficientNet-B4 | TFLITE | float | Qualcomm® QCS9075 | 4.157 ms | 0 - 48 MB | NPU
106
+ | EfficientNet-B4 | TFLITE | float | Qualcomm® QCS8450 (Proxy) | 7.877 ms | 0 - 162 MB | NPU
107
+ | EfficientNet-B4 | TFLITE | float | Snapdragon® 8 Elite For Galaxy Mobile | 1.842 ms | 0 - 87 MB | NPU
108
 
109
  ## License
110
  * The license for the original implementation of EfficientNet-B4 can be found
release_assets.json CHANGED
@@ -1,37 +1,37 @@
1
  {
2
- "version": "0.52.0",
3
  "precisions": {
 
 
 
 
 
 
 
 
 
 
4
  "float": {
5
  "universal_assets": {
6
  "tflite": {
7
  "tool_versions": {
8
  "qairt": "2.45.0.260326154327",
9
- "litert": "1.4.2"
10
  },
11
- "download_url": "https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientnet_b4/releases/v0.52.0/efficientnet_b4-tflite-float.zip"
12
  },
13
  "qnn_dlc": {
14
  "tool_versions": {
15
  "qairt": "2.45.0.260326154327"
16
  },
17
- "download_url": "https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientnet_b4/releases/v0.52.0/efficientnet_b4-qnn_dlc-float.zip"
18
  },
19
  "onnx": {
20
  "tool_versions": {
21
  "qairt": "2.42.0.251225135753_193295",
22
  "onnx_runtime": "1.24.3"
23
  },
24
- "download_url": "https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientnet_b4/releases/v0.52.0/efficientnet_b4-onnx-float.zip"
25
- }
26
- }
27
- },
28
- "w8a16": {
29
- "universal_assets": {
30
- "qnn_dlc": {
31
- "tool_versions": {
32
- "qairt": "2.45.0.260326154327"
33
- },
34
- "download_url": "https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientnet_b4/releases/v0.52.0/efficientnet_b4-qnn_dlc-w8a16.zip"
35
  }
36
  }
37
  }
 
1
  {
2
+ "version": "0.53.0",
3
  "precisions": {
4
+ "w8a16": {
5
+ "universal_assets": {
6
+ "qnn_dlc": {
7
+ "tool_versions": {
8
+ "qairt": "2.45.0.260326154327"
9
+ },
10
+ "download_url": "https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientnet_b4/releases/v0.53.0/efficientnet_b4-qnn_dlc-w8a16.zip"
11
+ }
12
+ }
13
+ },
14
  "float": {
15
  "universal_assets": {
16
  "tflite": {
17
  "tool_versions": {
18
  "qairt": "2.45.0.260326154327",
19
+ "litert": "1.4.3"
20
  },
21
+ "download_url": "https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientnet_b4/releases/v0.53.0/efficientnet_b4-tflite-float.zip"
22
  },
23
  "qnn_dlc": {
24
  "tool_versions": {
25
  "qairt": "2.45.0.260326154327"
26
  },
27
+ "download_url": "https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientnet_b4/releases/v0.53.0/efficientnet_b4-qnn_dlc-float.zip"
28
  },
29
  "onnx": {
30
  "tool_versions": {
31
  "qairt": "2.42.0.251225135753_193295",
32
  "onnx_runtime": "1.24.3"
33
  },
34
+ "download_url": "https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/efficientnet_b4/releases/v0.53.0/efficientnet_b4-onnx-float.zip"
 
 
 
 
 
 
 
 
 
 
35
  }
36
  }
37
  }