qaihm-bot commited on
Commit
3101d82
·
verified ·
1 Parent(s): 14ab736

See https://github.com/quic/ai-hub-models/releases/v0.30.2 for changelog.

EfficientNet-B4.bin DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:45b42ab33f324c1e72f638e2ade8bba742758bf4a4b49898d12bc324a25abdb4
3
- size 47398368
 
 
 
 
EfficientNet-B4.so DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:73bf08b8d51b9c6f25aabdb1a515c8853d1224f41e043a27368b9d511f94dad5
3
- size 78579824
 
 
 
 
EfficientNet-B4_w8a16.bin DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:7243d7eab5e77ec56021452387b462b9ebb01f2d1195e18de8b7b30869f3d4f4
3
- size 24776176
 
 
 
 
EfficientNet-B4_w8a16.so DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:0ea05031bfa31ffe4b7815ae096daa615fd3a6d13a17942caf4ce30cc900f8de
3
- size 22535216
 
 
 
 
README.md CHANGED
@@ -35,38 +35,34 @@ More details on model performance across various devices, can be found
35
 
36
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
37
  |---|---|---|---|---|---|---|---|---|
38
- | EfficientNet-B4 | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 64.826 ms | 0 - 61 MB | NPU | [EfficientNet-B4.tflite](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.tflite) |
39
- | EfficientNet-B4 | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN | 64.576 ms | 1 - 10 MB | NPU | Use Export Script |
40
- | EfficientNet-B4 | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 7.173 ms | 0 - 83 MB | NPU | [EfficientNet-B4.tflite](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.tflite) |
41
- | EfficientNet-B4 | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN | 8.054 ms | 1 - 49 MB | NPU | Use Export Script |
42
- | EfficientNet-B4 | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 3.335 ms | 0 - 385 MB | NPU | [EfficientNet-B4.tflite](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.tflite) |
43
- | EfficientNet-B4 | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 3.186 ms | 1 - 3 MB | NPU | Use Export Script |
44
- | EfficientNet-B4 | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 4.371 ms | 0 - 62 MB | NPU | [EfficientNet-B4.tflite](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.tflite) |
45
- | EfficientNet-B4 | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN | 4.201 ms | 1 - 13 MB | NPU | Use Export Script |
46
- | EfficientNet-B4 | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 3.319 ms | 0 - 384 MB | NPU | [EfficientNet-B4.tflite](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.tflite) |
47
- | EfficientNet-B4 | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 3.187 ms | 0 - 18 MB | NPU | Use Export Script |
48
- | EfficientNet-B4 | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 3.227 ms | 0 - 115 MB | NPU | [EfficientNet-B4.onnx](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.onnx) |
49
- | EfficientNet-B4 | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 2.448 ms | 0 - 80 MB | NPU | [EfficientNet-B4.tflite](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.tflite) |
50
- | EfficientNet-B4 | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 2.349 ms | 1 - 49 MB | NPU | Use Export Script |
51
- | EfficientNet-B4 | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 2.398 ms | 0 - 56 MB | NPU | [EfficientNet-B4.onnx](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.onnx) |
52
- | EfficientNet-B4 | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 2.288 ms | 0 - 65 MB | NPU | [EfficientNet-B4.tflite](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.tflite) |
53
- | EfficientNet-B4 | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN | 2.172 ms | 1 - 34 MB | NPU | Use Export Script |
54
- | EfficientNet-B4 | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 1.978 ms | 0 - 36 MB | NPU | [EfficientNet-B4.onnx](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.onnx) |
55
- | EfficientNet-B4 | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 3.447 ms | 1 - 1 MB | NPU | Use Export Script |
56
- | EfficientNet-B4 | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 3.41 ms | 45 - 45 MB | NPU | [EfficientNet-B4.onnx](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.onnx) |
57
- | EfficientNet-B4 | w8a16 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN | 6.573 ms | 0 - 10 MB | NPU | Use Export Script |
58
- | EfficientNet-B4 | w8a16 | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN | 4.346 ms | 0 - 62 MB | NPU | Use Export Script |
59
- | EfficientNet-B4 | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 3.425 ms | 0 - 4 MB | NPU | Use Export Script |
60
- | EfficientNet-B4 | w8a16 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN | 3.859 ms | 0 - 15 MB | NPU | Use Export Script |
61
- | EfficientNet-B4 | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | QNN | 11.756 ms | 0 - 12 MB | NPU | Use Export Script |
62
- | EfficientNet-B4 | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 3.424 ms | 0 - 18 MB | NPU | Use Export Script |
63
- | EfficientNet-B4 | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 3.699 ms | 0 - 37 MB | NPU | [EfficientNet-B4.onnx](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4_w8a16.onnx) |
64
- | EfficientNet-B4 | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 2.287 ms | 0 - 70 MB | NPU | Use Export Script |
65
- | EfficientNet-B4 | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 2.402 ms | 0 - 88 MB | NPU | [EfficientNet-B4.onnx](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4_w8a16.onnx) |
66
- | EfficientNet-B4 | w8a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN | 1.945 ms | 0 - 53 MB | NPU | Use Export Script |
67
- | EfficientNet-B4 | w8a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 2.233 ms | 0 - 69 MB | NPU | [EfficientNet-B4.onnx](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4_w8a16.onnx) |
68
- | EfficientNet-B4 | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 3.726 ms | 0 - 0 MB | NPU | Use Export Script |
69
- | EfficientNet-B4 | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 3.816 ms | 27 - 27 MB | NPU | [EfficientNet-B4.onnx](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4_w8a16.onnx) |
70
 
71
 
72
 
@@ -130,7 +126,7 @@ Profiling Results
130
  EfficientNet-B4
131
  Device : cs_8275 (ANDROID 14)
132
  Runtime : TFLITE
133
- Estimated inference time (ms) : 64.8
134
  Estimated peak memory usage (MB): [0, 61]
135
  Total # Ops : 482
136
  Compute Unit(s) : npu (482 ops) gpu (0 ops) cpu (0 ops)
 
35
 
36
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
37
  |---|---|---|---|---|---|---|---|---|
38
+ | EfficientNet-B4 | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 12.221 ms | 0 - 61 MB | NPU | [EfficientNet-B4.tflite](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.tflite) |
39
+ | EfficientNet-B4 | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN | 11.894 ms | 1 - 10 MB | NPU | Use Export Script |
40
+ | EfficientNet-B4 | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 7.285 ms | 0 - 78 MB | NPU | [EfficientNet-B4.tflite](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.tflite) |
41
+ | EfficientNet-B4 | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN | 8.057 ms | 1 - 47 MB | NPU | Use Export Script |
42
+ | EfficientNet-B4 | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 3.329 ms | 0 - 386 MB | NPU | [EfficientNet-B4.tflite](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.tflite) |
43
+ | EfficientNet-B4 | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 3.16 ms | 1 - 3 MB | NPU | Use Export Script |
44
+ | EfficientNet-B4 | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 4.366 ms | 0 - 62 MB | NPU | [EfficientNet-B4.tflite](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.tflite) |
45
+ | EfficientNet-B4 | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN | 4.178 ms | 1 - 14 MB | NPU | Use Export Script |
46
+ | EfficientNet-B4 | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 3.315 ms | 0 - 385 MB | NPU | [EfficientNet-B4.tflite](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.tflite) |
47
+ | EfficientNet-B4 | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 3.197 ms | 0 - 25 MB | NPU | Use Export Script |
48
+ | EfficientNet-B4 | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 3.251 ms | 0 - 101 MB | NPU | [EfficientNet-B4.onnx](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.onnx) |
49
+ | EfficientNet-B4 | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 2.451 ms | 0 - 77 MB | NPU | [EfficientNet-B4.tflite](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.tflite) |
50
+ | EfficientNet-B4 | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 2.341 ms | 1 - 49 MB | NPU | Use Export Script |
51
+ | EfficientNet-B4 | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 2.416 ms | 0 - 54 MB | NPU | [EfficientNet-B4.onnx](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.onnx) |
52
+ | EfficientNet-B4 | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 1.936 ms | 0 - 64 MB | NPU | [EfficientNet-B4.tflite](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.tflite) |
53
+ | EfficientNet-B4 | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN | 2.171 ms | 1 - 35 MB | NPU | Use Export Script |
54
+ | EfficientNet-B4 | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 2.266 ms | 0 - 37 MB | NPU | [EfficientNet-B4.onnx](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.onnx) |
55
+ | EfficientNet-B4 | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 3.468 ms | 1 - 1 MB | NPU | Use Export Script |
56
+ | EfficientNet-B4 | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 3.462 ms | 45 - 45 MB | NPU | [EfficientNet-B4.onnx](https://huggingface.co/qualcomm/EfficientNet-B4/blob/main/EfficientNet-B4.onnx) |
57
+ | EfficientNet-B4 | w8a16 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN | 6.53 ms | 0 - 10 MB | NPU | Use Export Script |
58
+ | EfficientNet-B4 | w8a16 | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN | 4.433 ms | 0 - 61 MB | NPU | Use Export Script |
59
+ | EfficientNet-B4 | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 3.425 ms | 0 - 3 MB | NPU | Use Export Script |
60
+ | EfficientNet-B4 | w8a16 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN | 3.856 ms | 0 - 15 MB | NPU | Use Export Script |
61
+ | EfficientNet-B4 | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | QNN | 11.665 ms | 0 - 14 MB | NPU | Use Export Script |
62
+ | EfficientNet-B4 | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 3.423 ms | 0 - 18 MB | NPU | Use Export Script |
63
+ | EfficientNet-B4 | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 2.277 ms | 0 - 66 MB | NPU | Use Export Script |
64
+ | EfficientNet-B4 | w8a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN | 1.941 ms | 0 - 54 MB | NPU | Use Export Script |
65
+ | EfficientNet-B4 | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 3.762 ms | 0 - 0 MB | NPU | Use Export Script |
 
 
 
 
66
 
67
 
68
 
 
126
  EfficientNet-B4
127
  Device : cs_8275 (ANDROID 14)
128
  Runtime : TFLITE
129
+ Estimated inference time (ms) : 12.2
130
  Estimated peak memory usage (MB): [0, 61]
131
  Total # Ops : 482
132
  Compute Unit(s) : npu (482 ops) gpu (0 ops) cpu (0 ops)