qaihm-bot commited on
Commit
0d116a2
·
verified ·
1 Parent(s): 495b0c6

See https://github.com/quic/ai-hub-models/releases/v0.30.2 for changelog.

ConvNext-Base.so DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:9843a00a0c849a599dd8c75b211e5c16f3a872a7bdf1aeffb586e67e598849d4
3
- size 355713864
 
 
 
 
ConvNext-Base_w8a16.bin DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:8973d44682c1d1c768a89cbe08d9bb81051ad91b0fdf13ac2822773e4b3b6286
3
- size 94187328
 
 
 
 
ConvNext-Base.bin → ConvNext-Base_w8a16.onnx RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2ff05f68cf939a4ff7666c6c0af429576c4cca60459601d50d2d6699883a8576
3
- size 183873328
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:70839ab7aab1c72e34239f7967e369532403b9a0e9d5e250a89bbfc26228fe57
3
+ size 355196720
ConvNext-Base_w8a16.so DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:f2248545dbb1611020cbfbe6b50ac41ac6b66259016a7653f902daea9d815030
3
- size 92155712
 
 
 
 
README.md CHANGED
@@ -34,38 +34,35 @@ More details on model performance across various devices, can be found
34
 
35
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
36
  |---|---|---|---|---|---|---|---|---|
37
- | ConvNext-Base | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 41.836 ms | 0 - 259 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
38
- | ConvNext-Base | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN | 41.709 ms | 1 - 10 MB | NPU | Use Export Script |
39
- | ConvNext-Base | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 19.132 ms | 0 - 274 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
40
- | ConvNext-Base | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN | 22.92 ms | 0 - 277 MB | NPU | Use Export Script |
41
- | ConvNext-Base | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 7.451 ms | 0 - 22 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
42
- | ConvNext-Base | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 8.032 ms | 1 - 4 MB | NPU | Use Export Script |
43
- | ConvNext-Base | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 11.536 ms | 0 - 257 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
44
- | ConvNext-Base | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN | 11.899 ms | 1 - 11 MB | NPU | Use Export Script |
45
- | ConvNext-Base | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 7.468 ms | 0 - 21 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
46
- | ConvNext-Base | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 8.068 ms | 0 - 19 MB | NPU | Use Export Script |
47
- | ConvNext-Base | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 7.875 ms | 0 - 441 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.onnx) |
48
- | ConvNext-Base | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 5.563 ms | 0 - 269 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
49
- | ConvNext-Base | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 6.112 ms | 1 - 276 MB | NPU | Use Export Script |
50
- | ConvNext-Base | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 5.981 ms | 1 - 284 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.onnx) |
51
- | ConvNext-Base | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 5.05 ms | 0 - 262 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
52
- | ConvNext-Base | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN | 5.187 ms | 1 - 268 MB | NPU | Use Export Script |
53
- | ConvNext-Base | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 5.278 ms | 0 - 267 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.onnx) |
54
- | ConvNext-Base | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 8.118 ms | 1 - 1 MB | NPU | Use Export Script |
55
- | ConvNext-Base | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 7.874 ms | 176 - 176 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.onnx) |
56
- | ConvNext-Base | w8a16 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN | 14.534 ms | 0 - 10 MB | NPU | Use Export Script |
57
- | ConvNext-Base | w8a16 | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN | 10.036 ms | 0 - 129 MB | NPU | Use Export Script |
58
- | ConvNext-Base | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 5.94 ms | 0 - 3 MB | NPU | Use Export Script |
59
- | ConvNext-Base | w8a16 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN | 6.152 ms | 0 - 12 MB | NPU | Use Export Script |
60
- | ConvNext-Base | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | QNN | 40.077 ms | 0 - 15 MB | NPU | Use Export Script |
61
- | ConvNext-Base | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 5.918 ms | 0 - 30 MB | NPU | Use Export Script |
62
- | ConvNext-Base | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 243.99 ms | 573 - 976 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base_w8a16.onnx) |
63
- | ConvNext-Base | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 4.168 ms | 0 - 130 MB | NPU | Use Export Script |
64
- | ConvNext-Base | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 181.977 ms | 684 - 1280 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base_w8a16.onnx) |
65
- | ConvNext-Base | w8a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN | 3.611 ms | 0 - 122 MB | NPU | Use Export Script |
66
- | ConvNext-Base | w8a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 180.001 ms | 695 - 1260 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base_w8a16.onnx) |
67
- | ConvNext-Base | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 6.253 ms | 0 - 0 MB | NPU | Use Export Script |
68
- | ConvNext-Base | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 231.81 ms | 924 - 924 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base_w8a16.onnx) |
69
 
70
 
71
 
@@ -130,7 +127,7 @@ ConvNext-Base
130
  Device : cs_8275 (ANDROID 14)
131
  Runtime : TFLITE
132
  Estimated inference time (ms) : 41.8
133
- Estimated peak memory usage (MB): [0, 259]
134
  Total # Ops : 598
135
  Compute Unit(s) : npu (598 ops) gpu (0 ops) cpu (0 ops)
136
  ```
 
34
 
35
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
36
  |---|---|---|---|---|---|---|---|---|
37
+ | ConvNext-Base | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 41.798 ms | 0 - 258 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
38
+ | ConvNext-Base | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN | 41.66 ms | 0 - 9 MB | NPU | Use Export Script |
39
+ | ConvNext-Base | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 19.208 ms | 0 - 272 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
40
+ | ConvNext-Base | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN | 23.154 ms | 0 - 275 MB | NPU | Use Export Script |
41
+ | ConvNext-Base | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 7.537 ms | 0 - 19 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
42
+ | ConvNext-Base | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 8.067 ms | 1 - 4 MB | NPU | Use Export Script |
43
+ | ConvNext-Base | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 7.52 ms | 0 - 12 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
44
+ | ConvNext-Base | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 8.047 ms | 0 - 18 MB | NPU | Use Export Script |
45
+ | ConvNext-Base | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 7.866 ms | 0 - 416 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.onnx) |
46
+ | ConvNext-Base | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 5.561 ms | 0 - 269 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
47
+ | ConvNext-Base | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 6.147 ms | 1 - 272 MB | NPU | Use Export Script |
48
+ | ConvNext-Base | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 6.082 ms | 1 - 283 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.onnx) |
49
+ | ConvNext-Base | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 5.041 ms | 0 - 262 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
50
+ | ConvNext-Base | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN | 4.54 ms | 1 - 268 MB | NPU | Use Export Script |
51
+ | ConvNext-Base | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 4.677 ms | 0 - 267 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.onnx) |
52
+ | ConvNext-Base | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 8.17 ms | 1 - 1 MB | NPU | Use Export Script |
53
+ | ConvNext-Base | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 7.871 ms | 176 - 176 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.onnx) |
54
+ | ConvNext-Base | w8a16 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN | 14.487 ms | 0 - 10 MB | NPU | Use Export Script |
55
+ | ConvNext-Base | w8a16 | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN | 10.081 ms | 0 - 130 MB | NPU | Use Export Script |
56
+ | ConvNext-Base | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 5.914 ms | 0 - 3 MB | NPU | Use Export Script |
57
+ | ConvNext-Base | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | QNN | 39.288 ms | 0 - 14 MB | NPU | Use Export Script |
58
+ | ConvNext-Base | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 5.9 ms | 0 - 30 MB | NPU | Use Export Script |
59
+ | ConvNext-Base | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 244.602 ms | 575 - 965 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base_w8a16.onnx) |
60
+ | ConvNext-Base | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 4.186 ms | 0 - 128 MB | NPU | Use Export Script |
61
+ | ConvNext-Base | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 182.381 ms | 685 - 1282 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base_w8a16.onnx) |
62
+ | ConvNext-Base | w8a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN | 3.087 ms | 0 - 122 MB | NPU | Use Export Script |
63
+ | ConvNext-Base | w8a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 179.182 ms | 673 - 1239 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base_w8a16.onnx) |
64
+ | ConvNext-Base | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 6.284 ms | 0 - 0 MB | NPU | Use Export Script |
65
+ | ConvNext-Base | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 224.64 ms | 926 - 926 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base_w8a16.onnx) |
 
 
 
66
 
67
 
68
 
 
127
  Device : cs_8275 (ANDROID 14)
128
  Runtime : TFLITE
129
  Estimated inference time (ms) : 41.8
130
+ Estimated peak memory usage (MB): [0, 258]
131
  Total # Ops : 598
132
  Compute Unit(s) : npu (598 ops) gpu (0 ops) cpu (0 ops)
133
  ```