v0.30.2
Browse filesSee https://github.com/quic/ai-hub-models/releases/v0.30.2 for changelog.
- ConvNext-Base.so +0 -3
- ConvNext-Base_w8a16.bin +0 -3
- ConvNext-Base.bin → ConvNext-Base_w8a16.onnx +2 -2
- ConvNext-Base_w8a16.so +0 -3
- README.md +30 -33
ConvNext-Base.so
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:9843a00a0c849a599dd8c75b211e5c16f3a872a7bdf1aeffb586e67e598849d4
|
| 3 |
-
size 355713864
|
|
|
|
|
|
|
|
|
|
|
|
ConvNext-Base_w8a16.bin
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:8973d44682c1d1c768a89cbe08d9bb81051ad91b0fdf13ac2822773e4b3b6286
|
| 3 |
-
size 94187328
|
|
|
|
|
|
|
|
|
|
|
|
ConvNext-Base.bin → ConvNext-Base_w8a16.onnx
RENAMED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:70839ab7aab1c72e34239f7967e369532403b9a0e9d5e250a89bbfc26228fe57
|
| 3 |
+
size 355196720
|
ConvNext-Base_w8a16.so
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:f2248545dbb1611020cbfbe6b50ac41ac6b66259016a7653f902daea9d815030
|
| 3 |
-
size 92155712
|
|
|
|
|
|
|
|
|
|
|
|
README.md
CHANGED
|
@@ -34,38 +34,35 @@ More details on model performance across various devices, can be found
|
|
| 34 |
|
| 35 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
| 36 |
|---|---|---|---|---|---|---|---|---|
|
| 37 |
-
| ConvNext-Base | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 41.
|
| 38 |
-
| ConvNext-Base | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN | 41.
|
| 39 |
-
| ConvNext-Base | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 19.
|
| 40 |
-
| ConvNext-Base | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN |
|
| 41 |
-
| ConvNext-Base | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 7.
|
| 42 |
-
| ConvNext-Base | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 8.
|
| 43 |
-
| ConvNext-Base | float |
|
| 44 |
-
| ConvNext-Base | float |
|
| 45 |
-
| ConvNext-Base | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile |
|
| 46 |
-
| ConvNext-Base | float | Samsung Galaxy
|
| 47 |
-
| ConvNext-Base | float | Samsung Galaxy
|
| 48 |
-
| ConvNext-Base | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile |
|
| 49 |
-
| ConvNext-Base | float |
|
| 50 |
-
| ConvNext-Base | float |
|
| 51 |
-
| ConvNext-Base | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile |
|
| 52 |
-
| ConvNext-Base | float | Snapdragon
|
| 53 |
-
| ConvNext-Base | float | Snapdragon
|
| 54 |
-
| ConvNext-Base |
|
| 55 |
-
| ConvNext-Base |
|
| 56 |
-
| ConvNext-Base | w8a16 |
|
| 57 |
-
| ConvNext-Base | w8a16 |
|
| 58 |
-
| ConvNext-Base | w8a16 |
|
| 59 |
-
| ConvNext-Base | w8a16 |
|
| 60 |
-
| ConvNext-Base | w8a16 |
|
| 61 |
-
| ConvNext-Base | w8a16 | Samsung Galaxy
|
| 62 |
-
| ConvNext-Base | w8a16 |
|
| 63 |
-
| ConvNext-Base | w8a16 |
|
| 64 |
-
| ConvNext-Base | w8a16 |
|
| 65 |
-
| ConvNext-Base | w8a16 | Snapdragon
|
| 66 |
-
| ConvNext-Base | w8a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 180.001 ms | 695 - 1260 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base_w8a16.onnx) |
|
| 67 |
-
| ConvNext-Base | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 6.253 ms | 0 - 0 MB | NPU | Use Export Script |
|
| 68 |
-
| ConvNext-Base | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 231.81 ms | 924 - 924 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base_w8a16.onnx) |
|
| 69 |
|
| 70 |
|
| 71 |
|
|
@@ -130,7 +127,7 @@ ConvNext-Base
|
|
| 130 |
Device : cs_8275 (ANDROID 14)
|
| 131 |
Runtime : TFLITE
|
| 132 |
Estimated inference time (ms) : 41.8
|
| 133 |
-
Estimated peak memory usage (MB): [0,
|
| 134 |
Total # Ops : 598
|
| 135 |
Compute Unit(s) : npu (598 ops) gpu (0 ops) cpu (0 ops)
|
| 136 |
```
|
|
|
|
| 34 |
|
| 35 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
| 36 |
|---|---|---|---|---|---|---|---|---|
|
| 37 |
+
| ConvNext-Base | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 41.798 ms | 0 - 258 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
|
| 38 |
+
| ConvNext-Base | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN | 41.66 ms | 0 - 9 MB | NPU | Use Export Script |
|
| 39 |
+
| ConvNext-Base | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 19.208 ms | 0 - 272 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
|
| 40 |
+
| ConvNext-Base | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN | 23.154 ms | 0 - 275 MB | NPU | Use Export Script |
|
| 41 |
+
| ConvNext-Base | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 7.537 ms | 0 - 19 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
|
| 42 |
+
| ConvNext-Base | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 8.067 ms | 1 - 4 MB | NPU | Use Export Script |
|
| 43 |
+
| ConvNext-Base | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 7.52 ms | 0 - 12 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
|
| 44 |
+
| ConvNext-Base | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 8.047 ms | 0 - 18 MB | NPU | Use Export Script |
|
| 45 |
+
| ConvNext-Base | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 7.866 ms | 0 - 416 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.onnx) |
|
| 46 |
+
| ConvNext-Base | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 5.561 ms | 0 - 269 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
|
| 47 |
+
| ConvNext-Base | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 6.147 ms | 1 - 272 MB | NPU | Use Export Script |
|
| 48 |
+
| ConvNext-Base | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 6.082 ms | 1 - 283 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.onnx) |
|
| 49 |
+
| ConvNext-Base | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 5.041 ms | 0 - 262 MB | NPU | [ConvNext-Base.tflite](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.tflite) |
|
| 50 |
+
| ConvNext-Base | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN | 4.54 ms | 1 - 268 MB | NPU | Use Export Script |
|
| 51 |
+
| ConvNext-Base | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 4.677 ms | 0 - 267 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.onnx) |
|
| 52 |
+
| ConvNext-Base | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 8.17 ms | 1 - 1 MB | NPU | Use Export Script |
|
| 53 |
+
| ConvNext-Base | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 7.871 ms | 176 - 176 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base.onnx) |
|
| 54 |
+
| ConvNext-Base | w8a16 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN | 14.487 ms | 0 - 10 MB | NPU | Use Export Script |
|
| 55 |
+
| ConvNext-Base | w8a16 | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN | 10.081 ms | 0 - 130 MB | NPU | Use Export Script |
|
| 56 |
+
| ConvNext-Base | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 5.914 ms | 0 - 3 MB | NPU | Use Export Script |
|
| 57 |
+
| ConvNext-Base | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | QNN | 39.288 ms | 0 - 14 MB | NPU | Use Export Script |
|
| 58 |
+
| ConvNext-Base | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 5.9 ms | 0 - 30 MB | NPU | Use Export Script |
|
| 59 |
+
| ConvNext-Base | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 244.602 ms | 575 - 965 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base_w8a16.onnx) |
|
| 60 |
+
| ConvNext-Base | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 4.186 ms | 0 - 128 MB | NPU | Use Export Script |
|
| 61 |
+
| ConvNext-Base | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 182.381 ms | 685 - 1282 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base_w8a16.onnx) |
|
| 62 |
+
| ConvNext-Base | w8a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN | 3.087 ms | 0 - 122 MB | NPU | Use Export Script |
|
| 63 |
+
| ConvNext-Base | w8a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 179.182 ms | 673 - 1239 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base_w8a16.onnx) |
|
| 64 |
+
| ConvNext-Base | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 6.284 ms | 0 - 0 MB | NPU | Use Export Script |
|
| 65 |
+
| ConvNext-Base | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 224.64 ms | 926 - 926 MB | NPU | [ConvNext-Base.onnx](https://huggingface.co/qualcomm/ConvNext-Base/blob/main/ConvNext-Base_w8a16.onnx) |
|
|
|
|
|
|
|
|
|
|
| 66 |
|
| 67 |
|
| 68 |
|
|
|
|
| 127 |
Device : cs_8275 (ANDROID 14)
|
| 128 |
Runtime : TFLITE
|
| 129 |
Estimated inference time (ms) : 41.8
|
| 130 |
+
Estimated peak memory usage (MB): [0, 258]
|
| 131 |
Total # Ops : 598
|
| 132 |
Compute Unit(s) : npu (598 ops) gpu (0 ops) cpu (0 ops)
|
| 133 |
```
|