Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -36,8 +36,8 @@ More details on model performance across various devices, can be found
|
|
| 36 |
|
| 37 |
| Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
|
| 38 |
| ---|---|---|---|---|---|---|---|
|
| 39 |
-
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | TFLite | 4.
|
| 40 |
-
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Model Library |
|
| 41 |
|
| 42 |
|
| 43 |
## Installation
|
|
@@ -94,23 +94,6 @@ device. This script does the following:
|
|
| 94 |
python -m qai_hub_models.models.wideresnet50.export
|
| 95 |
```
|
| 96 |
|
| 97 |
-
```
|
| 98 |
-
Profile Job summary of WideResNet50
|
| 99 |
-
--------------------------------------------------
|
| 100 |
-
Device: Samsung Galaxy S24 (14)
|
| 101 |
-
Estimated Inference Time: 3.60 ms
|
| 102 |
-
Estimated Peak Memory Range: 0.02-91.57 MB
|
| 103 |
-
Compute Units: NPU (77) | Total (77)
|
| 104 |
-
|
| 105 |
-
Profile Job summary of WideResNet50
|
| 106 |
-
--------------------------------------------------
|
| 107 |
-
Device: Samsung Galaxy S24 (14)
|
| 108 |
-
Estimated Inference Time: 3.41 ms
|
| 109 |
-
Estimated Peak Memory Range: 0.59-51.28 MB
|
| 110 |
-
Compute Units: NPU (124) | Total (124)
|
| 111 |
-
|
| 112 |
-
|
| 113 |
-
```
|
| 114 |
## How does this work?
|
| 115 |
|
| 116 |
This [export script](https://github.com/quic/ai-hub-models/blob/main/qai_hub_models/models/WideResNet50/export.py)
|
|
|
|
| 36 |
|
| 37 |
| Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
|
| 38 |
| ---|---|---|---|---|---|---|---|
|
| 39 |
+
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | TFLite | 4.9 ms | 0 - 2 MB | FP16 | NPU | [WideResNet50.tflite](https://huggingface.co/qualcomm/WideResNet50/blob/main/WideResNet50.tflite)
|
| 40 |
+
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Model Library | 5.767 ms | 1 - 249 MB | FP16 | NPU | [WideResNet50.so](https://huggingface.co/qualcomm/WideResNet50/blob/main/WideResNet50.so)
|
| 41 |
|
| 42 |
|
| 43 |
## Installation
|
|
|
|
| 94 |
python -m qai_hub_models.models.wideresnet50.export
|
| 95 |
```
|
| 96 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 97 |
## How does this work?
|
| 98 |
|
| 99 |
This [export script](https://github.com/quic/ai-hub-models/blob/main/qai_hub_models/models/WideResNet50/export.py)
|