Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -37,10 +37,10 @@ More details on model performance across various devices, can be found
|
|
| 37 |
|
| 38 |
| Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
|
| 39 |
| ---|---|---|---|---|---|---|---|
|
| 40 |
-
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary | 11.
|
| 41 |
-
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary |
|
| 42 |
-
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary |
|
| 43 |
-
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary |
|
| 44 |
|
| 45 |
|
| 46 |
## Installation
|
|
@@ -101,30 +101,30 @@ python -m qai_hub_models.models.controlnet_quantized.export
|
|
| 101 |
```
|
| 102 |
Profile Job summary of TextEncoder_Quantized
|
| 103 |
--------------------------------------------------
|
| 104 |
-
Device: Samsung Galaxy
|
| 105 |
-
Estimated Inference Time:
|
| 106 |
-
Estimated Peak Memory Range: 0.
|
| 107 |
Compute Units: NPU (570) | Total (570)
|
| 108 |
|
| 109 |
-
Profile Job summary of VAEDecoder_Quantized
|
| 110 |
-
--------------------------------------------------
|
| 111 |
-
Device: Samsung Galaxy S23 Ultra (13)
|
| 112 |
-
Estimated Inference Time: 386.75 ms
|
| 113 |
-
Estimated Peak Memory Range: 0.12-4.28 MB
|
| 114 |
-
Compute Units: NPU (409) | Total (409)
|
| 115 |
-
|
| 116 |
Profile Job summary of UNet_Quantized
|
| 117 |
--------------------------------------------------
|
| 118 |
-
Device: Samsung Galaxy
|
| 119 |
-
Estimated Inference Time:
|
| 120 |
-
Estimated Peak Memory Range:
|
| 121 |
Compute Units: NPU (5434) | Total (5434)
|
| 122 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 123 |
Profile Job summary of ControlNet_Quantized
|
| 124 |
--------------------------------------------------
|
| 125 |
-
Device: Samsung Galaxy
|
| 126 |
-
Estimated Inference Time:
|
| 127 |
-
Estimated Peak Memory Range: 0.
|
| 128 |
Compute Units: NPU (2406) | Total (2406)
|
| 129 |
|
| 130 |
|
|
|
|
| 37 |
|
| 38 |
| Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
|
| 39 |
| ---|---|---|---|---|---|---|---|
|
| 40 |
+
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary | 11.394 ms | 0 - 74 MB | UINT16 | NPU | [TextEncoder_Quantized.bin](https://huggingface.co/qualcomm/ControlNet/blob/main/TextEncoder_Quantized.bin)
|
| 41 |
+
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary | 262.52 ms | 11 - 17 MB | UINT16 | NPU | [UNet_Quantized.bin](https://huggingface.co/qualcomm/ControlNet/blob/main/UNet_Quantized.bin)
|
| 42 |
+
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary | 390.243 ms | 0 - 36 MB | UINT16 | NPU | [VAEDecoder_Quantized.bin](https://huggingface.co/qualcomm/ControlNet/blob/main/VAEDecoder_Quantized.bin)
|
| 43 |
+
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary | 100.33 ms | 2 - 68 MB | UINT16 | NPU | [ControlNet_Quantized.bin](https://huggingface.co/qualcomm/ControlNet/blob/main/ControlNet_Quantized.bin)
|
| 44 |
|
| 45 |
|
| 46 |
## Installation
|
|
|
|
| 101 |
```
|
| 102 |
Profile Job summary of TextEncoder_Quantized
|
| 103 |
--------------------------------------------------
|
| 104 |
+
Device: Samsung Galaxy S24 (14)
|
| 105 |
+
Estimated Inference Time: 8.08 ms
|
| 106 |
+
Estimated Peak Memory Range: 0.01-137.23 MB
|
| 107 |
Compute Units: NPU (570) | Total (570)
|
| 108 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 109 |
Profile Job summary of UNet_Quantized
|
| 110 |
--------------------------------------------------
|
| 111 |
+
Device: Samsung Galaxy S24 (14)
|
| 112 |
+
Estimated Inference Time: 192.79 ms
|
| 113 |
+
Estimated Peak Memory Range: 2.66-1246.59 MB
|
| 114 |
Compute Units: NPU (5434) | Total (5434)
|
| 115 |
|
| 116 |
+
Profile Job summary of VAEDecoder_Quantized
|
| 117 |
+
--------------------------------------------------
|
| 118 |
+
Device: Samsung Galaxy S24 (14)
|
| 119 |
+
Estimated Inference Time: 294.40 ms
|
| 120 |
+
Estimated Peak Memory Range: 0.20-88.33 MB
|
| 121 |
+
Compute Units: NPU (409) | Total (409)
|
| 122 |
+
|
| 123 |
Profile Job summary of ControlNet_Quantized
|
| 124 |
--------------------------------------------------
|
| 125 |
+
Device: Samsung Galaxy S24 (14)
|
| 126 |
+
Estimated Inference Time: 76.94 ms
|
| 127 |
+
Estimated Peak Memory Range: 0.00-532.61 MB
|
| 128 |
Compute Units: NPU (2406) | Total (2406)
|
| 129 |
|
| 130 |
|