Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -38,9 +38,9 @@ More details on model performance across various devices, can be found
|
|
| 38 |
|
| 39 |
| Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
|
| 40 |
| ---|---|---|---|---|---|---|---|
|
| 41 |
-
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary | 11.
|
| 42 |
-
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary |
|
| 43 |
-
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary |
|
| 44 |
|
| 45 |
|
| 46 |
|
|
@@ -103,22 +103,22 @@ python -m qai_hub_models.models.stable_diffusion_v2_1_quantized.export
|
|
| 103 |
Profile Job summary of TextEncoder_Quantized
|
| 104 |
--------------------------------------------------
|
| 105 |
Device: Snapdragon X Elite CRD (11)
|
| 106 |
-
Estimated Inference Time: 11.
|
| 107 |
-
Estimated Peak Memory Range: 0.
|
| 108 |
Compute Units: NPU (1040) | Total (1040)
|
| 109 |
|
| 110 |
Profile Job summary of VAEDecoder_Quantized
|
| 111 |
--------------------------------------------------
|
| 112 |
Device: Snapdragon X Elite CRD (11)
|
| 113 |
-
Estimated Inference Time: 220.
|
| 114 |
-
Estimated Peak Memory Range: 0.
|
| 115 |
Compute Units: NPU (170) | Total (170)
|
| 116 |
|
| 117 |
Profile Job summary of UNet_Quantized
|
| 118 |
--------------------------------------------------
|
| 119 |
Device: Snapdragon X Elite CRD (11)
|
| 120 |
-
Estimated Inference Time: 102.
|
| 121 |
-
Estimated Peak Memory Range: 0.
|
| 122 |
Compute Units: NPU (6361) | Total (6361)
|
| 123 |
|
| 124 |
|
|
@@ -242,7 +242,7 @@ Explore all available models on [Qualcomm® AI Hub](https://aihub.qualcomm.com/)
|
|
| 242 |
* [Source Model Implementation](https://github.com/CompVis/stable-diffusion/tree/main)
|
| 243 |
|
| 244 |
## Community
|
| 245 |
-
* Join [our AI Hub Slack community](https://qualcomm
|
| 246 |
* For questions or feedback please [reach out to us](mailto:ai-hub-support@qti.qualcomm.com).
|
| 247 |
|
| 248 |
|
|
|
|
| 38 |
|
| 39 |
| Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
|
| 40 |
| ---|---|---|---|---|---|---|---|
|
| 41 |
+
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary | 11.633 ms | 0 - 1 MB | INT8 | NPU | [TextEncoder_Quantized.bin](https://huggingface.co/qualcomm/Stable-Diffusion-v2.1/blob/main/TextEncoder_Quantized.bin)
|
| 42 |
+
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary | 217.134 ms | 0 - 2 MB | INT8 | NPU | [VAEDecoder_Quantized.bin](https://huggingface.co/qualcomm/Stable-Diffusion-v2.1/blob/main/VAEDecoder_Quantized.bin)
|
| 43 |
+
| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary | 101.094 ms | 0 - 2 MB | INT8 | NPU | [UNet_Quantized.bin](https://huggingface.co/qualcomm/Stable-Diffusion-v2.1/blob/main/UNet_Quantized.bin)
|
| 44 |
|
| 45 |
|
| 46 |
|
|
|
|
| 103 |
Profile Job summary of TextEncoder_Quantized
|
| 104 |
--------------------------------------------------
|
| 105 |
Device: Snapdragon X Elite CRD (11)
|
| 106 |
+
Estimated Inference Time: 11.77 ms
|
| 107 |
+
Estimated Peak Memory Range: 0.01-0.01 MB
|
| 108 |
Compute Units: NPU (1040) | Total (1040)
|
| 109 |
|
| 110 |
Profile Job summary of VAEDecoder_Quantized
|
| 111 |
--------------------------------------------------
|
| 112 |
Device: Snapdragon X Elite CRD (11)
|
| 113 |
+
Estimated Inference Time: 220.18 ms
|
| 114 |
+
Estimated Peak Memory Range: 0.05-0.05 MB
|
| 115 |
Compute Units: NPU (170) | Total (170)
|
| 116 |
|
| 117 |
Profile Job summary of UNet_Quantized
|
| 118 |
--------------------------------------------------
|
| 119 |
Device: Snapdragon X Elite CRD (11)
|
| 120 |
+
Estimated Inference Time: 102.49 ms
|
| 121 |
+
Estimated Peak Memory Range: 0.19-0.19 MB
|
| 122 |
Compute Units: NPU (6361) | Total (6361)
|
| 123 |
|
| 124 |
|
|
|
|
| 242 |
* [Source Model Implementation](https://github.com/CompVis/stable-diffusion/tree/main)
|
| 243 |
|
| 244 |
## Community
|
| 245 |
+
* Join [our AI Hub Slack community](https://aihub.qualcomm.com/community/slack) to collaborate, post questions and learn more about on-device AI.
|
| 246 |
* For questions or feedback please [reach out to us](mailto:ai-hub-support@qti.qualcomm.com).
|
| 247 |
|
| 248 |
|