sdxl-base-amdnpu/vae_decoder/dd/onnx_report.txt
DynamicDispatch Offload - not offloaded
+-------------------+-------+------------------------+------------------------+
| Op Type           | Count | Inputs                 | Outputs                |
+===================+=======+========================+========================+
| CastAvx           | 1     | [1,128,128,4] - FLOAT  | [1,128,128,4] -        |
|                   |       |                        | BFLOAT16               |
| CastAvx           | 1     | [1,1024,1024,3] -      | [1,1024,1024,3] -      |
|                   |       | BFLOAT16               | FLOAT                  |
| Transpose         | 1     | [1,4,128,128] - FLOAT  | [1,128,128,4] - FLOAT  |
| Transpose         | 1     | [1,1024,1024,3] -      | [1,3,1024,1024] -      |
|                   |       | FLOAT                  | FLOAT                  |
+-------------------+-------+------------------------+------------------------+
| Not offloaded sum | 4     |                        |                        |
+-------------------+-------+------------------------+------------------------+
DynamicDispatch Offload - offloaded
+--------------------+-------------+--------------------+---------------------+
| Op Type            | Count       | Inputs             | Outputs             |
+====================+=============+====================+=====================+
| DynamicDispatch    | 1           | [1,128,128,4] -    | [1,1024,1024,3] -   |
|                    |             | BFLOAT16           | BFLOAT16            |
+--------------------+-------------+--------------------+---------------------+
| Offloaded sum      | 1           |                    |                     |
+--------------------+-------------+--------------------+---------------------+
| Offloaded Op Types | SDAdd       |                    |                     |
|                    | SDConv      |                    |                     |
|                    | SDMHA_VAE   |                    |                     |
|                    | SDGroupNorm |                    |                     |
|                    | SDResize    |                    |                     |
|                    | SDGemm      |                    |                     |
+--------------------+-------------+--------------------+---------------------+
| Offloaded sum (dd  | 89          |                    |                     |
| fusion)            |             |                    |                     |
| Offload Ratio (dd  | 95.7%       |                    |                     |
| fusion)            |             |                    |                     |
+--------------------+-------------+--------------------+---------------------+
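
The report does not state how the offload ratio is computed. The figures above are consistent with dividing the fused offloaded op count (89) by that count plus the not-offloaded count (4). The sketch below reproduces the percentage under that assumption; the function name offload_ratio is hypothetical and is not part of any tool that generated this report.

```python
def offload_ratio(offloaded_ops: int, not_offloaded_ops: int) -> float:
    """Return the percentage of ops that were offloaded.

    Assumed formula (not confirmed by the report itself):
        100 * offloaded / (offloaded + not_offloaded)
    """
    total = offloaded_ops + not_offloaded_ops
    if total == 0:
        raise ValueError("no ops to compute a ratio from")
    return 100.0 * offloaded_ops / total


# Counts taken from the two tables above:
# 89 ops folded into the DynamicDispatch node, 4 ops left not offloaded.
ratio = offload_ratio(89, 4)
print(f"Offload Ratio (dd fusion): {ratio:.1f}%")  # prints 95.7%
```

The four ops that are not offloaded are the layout and dtype conversions (Transpose, CastAvx) that wrap the single fused DynamicDispatch node at the graph input and output.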