DynamicDispatch Offload - not offloaded
+-------------------+-------+------------------------+------------------------+
| Op Type           | Count | Inputs                 | Outputs                |
+===================+=======+========================+========================+
| CastAvx           | 1     | [1,128,128,4] - FLOAT  | [1,128,128,4] -        |
|                   |       |                        | BFLOAT16               |
| CastAvx           | 1     | [1,1024,1024,3] -      | [1,1024,1024,3] -      |
|                   |       | BFLOAT16               | FLOAT                  |
| Transpose         | 1     | [1,4,128,128] - FLOAT  | [1,128,128,4] - FLOAT  |
| Transpose         | 1     | [1,1024,1024,3] -      | [1,3,1024,1024] -      |
|                   |       | FLOAT                  | FLOAT                  |
+-------------------+-------+------------------------+------------------------+
| Not offloaded sum | 4     |                        |                        |
+-------------------+-------+------------------------+------------------------+
DynamicDispatch Offload - offloaded
+--------------------+-------------+--------------------+---------------------+
| Op Type            | Count       | Inputs             | Outputs             |
+====================+=============+====================+=====================+
| DynamicDispatch    | 1           | [1,128,128,4] -    | [1,1024,1024,3] -   |
|                    |             | BFLOAT16           | BFLOAT16            |
+--------------------+-------------+--------------------+---------------------+
| Offloaded sum      | 1           |                    |                     |
+--------------------+-------------+--------------------+---------------------+
| Offloaded Op Types | SDAdd       |                    |                     |
|                    | SDConv      |                    |                     |
|                    | SDMHA_VAE   |                    |                     |
|                    | SDGroupNorm |                    |                     |
|                    | SDResize    |                    |                     |
|                    | SDGemm      |                    |                     |
+--------------------+-------------+--------------------+---------------------+
| Offloaded sum (dd  | 89          |                    |                     |
| fusion)            |             |                    |                     |
| Offload Ratio (dd  | 95.7%       |                    |                     |
| fusion)            |             |                    |                     |
+--------------------+-------------+--------------------+---------------------+
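
The reported offload ratio is consistent with dividing the fused offloaded count (89) by the total of offloaded and not-offloaded ops (89 + 4). The sketch below reproduces that arithmetic. The formula is an assumption inferred from the numbers in the report, not taken from the tool that produced it, and the function name `offload_ratio` is hypothetical.

```python
def offload_ratio(offloaded: int, not_offloaded: int) -> float:
    """Return the share of offloaded ops as a percentage.

    Assumed formula: offloaded / (offloaded + not_offloaded) * 100.
    This is inferred from the report's figures, not from the tool's source.
    """
    total = offloaded + not_offloaded
    if total == 0:
        raise ValueError("no ops counted, ratio is undefined")
    return 100.0 * offloaded / total


# Counts taken from the report: 89 offloaded (dd fusion), 4 not offloaded.
ratio = offload_ratio(offloaded=89, not_offloaded=4)
print(f"{ratio:.1f}%")  # prints "95.7%"
```

Under this assumption the four CPU-side ops (two CastAvx, two Transpose) account for the remaining 4.3%.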