virtaava commited on
Commit
05fa259
·
verified ·
1 Parent(s): 1682189

Fix model card: actual ONNX input names are down_residual_*/mid_residual (not down_block_res_samples_*/mid_block_res_sample)

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -27,8 +27,8 @@ A single fused ONNX FP16 graph combining the SD 1.5 UNet with IP-Adapter image-c
27
  | `timestep` | `[batch]` | FP16 | scheduler timestep |
28
  | `encoder_hidden_states` | `[batch, 77, 768]` | FP16 | text embeds (e.g. from CLIP text encoder) |
29
  | `image_embeds` | `[batch, num_images, 1024]` | FP16 | **rank-3** per diffusers 0.27.2's `MultiIPAdapterImageProjection`. On-device path uses `num_images=1`. |
30
- | `down_block_res_samples_0..11` | 12 tensors | FP16 | ControlNet down-block residuals (canonical SD 1.5 shapes). Pass zeros for Phase-6-only inference. |
31
- | `mid_block_res_sample` | `[batch, 1280, 8, 8]` | FP16 | ControlNet mid-block residual. Pass zeros for Phase-6-only inference. |
32
 
33
  | Output | Shape | dtype |
34
  |---|---|---|
 
27
  | `timestep` | `[batch]` | FP16 | scheduler timestep |
28
  | `encoder_hidden_states` | `[batch, 77, 768]` | FP16 | text embeds (e.g. from CLIP text encoder) |
29
  | `image_embeds` | `[batch, num_images, 1024]` | FP16 | **rank-3** per diffusers 0.27.2's `MultiIPAdapterImageProjection`. On-device path uses `num_images=1`. |
30
+ | `down_residual_0..11` | 12 tensors | FP16 | ControlNet down-block residuals (canonical SD 1.5 shapes). Pass zeros for Phase-6-only inference. |
31
+ | `mid_residual` | `[batch, 1280, 8, 8]` | FP16 | ControlNet mid-block residual. Pass zeros for Phase-6-only inference. |
32
 
33
  | Output | Shape | dtype |
34
  |---|---|---|