Motif-Technologies
/

Motif-Video-2B

video-generation

diffusion-transformer

Model card Files Files and versions

Enable Flash Attention by trimming prompt embedding padding

#8

by gkalstn0 - opened Apr 21

base: refs/heads/main

←

from: refs/pr/8

Discussion Files changed

feat: trim prompt_embeds padding for batch=1 to enable Flash Attentiond1a46b18

feat: move prompt_embeds trim to __call__ for correct CFG alignmenteabf2fae

feat: skip encoder_attention_mask in guider when mask is None2134e9d0

chore: verify batch>1 preserves original attention_mask path (item 4)8ca961d2

chore: verify I2V compatibility with encoder_attention_mask=None (item 5)9717caf5

docs: add Flash Attention section to README4b01b740

fix: trim pos/neg independently instead of max alignmentb67e1aac

chore: remove debug prints and Flash Attention README section20f9f057

Motif Technologies org Apr 21

Summary

Trim prompt_embeds to actual token length (removing padding) for batch_size=1 inference
Pass attention_mask=None to transformer, allowing PyTorch SDPA to use Flash Attention backend
Positive and negative prompts trimmed independently (guider runs them in separate iterations)
batch_size>1 preserves original attention_mask path for variable-length prompt compatibility

Changes

pipeline_motif_video.py: encode_prompt() computes actual_seq_len, call() trims embeddings and drops mask
No transformer code changes needed (existing None-guard handles it)

Test plan

batch=1 with CFG: trim confirmed (pos 512->117, neg 512->113)
batch>1: mask path preserved (no trim)
Video output quality verified (720p 121f 50 steps)
I2V compatibility: transformer handles encoder_attention_mask=None safely

gkalstn0 changed pull request status to open Apr 21

gkalstn0 changed pull request status to merged Apr 21

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment