The stated VRAM requirements on GitHub seem almost impossibly high for both versions of the model relative to the claimed parameter count.
#1 opened by DiffusionFanatic1
Is this perhaps related to the current hardcoded requirement for full-precision T5-XXL, or something similar?
Yep, it's due to the T5. You can just free the T5 after this line --
Later this week, we'll make a PR to have the T5 free itself (optionally) in order to reduce overall VRAM.