12 GB VRAM fails to run

#2
by ZKong - opened

Inference takes too long. Can we set a maximum VRAM limit so it doesn't overflow into shared VRAM?
[screenshot: 2026-01-09_114924]

It always compiles, even when torch.compile(pipe.transformer) is disabled.
enable_model_cpu_offload still uses over 12 GB, so I have to use enable_sequential_cpu_offload instead. It is slower, but peak VRAM drops to about 3 GB. Very little VRAM used!
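For anyone hitting the same limit, the two offload modes described above can be toggled like this. This is a minimal sketch assuming a standard diffusers pipeline; "some/model-id" is a placeholder for the repo this discussion belongs to, and the prompt is illustrative:

```python
import torch
from diffusers import DiffusionPipeline

# "some/model-id" is a placeholder; substitute the actual model repo.
pipe = DiffusionPipeline.from_pretrained(
    "some/model-id", torch_dtype=torch.bfloat16
)

# Model-level offload: moves whole sub-models (text encoder, transformer,
# VAE) to the GPU one at a time. Faster, but peak VRAM is roughly the size
# of the largest sub-model, which can still exceed 12 GB.
pipe.enable_model_cpu_offload()

# Sequential offload: moves individual layers on demand. Much slower, but
# peak VRAM stays very low (a few GB), matching what worked here.
# pipe.enable_sequential_cpu_offload()

image = pipe("a test prompt").images[0]

# Report the actual peak GPU allocation after the run.
print(f"peak VRAM: {torch.cuda.max_memory_allocated() / 2**30:.2f} GiB")
```

Note the two calls are mutually exclusive; enable one or the other before running the pipeline.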

ZKong changed discussion status to closed
