How to use ramu0e/diffusion-latent-action-model-final with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("ramu0e/diffusion-latent-action-model-final", dtype=torch.bfloat16, device_map="cuda") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0]
7c2131c 92c2cdb 7c2131c 92c2cdb
1
2
3
4
5
6
7
8
{ "encoder_height": 224, "encoder_width": 304, "height": 480, "processor_class": "LAMProcessor", "vae_scale_factor": 8, "width": 640 }