Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis
Paper • 2505.23325 • Published • 2
import torch
from diffusers import DiffusionPipeline
from diffusers.utils import load_image
# switch to "mps" for apple devices
pipe = DiffusionPipeline.from_pretrained("Kunbyte/DRA-Ctrl", dtype=torch.bfloat16, device_map="cuda")
prompt = "Turn this cat into a dog"
input_image = load_image("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/cat.png")
image = pipe(image=input_image, prompt=prompt).images[0]This repository contains the LoRA weights for DRA-Ctrl across 9 tasks. For instructions on how to use these weights, please refer to our GitHub repository and HuggingFace Space.
Base model
tencent/HunyuanVideo