Mobile-VTON: High-Fidelity On-Device Virtual Try-On
Paper โข 2603.00947 โข Published
import torch
from diffusers import DiffusionPipeline
from diffusers.utils import load_image
# switch to "mps" for apple devices
pipe = DiffusionPipeline.from_pretrained("FlashStight/Mobile-VTON", dtype=torch.bfloat16, device_map="cuda")
prompt = "Turn this cat into a dog"
input_image = load_image("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/cat.png")
image = pipe(image=input_image, prompt=prompt).images[0]This is the official implementation of the paper Mobile-VTON: High-Fidelity On-Device Virtual Try-On
๐ Paper: https://arxiv.org/abs/2603.00947
๐ Project Page: https://zhenchenwan.github.io/Mobile-VTON/
๐ป Code: https://github.com/tmllab/2026_CVPR_Mobile-VTON
Please refer to the official repository for inference scripts.