How to use from the
Use from the
Diffusers library
pip install -U diffusers transformers accelerate
import torch
from diffusers import DiffusionPipeline
from diffusers.utils import load_image

# switch to "mps" for apple devices
pipe = DiffusionPipeline.from_pretrained("FlashStight/Mobile-VTON", dtype=torch.bfloat16, device_map="cuda")

prompt = "Turn this cat into a dog"
input_image = load_image("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/cat.png")

image = pipe(image=input_image, prompt=prompt).images[0]

Mobile-VTON: High-Fidelity On-Device Virtual Try-On

This is the official implementation of the paper Mobile-VTON: High-Fidelity On-Device Virtual Try-On

๐Ÿ“„ Paper: https://arxiv.org/abs/2603.00947
๐ŸŒ Project Page: https://zhenchenwan.github.io/Mobile-VTON/
๐Ÿ’ป Code: https://github.com/tmllab/2026_CVPR_Mobile-VTON

teaser 

Inference

Please refer to the official repository for inference scripts.

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Paper for FlashStight/Mobile-VTON