michaelyuanqwq/roboseg
Updated • 107 • 5
How to use michaelyuanqwq/roboengine-bg-diffusion with Diffusers:
pip install -U diffusers transformers accelerate
import torch
from diffusers import DiffusionPipeline
from diffusers.utils import load_image
# switch to "mps" for apple devices
pipe = DiffusionPipeline.from_pretrained("michaelyuanqwq/roboengine-bg-diffusion", dtype=torch.bfloat16, device_map="cuda")
prompt = "Turn this cat into a dog"
input_image = load_image("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/cat.png")
image = pipe(image=input_image, prompt=prompt).images[0]Chengbo Yuan*, Suraj Joshi*, Shaoting Zhu*, Hang Su, Hang Zhao, Yang Gao.
[Project Website] [Arxiv] [BibTex]
The BG-Diffusion checkpoints of "RoboEngine: Plug-and-Play Robot Data Augmentation with Semantic Robot Segmentation and Background Generation".
Please checkout https://github.com/michaelyuanqwq/roboengine for more details.
See the Github repo: https://github.com/michaelyuanqwq/roboengine.
@article{yuan2025roboengine,
title={RoboEngine: Plug-and-Play Robot Data Augmentation with Semantic Robot Segmentation and Background Generation},
author={Yuan, Chengbo and Joshi, Suraj and Zhu, Shaoting and Su, Hang and Zhao, Hang and Gao, Yang},
journal={arXiv preprint arXiv:2503.18738},
year={2025}
}