GeoRemover: Removing Objects and Their Causal Visual Artifacts
Paper • 2509.18538 • Published • 1
import torch
from diffusers import DiffusionPipeline
from diffusers.utils import load_image
# switch to "mps" for apple devices
pipe = DiffusionPipeline.from_pretrained("buxiangzhiren/GeoRemover", dtype=torch.bfloat16, device_map="cuda")
prompt = "Turn this cat into a dog"
input_image = load_image("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/cat.png")
image = pipe(image=input_image, prompt=prompt).images[0]Paper: GeoRemover: Removing Objects and Their Causal Visual Artifacts — https://arxiv.org/abs/2509.18538
Code: https://github.com/buxiangzhiren/GeoRemover
Authors: Zixin Zhu, Haoxiang Li, Xuelu Feng, He Wu, Chunming Qiao, Junsong Yuan
Object removal should delete both the target object and its causal visual artifacts (e.g., shadows, reflections). We propose a geometry-aware two-stage framework: (1) geometry removal on depth with strict mask-aligned supervision; (2) appearance rendering to RGB conditioned on the updated geometry. A preference-driven objective guides stage-1 to remove objects and their artifacts while avoiding new structural insertions.