MObI: Multimodal Object Inpainting Using Diffusion Models
Pretrained weights for MObI, a diffusion-based model for joint multimodal object inpainting across camera and lidar, conditioned on a single reference image and a 3D bounding box.
Paper: arXiv:2501.03173 · Code: github.com/alexbuburuzan/MObI · Venue: CVPR Workshop on Data-Driven Autonomous Driving Simulation (DDADS), 2025
MObI extends Paint-by-Example to:
- inpaint objects jointly across camera and lidar range views, and
- condition generation on a 3D bounding box in addition to a single reference image.

This combines the realism of reference-based inpainting with the controllability of 3D-aware methods.
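The 3D bounding box is what gives explicit control over where and how the object is placed. As a rough illustration of the geometry involved, the sketch below projects the eight corners of a box into a camera view; the frame conventions and function names are assumptions for illustration, not MObI's actual conditioning code.

```python
import numpy as np

def box_corners_3d(center, size, yaw):
    """8 corners of a 3D box (center xyz, size l/w/h, yaw about the up axis), in the ego frame."""
    l, w, h = size
    x = np.array([ 1,  1,  1,  1, -1, -1, -1, -1]) * l / 2
    y = np.array([ 1, -1, -1,  1,  1, -1, -1,  1]) * w / 2
    z = np.array([ 1,  1, -1, -1,  1,  1, -1, -1]) * h / 2
    c, s = np.cos(yaw), np.sin(yaw)
    R = np.array([[c, -s, 0], [s, c, 0], [0, 0, 1]])  # yaw rotation about the up axis
    return R @ np.stack([x, y, z]) + np.asarray(center).reshape(3, 1)  # (3, 8)

def project_to_camera(corners_ego, ego_to_cam, K):
    """Project ego-frame corners into pixels using a 4x4 extrinsic and 3x3 intrinsics."""
    homo = np.vstack([corners_ego, np.ones((1, corners_ego.shape[1]))])  # (4, 8) homogeneous
    cam = (ego_to_cam @ homo)[:3]                                        # (3, 8) camera frame
    uvw = K @ cam                                                        # pinhole projection
    return uvw[:2] / uvw[2:]                                             # (2, 8) pixel coordinates
```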
| File | Description |
|---|---|
| `mobi_nuscenes_epoch28.ckpt` | MObI trained on nuScenes |
| `autoencoders/range_autoencoder.ckpt` | Range-view VAE for lidar |
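A minimal sketch for inspecting the released checkpoints with PyTorch, assuming a Lightning-style layout where the weights sit under a `state_dict` key:

```python
import torch

# Load the released checkpoint on CPU (filename taken from the table above).
ckpt = torch.load("mobi_nuscenes_epoch28.ckpt", map_location="cpu")

# Lightning-style checkpoints usually keep the weights under "state_dict";
# fall back to the raw object if that key is absent.
state_dict = ckpt.get("state_dict", ckpt)
print(f"{len(state_dict)} tensors")
for name, tensor in list(state_dict.items())[:5]:
    print(name, tuple(tensor.shape))
```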
| Reference Type | FID ↓ | LPIPS ↓ | CLIP ↑ | D-LPIPS ↓ | I-LPIPS ↓ |
|---|---|---|---|---|---|
| id-ref | 6.503 | 0.114 | 84.9 | 0.130 | 0.147 |
| track-ref | 6.703 | 0.115 | 83.5 | 0.129 | 0.149 |
| in-domain-ref | 8.947 | 0.127 | 77.5 | 0.132 | 0.154 |
| cross-domain-ref | 9.046 | 0.130 | 76.0 | 0.132 | 0.153 |
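The perceptual scores above follow standard definitions; as an example, a minimal LPIPS computation with the `lpips` package is sketched below. The backbone, crops, and resolutions used for the table come from the repository's evaluation scripts, not from this sketch.

```python
import lpips
import torch

# Standard LPIPS with an AlexNet backbone; inputs are (N, 3, H, W) tensors in [-1, 1].
loss_fn = lpips.LPIPS(net="alex")

edited = torch.rand(1, 3, 256, 256) * 2 - 1      # inpainted camera crop (placeholder)
reference = torch.rand(1, 3, 256, 256) * 2 - 1   # ground-truth crop (placeholder)
print(loss_fn(edited, reference).item())
```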
See the GitHub repository for installation, data preprocessing, inference, and training instructions.
```bash
git clone https://github.com/alexbuburuzan/MObI.git
cd MObI
bash scripts/download_models.sh
bash scripts/realism_test_bench.sh
```
```bibtex
@InProceedings{Buburuzan_2025_CVPR,
  author    = {Buburuzan, Alexandru and Sharma, Anuj and Redford, John and Dokania, Puneet K. and Mueller, Romain},
  title     = {MObI: Multimodal Object Inpainting Using Diffusion Models},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
  month     = {June},
  year      = {2025},
  pages     = {1999-2009}
}
```
Released under CC BY-NC 4.0. Note that this work builds on Paint-by-Example and BEVFusion, which have their own licenses.