anonymous-upload-neurips-2025

Upload 1784 files

e74a7f1 verified 10 months ago

12.9 kB

	Metadata-Version: 2.4
	Name: diffusers
	Version: 0.27.0.dev0
	Summary: State-of-the-art diffusion in PyTorch and JAX.
	Home-page: https://github.com/huggingface/diffusers
	Author: The Hugging Face team (past and future) with the help of all our contributors (https://github.com/huggingface/diffusers/graphs/contributors)
	Author-email: patrick@huggingface.co
	License: Apache 2.0 License
	Keywords: deep learning diffusion jax pytorch stable diffusion audioldm
	Classifier: Development Status :: 5 - Production/Stable
	Classifier: Intended Audience :: Developers
	Classifier: Intended Audience :: Education
	Classifier: Intended Audience :: Science/Research
	Classifier: License :: OSI Approved :: Apache Software License
	Classifier: Operating System :: OS Independent
	Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
	Classifier: Programming Language :: Python :: 3
	Classifier: Programming Language :: Python :: 3.8
	Classifier: Programming Language :: Python :: 3.9
	Classifier: Programming Language :: Python :: 3.10
	Requires-Python: >=3.8.0
	Description-Content-Type: text/markdown
	License-File: LICENSE
	Requires-Dist: importlib_metadata
	Requires-Dist: filelock
	Requires-Dist: huggingface-hub
	Requires-Dist: numpy
	Requires-Dist: regex!=2019.12.17
	Requires-Dist: requests
	Requires-Dist: safetensors>=0.3.1
	Requires-Dist: Pillow
	Provides-Extra: quality
	Requires-Dist: urllib3<=2.0.0; extra == "quality"
	Requires-Dist: isort>=5.5.4; extra == "quality"
	Requires-Dist: ruff==0.1.5; extra == "quality"
	Requires-Dist: hf-doc-builder>=0.3.0; extra == "quality"
	Provides-Extra: docs
	Requires-Dist: hf-doc-builder>=0.3.0; extra == "docs"
	Provides-Extra: training
	Requires-Dist: accelerate>=0.11.0; extra == "training"
	Requires-Dist: datasets; extra == "training"
	Requires-Dist: protobuf<4,>=3.20.3; extra == "training"
	Requires-Dist: tensorboard; extra == "training"
	Requires-Dist: Jinja2; extra == "training"
	Requires-Dist: peft>=0.6.0; extra == "training"
	Provides-Extra: test
	Requires-Dist: compel==0.1.8; extra == "test"
	Requires-Dist: GitPython<3.1.19; extra == "test"
	Requires-Dist: datasets; extra == "test"
	Requires-Dist: Jinja2; extra == "test"
	Requires-Dist: invisible-watermark>=0.2.0; extra == "test"
	Requires-Dist: k-diffusion>=0.0.12; extra == "test"
	Requires-Dist: librosa; extra == "test"
	Requires-Dist: parameterized; extra == "test"
	Requires-Dist: pytest; extra == "test"
	Requires-Dist: pytest-timeout; extra == "test"
	Requires-Dist: pytest-xdist; extra == "test"
	Requires-Dist: requests-mock==1.10.0; extra == "test"
	Requires-Dist: safetensors>=0.3.1; extra == "test"
	Requires-Dist: sentencepiece!=0.1.92,>=0.1.91; extra == "test"
	Requires-Dist: scipy; extra == "test"
	Requires-Dist: torchvision; extra == "test"
	Requires-Dist: transformers>=4.25.1; extra == "test"
	Provides-Extra: torch
	Requires-Dist: torch>=1.4; extra == "torch"
	Requires-Dist: accelerate>=0.11.0; extra == "torch"
	Provides-Extra: flax
	Requires-Dist: jax>=0.4.1; extra == "flax"
	Requires-Dist: jaxlib>=0.4.1; extra == "flax"
	Requires-Dist: flax>=0.4.1; extra == "flax"
	Provides-Extra: dev
	Requires-Dist: urllib3<=2.0.0; extra == "dev"
	Requires-Dist: isort>=5.5.4; extra == "dev"
	Requires-Dist: ruff==0.1.5; extra == "dev"
	Requires-Dist: hf-doc-builder>=0.3.0; extra == "dev"
	Requires-Dist: compel==0.1.8; extra == "dev"
	Requires-Dist: GitPython<3.1.19; extra == "dev"
	Requires-Dist: datasets; extra == "dev"
	Requires-Dist: Jinja2; extra == "dev"
	Requires-Dist: invisible-watermark>=0.2.0; extra == "dev"
	Requires-Dist: k-diffusion>=0.0.12; extra == "dev"
	Requires-Dist: librosa; extra == "dev"
	Requires-Dist: parameterized; extra == "dev"
	Requires-Dist: pytest; extra == "dev"
	Requires-Dist: pytest-timeout; extra == "dev"
	Requires-Dist: pytest-xdist; extra == "dev"
	Requires-Dist: requests-mock==1.10.0; extra == "dev"
	Requires-Dist: safetensors>=0.3.1; extra == "dev"
	Requires-Dist: sentencepiece!=0.1.92,>=0.1.91; extra == "dev"
	Requires-Dist: scipy; extra == "dev"
	Requires-Dist: torchvision; extra == "dev"
	Requires-Dist: transformers>=4.25.1; extra == "dev"
	Requires-Dist: accelerate>=0.11.0; extra == "dev"
	Requires-Dist: datasets; extra == "dev"
	Requires-Dist: protobuf<4,>=3.20.3; extra == "dev"
	Requires-Dist: tensorboard; extra == "dev"
	Requires-Dist: Jinja2; extra == "dev"
	Requires-Dist: peft>=0.6.0; extra == "dev"
	Requires-Dist: hf-doc-builder>=0.3.0; extra == "dev"
	Requires-Dist: torch>=1.4; extra == "dev"
	Requires-Dist: accelerate>=0.11.0; extra == "dev"
	Requires-Dist: jax>=0.4.1; extra == "dev"
	Requires-Dist: jaxlib>=0.4.1; extra == "dev"
	Requires-Dist: flax>=0.4.1; extra == "dev"
	Dynamic: author
	Dynamic: author-email
	Dynamic: classifier
	Dynamic: description
	Dynamic: description-content-type
	Dynamic: home-page
	Dynamic: keywords
	Dynamic: license
	Dynamic: license-file
	Dynamic: provides-extra
	Dynamic: requires-dist
	Dynamic: requires-python
	Dynamic: summary

	# BrushNet

	This repository contains the implementation of the paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

	Keywords: Image Inpainting, Diffusion Models, Image Generation

	> [Xuan Ju](https://github.com/juxuan27)<sup>12</sup>, [Xian Liu](https://alvinliu0.github.io/)<sup>12</sup>, [Xintao Wang](https://xinntao.github.io/)<sup>1</sup>, [Yuxuan Bian](https://scholar.google.com.hk/citations?user=HzemVzoAAAAJ&hl=zh-CN&oi=ao)<sup>2</sup>, [Ying Shan](https://www.linkedin.com/in/YingShanProfile/)<sup>1</sup>, [Qiang Xu](https://cure-lab.github.io/)<sup>2</sup><br>
	> <sup>1</sup>ARC Lab, Tencent PCG <sup>2</sup>The Chinese University of Hong Kong <sup>*</sup>Corresponding Author


	<p align="center">
	<a href="https://tencentarc.github.io/BrushNet/">🌐Project Page</a> \|
	<a href="https://arxiv.org/abs/2403.06976">📜Arxiv</a> \|
	<a href="https://forms.gle/9TgMZ8tm49UYsZ9s5">🗄️Data</a> \|
	<a href="https://drive.google.com/file/d/1IkEBWcd2Fui2WHcckap4QFPcCI0gkHBh/view">📹Video</a> \|
	<a href="https://huggingface.co/spaces/TencentARC/BrushNet">🤗Hugging Face Demo</a> \|
	</p>



	📖 Table of Contents


	- [🛠️ Method Overview](#️-method-overview)
	- [🚀 Getting Started](#-getting-started)
	- [Environment Requirement 🌍](#environment-requirement-)
	- [Data Download ⬇️](#data-download-️)
	- [🏃🏼 Running Scripts](#-running-scripts)
	- [Training 🤯](#training-)
	- [Inference 📜](#inference-)
	- [Evaluation 📏](#evaluation-)
	- [🤝🏼 Cite Us](#-cite-us)
	- [💖 Acknowledgement](#-acknowledgement)


	## TODO


	- [x] Release trainig and inference code
	- [x] Release checkpoint (sdv1.5)
	- [ ] Release checkpoint (sdxl)
	- [x] Release evaluation code
	- [x] Release gradio demo

	## 🛠️ Method Overview

	BrushNet is a diffusion-based text-guided image inpainting model that can be plug-and-play into any pre-trained diffusion model. Our architectural design incorporates two key insights: (1) dividing the masked image features and noisy latent reduces the model's learning load, and (2) leveraging dense per-pixel control over the entire pre-trained model enhances its suitability for image inpainting tasks. More analysis can be found in the main paper.

	![](examples/brushnet/src/model.png)



	## 🚀 Getting Started

	### Environment Requirement 🌍

	BrushNet has been implemented and tested on Pytorch 1.12.1 with python 3.9.

	Clone the repo:

	```
	git clone https://github.com/TencentARC/BrushNet.git
	```

	We recommend you first use `conda` to create virtual environment, and install `pytorch` following [official instructions](https://pytorch.org/). For example:


	```
	conda create -n diffusers python=3.9 -y
	conda activate diffusers
	python -m pip install --upgrade pip
	pip install torch==1.12.1+cu116 torchvision==0.13.1+cu116 torchaudio==0.12.1 --extra-index-url https://download.pytorch.org/whl/cu116
	```

	Then, you can install diffusers (implemented in this repo) with:

	```
	pip install -e .
	```

	After that, you can install required packages thourgh:

	```
	cd examples/brushnet/
	pip install -r requirements.txt
	```

	### Data Download ⬇️


	Dataset

	You can download the BrushData and BrushBench [here](https://forms.gle/9TgMZ8tm49UYsZ9s5) (as well as the EditBench we re-processed), which are used for training and testing the BrushNet. By downloading the data, you are agreeing to the terms and conditions of the license. The data structure should be like:

	```
	\|-- data
	\|-- BrushData
	\|-- 00200.tar
	\|-- 00201.tar
	\|-- ...
	\|-- BrushDench
	\|-- images
	\|-- mapping_file.json
	\|-- EditBench
	\|-- images
	\|-- mapping_file.json
	```


	Noted: We only provide a part of the BrushData due to the space limit. Please write an email to juxuan.27@gmail.com if you need the full dataset.


	Checkpoints

	Checkpoints of BrushNet can be downloaded from [here](https://drive.google.com/drive/folders/1fqmS1CEOvXCxNWFrsSYd_jHYXxrydh1n?usp=drive_link). The ckpt folder contains our pretrained checkpoints and pretrinaed Stable Diffusion checkpoint (e.g., realisticVisionV60B1_v51VAE from [Civitai](https://civitai.com/)). You can use `scripts/convert_original_stable_diffusion_to_diffusers.py` to process other models downloaded from Civitai. The data structure should be like:



	```
	\|-- data
	\|-- BrushData
	\|-- BrushDench
	\|-- EditBench
	\|-- ckpt
	\|-- realisticVisionV60B1_v51VAE
	\|-- model_index.json
	\|-- vae
	\|-- ...
	\|-- segmentation_mask_brushnet_ckpt
	\|-- random_mask_brushnet_ckpt
	\|-- ...
	```

	The checkpoint in `segmentation_mask_brushnet_ckpt` provides checkpoints trained on BrushData, which has segmentation prior (mask are with the same shape of objects). The `random_mask_brushnet_ckpt` provides a more general ckpt for random mask shape.

	## 🏃🏼 Running Scripts


	### Training 🤯

	You can train with segmentation mask using the script:

	```
	accelerate launch examples/brushnet/train_brushnet.py \
	--pretrained_model_name_or_path runwayml/stable-diffusion-v1-5 \
	--output_dir runs/logs/brushnet_segmentationmask \
	--train_data_dir data/BrushData \
	--resolution 512 \
	--learning_rate 1e-5 \
	--train_batch_size 2 \
	--tracker_project_name brushnet \
	--report_to tensorboard \
	--resume_from_checkpoint latest \
	--validation_steps 300
	```

	To use custom dataset, you can process your own data to the format of BrushData and revise `--train_data_dir`.

	You can train with random mask using the script (by adding `--random_mask`):

	```
	accelerate launch examples/brushnet/train_brushnet.py \
	--pretrained_model_name_or_path runwayml/stable-diffusion-v1-5 \
	--output_dir runs/logs/brushnet_randommask \
	--train_data_dir data/BrushData \
	--resolution 512 \
	--learning_rate 1e-5 \
	--train_batch_size 2 \
	--tracker_project_name brushnet \
	--report_to tensorboard \
	--resume_from_checkpoint latest \
	--validation_steps 300 \
	--random_mask
	```



	### Inference 📜

	You can inference with the script:

	```
	python examples/brushnet/test_brushnet.py
	```

	Since BrushNet is trained on Laion, it can only guarantee the performance on general scenarios. We recommend you train on your own data (e.g., product exhibition, virtual try-on) if you have high-quality industrial application requirements. We would also be appreciate if you would like to contribute your trained model!

	You can also inference through gradio demo:

	```
	python examples/brushnet/app_brushnet.py
	```


	### Evaluation 📏

	You can evaluate using the script:

	```
	python examples/brushnet/evaluate_brushnet.py \
	--brushnet_ckpt_path data/ckpt/segmentation_mask_brushnet_ckpt \
	--image_save_path runs/evaluation_result/BrushBench/brushnet_segmask/inside \
	--mapping_file data/BrushBench/mapping_file.json \
	--base_dir data/BrushBench \
	--mask_key inpainting_mask
	```

	The `--mask_key` indicates which kind of mask to use, `inpainting_mask` for inside inpainting and `outpainting_mask` for outside inpainting. The evaluation results (images and metrics) will be saved in `--image_save_path`.



	Noted that you need to ignore the nsfw detector in `src/diffusers/pipelines/brushnet/pipeline_brushnet.py#1261` to get the correct evaluation results. Moreover, we find different machine may generate different images, thus providing the results on our machine [here](https://drive.google.com/drive/folders/1dK3oIB2UvswlTtnIS1iHfx4s57MevWdZ?usp=sharing).


	## 🤝🏼 Cite Us

	```
	@misc{ju2024brushnet,
	title={BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion},
	author={Xuan Ju and Xian Liu and Xintao Wang and Yuxuan Bian and Ying Shan and Qiang Xu},
	year={2024},
	eprint={2403.06976},
	archivePrefix={arXiv},
	primaryClass={cs.CV}
	}
	```


	## 💖 Acknowledgement
	<span id="acknowledgement"></span>

	Our code is modified based on [diffusers](https://github.com/huggingface/diffusers), thanks to all the contributors!