English
File size: 2,191 Bytes
6b6c271
 
5ed275d
 
 
 
 
 
6b6c271
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5a77eed
6b6c271
 
 
 
 
 
 
5ed275d
 
 
 
 
6b6c271
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
---
license: mit
datasets:
- WensongSong/AnyInsertion
language:
- en
base_model:
- black-forest-labs/FLUX.1-Fill-dev
---
<h1 align="center">Insert Anything</h2>
<p align="center">
<a href="https://song-wensong.github.io/"><strong>Wensong Song</strong></a><a href="https://openreview.net/profile?id=~Hong_Jiang4"><strong>Hong Jinag</strong></a><a href="https://z-x-yang.github.io/"><strong>Zongxing Yang</strong></a><a href="https://scholar.google.com/citations?user=WKLRPsAAAAAJ&hl=en"><strong>Ruijie Quan</strong></a><a href="https://scholar.google.com/citations?user=RMSuNFwAAAAJ&hl=en"><strong>Yi Yang</strong></a>
<br>
<br>
<a href="https://arxiv.org/pdf/2504.15009" style="display: inline-block; margin-right: 10px;">
  <img src='https://img.shields.io/badge/arXiv-InsertAnything-red?color=%23aa1a1a' alt='Paper PDF'>
</a>
<a href='https://song-wensong.github.io/insert-anything/' style="display: inline-block; margin-right: 10px;">
  <img src='https://img.shields.io/badge/Project%20Page-InsertAnything-cyan?logoColor=%23FFD21E&color=%23cbe6f2' alt='Project Page'>
</a>
<a href='https://github.com/song-wensong/insert-anything' style="display: inline-block;">
  <img src='https://img.shields.io/badge/GitHub-InsertAnything-black?logoColor=23FFD21E&color=%231d2125'>
</a>
<br>
<b>Zhejiang University &nbsp; | &nbsp; Harvard University &nbsp; | &nbsp;  Nanyang Technological University </b>
</p>

## News
* **[2025.4.25]** Release **AnyInsertion** dataset on [HuggingFace](https://huggingface.co/datasets/WensongSong/AnyInsertion).
* **[2025.4.22]** Release inference & demo code on [GitHub](https://github.com/song-wensong/insert-anything), and mask-prompt pretrained checkpoint.

## Model Introduction
The currently released checkpoint is 20250321_steps5000_pytorch_lora_weights.safetensors, which is for mask-prompt image insertion. Future versions of the checkpoints will be released as updates.

## Citation
```
@article{song2025insert,
  title={Insert Anything: Image Insertion via In-Context Editing in DiT},
  author={Song, Wensong and Jiang, Hong and Yang, Zongxing and Quan, Ruijie and Yang, Yi},
  journal={arXiv preprint arXiv:2504.15009},
  year={2025}
}
```