Upload README.md
Browse files
README.md
CHANGED
|
@@ -23,7 +23,7 @@ Official repo for paper ["Bridge Diffusion Model: bridge non-English language-na
|
|
| 23 |
|
| 24 |
## Method
|
| 25 |
BDM entails the utilization of a backbone-branch network architecture akin to ControlNet[[7]](#7), model structure illustrated in the following
|
| 26 |
-
<p align="center"><img src="
|
| 27 |
<p align="center">Fig.1 BDM model structure</p>
|
| 28 |
|
| 29 |
The backbone part serves as a good diffusion initialization and will be frozen during training, which could be from any pretrained diffusion TTI model. We leverage Stable Diffusion 1.5 in current implementation. The branch part servers as language-native semantics injection module, whose parameters will be trained with language-native text-image pairs.
|
|
@@ -33,9 +33,9 @@ For model inference, language-native positive prompts as well as negative ones w
|
|
| 33 |
|
| 34 |
## Evaluation
|
| 35 |
Here are several image generation illustrations for our BDM, with Chinese-native TTI capability and integrated with different English TTI communty techniques.
|
| 36 |
-
<p align="center"><img src="
|
| 37 |
<p align="center">Fig.2 Chinese unique concepts</p>
|
| 38 |
-
<p align="center"><img src="
|
| 39 |
<p align="center">Fig.3 Different English branch</p>
|
| 40 |
|
| 41 |
For more illustrations and details, please refer to our paper ["Bridge Diffusion Model: bridge non-English language-native text-to-image diffusion model with English communities"](https://arxiv.org/abs/2309.00952)
|
|
|
|
| 23 |
|
| 24 |
## Method
|
| 25 |
BDM entails the utilization of a backbone-branch network architecture akin to ControlNet[[7]](#7), model structure illustrated in the following
|
| 26 |
+
<p align="center"><img src="BDM_structure.png" alt= “BDM” width="400" height="300"></p>
|
| 27 |
<p align="center">Fig.1 BDM model structure</p>
|
| 28 |
|
| 29 |
The backbone part serves as a good diffusion initialization and will be frozen during training, which could be from any pretrained diffusion TTI model. We leverage Stable Diffusion 1.5 in current implementation. The branch part servers as language-native semantics injection module, whose parameters will be trained with language-native text-image pairs.
|
|
|
|
| 33 |
|
| 34 |
## Evaluation
|
| 35 |
Here are several image generation illustrations for our BDM, with Chinese-native TTI capability and integrated with different English TTI communty techniques.
|
| 36 |
+
<p align="center"><img src="Chinese_concepts.png" alt= “Chinese_concepts” width="600" height="550"></p>
|
| 37 |
<p align="center">Fig.2 Chinese unique concepts</p>
|
| 38 |
+
<p align="center"><img src="different_base_model.png" alt= “different_base_model” width="600" height="650"></p>
|
| 39 |
<p align="center">Fig.3 Different English branch</p>
|
| 40 |
|
| 41 |
For more illustrations and details, please refer to our paper ["Bridge Diffusion Model: bridge non-English language-native text-to-image diffusion model with English communities"](https://arxiv.org/abs/2309.00952)
|