Diffusers
liushanyuan18 commited on
Commit
e46c4b3
·
verified ·
1 Parent(s): c86ae42

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -23,7 +23,7 @@ Official repo for paper ["Bridge Diffusion Model: bridge non-English language-na
23
 
24
  ## Method
25
  BDM entails the utilization of a backbone-branch network architecture akin to ControlNet[[7]](#7), model structure illustrated in the following
26
- <p align="center"><img src="docs\BDM_structure.png" alt= “BDM” width="400" height="300"></p>
27
  <p align="center">Fig.1 BDM model structure</p>
28
 
29
  The backbone part serves as a good diffusion initialization and will be frozen during training, which could be from any pretrained diffusion TTI model. We leverage Stable Diffusion 1.5 in current implementation. The branch part servers as language-native semantics injection module, whose parameters will be trained with language-native text-image pairs.
@@ -33,9 +33,9 @@ For model inference, language-native positive prompts as well as negative ones w
33
 
34
  ## Evaluation
35
  Here are several image generation illustrations for our BDM, with Chinese-native TTI capability and integrated with different English TTI communty techniques.
36
- <p align="center"><img src="docs\Chinese_concepts.png" alt= “Chinese_concepts” width="600" height="550"></p>
37
  <p align="center">Fig.2 Chinese unique concepts</p>
38
- <p align="center"><img src="docs\different_base_model.png" alt= “different_base_model” width="600" height="650"></p>
39
  <p align="center">Fig.3 Different English branch</p>
40
 
41
  For more illustrations and details, please refer to our paper ["Bridge Diffusion Model: bridge non-English language-native text-to-image diffusion model with English communities"](https://arxiv.org/abs/2309.00952)
 
23
 
24
  ## Method
25
  BDM entails the utilization of a backbone-branch network architecture akin to ControlNet[[7]](#7), model structure illustrated in the following
26
+ <p align="center"><img src="BDM_structure.png" alt= “BDM” width="400" height="300"></p>
27
  <p align="center">Fig.1 BDM model structure</p>
28
 
29
  The backbone part serves as a good diffusion initialization and will be frozen during training, which could be from any pretrained diffusion TTI model. We leverage Stable Diffusion 1.5 in current implementation. The branch part servers as language-native semantics injection module, whose parameters will be trained with language-native text-image pairs.
 
33
 
34
  ## Evaluation
35
  Here are several image generation illustrations for our BDM, with Chinese-native TTI capability and integrated with different English TTI communty techniques.
36
+ <p align="center"><img src="Chinese_concepts.png" alt= “Chinese_concepts” width="600" height="550"></p>
37
  <p align="center">Fig.2 Chinese unique concepts</p>
38
+ <p align="center"><img src="different_base_model.png" alt= “different_base_model” width="600" height="650"></p>
39
  <p align="center">Fig.3 Different English branch</p>
40
 
41
  For more illustrations and details, please refer to our paper ["Bridge Diffusion Model: bridge non-English language-native text-to-image diffusion model with English communities"](https://arxiv.org/abs/2309.00952)