KBlueLeaf commited on
Commit
1ae2820
·
verified ·
1 Parent(s): 32e5d0f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -3
README.md CHANGED
@@ -13,7 +13,8 @@ library_name: transformers
13
  ---
14
  # TIPO: Text to Image with text presampling for Prompt Optimization
15
 
16
- 200M LLaMA arch model trained for TIPO.
 
17
 
18
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/630593e2fca1d8d92b81d2a1/fc9ovmARapQmgq9DZ7ApJ.png)
19
 
@@ -22,9 +23,15 @@ library_name: transformers
22
  In this project, we introduce "TIPO" (**T**ext to **I**mage with text presampling for **P**rompt **O**ptimization), an innovative framework designed to significantly enhance the quality and usability of Text-to-Image (T2I) generative models. TIPO utilizes the Large Language Models (LLMs) to perform "Text Presampling" within the inference pipeline of text-to-image generative modeling. By refining and extending user input prompts, TIPO enables generative models to produce superior results with minimal user effort, making T2I systems more accessible and effective for a wider range of users.
23
 
24
  ## Usage
25
- Use updated version of DTG extension (renamed to z-tipo-ext), current version of z-tipo-ext support stable-diffusion-webui, stable-diffusion-webui-forge and ComfyUI. SD-Next haven't been tested.
 
26
 
27
- ## Metric
 
 
 
 
 
28
  We have tested TIPO in several metric:
29
 
30
  #### 1. Aesthetic Score (Higher is Better)
 
13
  ---
14
  # TIPO: Text to Image with text presampling for Prompt Optimization
15
 
16
+ 200M LLaMA arch model trained for TIPO.<br>
17
+ Tech Report: https://hackmd.io/@KBlueLeaf/BJULOQBR0
18
 
19
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/630593e2fca1d8d92b81d2a1/fc9ovmARapQmgq9DZ7ApJ.png)
20
 
 
23
  In this project, we introduce "TIPO" (**T**ext to **I**mage with text presampling for **P**rompt **O**ptimization), an innovative framework designed to significantly enhance the quality and usability of Text-to-Image (T2I) generative models. TIPO utilizes the Large Language Models (LLMs) to perform "Text Presampling" within the inference pipeline of text-to-image generative modeling. By refining and extending user input prompts, TIPO enables generative models to produce superior results with minimal user effort, making T2I systems more accessible and effective for a wider range of users.
24
 
25
  ## Usage
26
+ Use updated version of DTG extension (renamed to z-tipo-extension), current version of z-tipo-extension support stable-diffusion-webui, stable-diffusion-webui-forge and ComfyUI. SD-Next haven't been tested.
27
+ https://github.com/KohakuBlueleaf/z-tipo-extension
28
 
29
+ ## Model arch and Training
30
+ This model is LLaMA arch with 200M parameters, the training data is combined version of Danbooru2023, GBC10M and Coyo-HD-11M.<br>
31
+ The total token seen is around 40B tokens.<br>
32
+ For more information please refer to the tech report.
33
+
34
+ ### Evaluation
35
  We have tested TIPO in several metric:
36
 
37
  #### 1. Aesthetic Score (Higher is Better)