KBlueLeaf/TIPO-200M


KBlueLeaf committed on Sep 29, 2024

Commit 1ae2820 · verified · 1 Parent(s): 32e5d0f

Update README.md

Files changed (1)

README.md +10 -3


@@ -13,7 +13,8 @@ library_name: transformers
 ---
 # TIPO: Text to Image with text presampling for Prompt Optimization
-200M LLaMA arch model trained for TIPO.
+200M LLaMA arch model trained for TIPO.<br>
+Tech Report: https://hackmd.io/@KBlueLeaf/BJULOQBR0
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/630593e2fca1d8d92b81d2a1/fc9ovmARapQmgq9DZ7ApJ.png)
@@ -22,9 +23,15 @@ library_name: transformers
 In this project, we introduce "TIPO" (**T**ext to **I**mage with text presampling for **P**rompt **O**ptimization), an innovative framework designed to significantly enhance the quality and usability of Text-to-Image (T2I) generative models. TIPO utilizes the Large Language Models (LLMs) to perform "Text Presampling" within the inference pipeline of text-to-image generative modeling. By refining and extending user input prompts, TIPO enables generative models to produce superior results with minimal user effort, making T2I systems more accessible and effective for a wider range of users.
 ## Usage
-Use updated version of DTG extension (renamed to z-tipo-ext), current version of z-tipo-ext support stable-diffusion-webui, stable-diffusion-webui-forge and ComfyUI. SD-Next haven't been tested.
-## Metric
+Use updated version of DTG extension (renamed to z-tipo-extension), current version of z-tipo-extension support stable-diffusion-webui, stable-diffusion-webui-forge and ComfyUI. SD-Next haven't been tested.
+https://github.com/KohakuBlueleaf/z-tipo-extension
+## Model arch and Training
+This model is LLaMA arch with 200M parameters, the training data is combined version of Danbooru2023, GBC10M and Coyo-HD-11M.<br>
+The total token seen is around 40B tokens.<br>
+For more information please refer to the tech report.
+### Evaluation
 We have tested TIPO in several metric:
 #### 1. Aesthetic Score (Higher is Better)