Update README.md
README.md CHANGED
@@ -5,9 +5,9 @@ tags: []
 
 # Scaling Down Text Encoders of Text-to-Image Diffusion Models
 
-Official Repository of the paper: *[Scaling Down Text Encoders of Text-to-Image Diffusion Models](https://github.com/LifuWang-66/
+Official Repository of the paper: *[Scaling Down Text Encoders of Text-to-Image Diffusion Models](https://github.com/LifuWang-66/DistillT5)*.
 
-Project Page: https://
+Project Page: https://github.com/LifuWang-66/DistillT5.git
 
 ## Model Descriptions:
 T5-Base distilled from [T5-XXL](https://huggingface.co/google/flan-t5-xxl) using [Flux](https://huggingface.co/runwayml/stable-diffusion-v1-5).
@@ -22,8 +22,8 @@ It is 50 times smaller and retains most capability of T5-XXL.
 ## Usage:
 1. Setup the environment:
 ```
-git clone https://github.com/LifuWang-66/
-cd
+git clone https://github.com/LifuWang-66/DistillT5.git
+cd DistillT5
 conda create -n scaling python=3.12
 conda activate scaling
 pip install -r requirements.txt
@@ -41,7 +41,7 @@ from diffusers import FluxPipeline
 
 
 pipe = FluxPipeline.from_pretrained("black-forest-labs/FLUX.1-dev", torch_dtype=torch.float16)
-text_encoder = T5EncoderWithProjection.from_pretrained('
+text_encoder = T5EncoderWithProjection.from_pretrained('LifuWang/DistillT5', torch_dtype=torch.float16)
 pipe.text_encoder_2 = text_encoder
 pipe = pipe.to('cuda')
 